Site Reliability Engineer With WCS Commerce
Neshent Tech
Springfield, MO
Posted On: Sep 16, 2025
Posted On: Sep 16, 2025
Job Overview
Salary
Depends on Experience
Required Skills
- SRE
- containerization
- cloud
- automation
- Prometheus
- Grafana
Job Description
Responsibilities
- Ensure system availability, define SLOs, SLIs, and SLAs, and implement SRE best practices.
- Automate deployments, monitoring, and incident responses using Terraform and Ansible.
- Use tools like Prometheus, Grafana, and ELK Stack to track system health and resolve issues proactively.
- Implement chaos engineering practices to test system resilience and improve reliability.
- Work closely with development and operations teams, ensuring continuous process improvement and adoption of best practices.
- Integrate security best practices and collaborate with the security team to ensure compliance.
Required Skills
- 5+ years of IT experience with expertise in SRE or a similar role.
- Strong knowledge of containerization (Docker, Kubernetes), cloud platforms (Azure, Google Cloud), and automation tools (Terraform, Ansible).
- Experience with monitoring tools like Prometheus, Grafana, and ELK Stack.
- Understanding of chaos engineering and system resilience principles.
- Bachelor’s degree in Computer Science, IT, or related field.
Job ID: NT250291