Job Summary:
We are looking for a skilled Operations Engineer with expertise in AWS infrastructure and Terraform to join our cloud operations team. You will be responsible for managing, automating, and optimizing our cloud infrastructure, ensuring high availability, scalability, and security of our production and development environments.
Responsibilities:
- Design, implement, and maintain AWS infrastructure as code (IaC) using Terraform.
- Manage CI/CD pipelines and automate operational tasks using tools like Jenkins, GitHub Actions, or CodePipeline.
- Monitor infrastructure health using tools such as CloudWatch, Prometheus, and Grafana, and handle alerting with PagerDuty or similar tools.
- Implement and maintain backup, disaster recovery, and high availability strategies in AWS.
- Manage VPCs, subnets, routing, security groups, and IAM roles and policies.
- Perform cost optimization and rightsizing of AWS resources.
- Ensure security compliance and apply cloud security best practices (e.g., encryption, access control).
- Collaborate with development and security teams to support application deployment and governance.
Required Skills & Experience:
- 3+ years of hands-on experience with AWS (EC2, S3, IAM, RDS, Lambda, EKS/ECS, VPC, etc.).
- 2+ years of experience with Terraform and a strong understanding of IaC principles.
- Hands-on experience with Linux system administration and scripting (Bash, Python).
- Experience with DevOps tools such as Git, Docker, Jenkins, or similar.
- Proficiency in monitoring/logging tools like CloudWatch, ELK stack, Datadog, or New Relic.
- Familiarity with incident management, change management, and postmortem analysis processes.
- Knowledge of networking, DNS, TLS/SSL, firewalls, and cloud security concepts.
Good to Have:
- Certification: AWS Certified SysOps Administrator, AWS Certified DevOps Engineer, or a HashiCorp Terraform certification.
- Experience with Kubernetes (EKS) and container orchestration.
- Exposure to GitOps practices and tools such as ArgoCD or FluxCD.
- Familiarity with Terraform Cloud, Terragrunt, or Pulumi.
- Experience working in Agile/Scrum environments and using Jira/Confluence.