We are looking for a highly skilled Azure Site Reliability Engineer (SRE) to join our team. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of cloud infrastructure and applications deployed on Azure and other platforms. This role requires strong hands-on experience with deployment automation, container orchestration, and DevOps tools.
Roles & Responsibilities
- Implement and manage CI/CD pipelines using GitHub Actions.
- Maintain, upgrade, and manage Artifactory repositories; ensure best practices in artifact management.
- Deploy, manage, and troubleshoot containerized applications using Kubernetes and Docker.
- Work with deployment tools like uDeploy and Harness for automated releases.
- Support multiple platforms, including OCP (OpenShift Container Platform), Azure, PCF (Pivotal Cloud Foundry), and other cloud environments.
- Develop automation scripts to improve reliability and efficiency.
- Collaborate with cross-functional teams and work independently when needed.
Required Skills
- Hands-on experience with Azure cloud services.
- Experience with CI/CD pipelines (GitHub Actions, Jenkins).
- Strong knowledge of Kubernetes, Docker, and containerized application deployment.
- Experience managing Artifactory repositories and builds.
- Familiarity with deployment automation tools like uDeploy or Harness.
Preferred Skills
- Python or Shell scripting for automation.
- Experience with multiple cloud platforms (OCP, PCF).
- Strong experience with troubleshooting, monitoring, and observability.