Job Description
Are you ready to architect the future of cloud computing? NexusScale Systems is looking for a Senior Cloud Infrastructure Engineer to join our high-impact SRE team. You will lead the design, implementation, and maintenance of our global multi-cloud infrastructure, ensuring 99.99% uptime for our mission-critical SaaS platforms.
We foster a culture of automation, rigorous testing, and continuous deployment. If you are passionate about building scalable, secure, and resilient distributed systems, we want to hear from you.
Responsibilities
- Design and manage multi-region AWS/GCP cloud environments using Terraform and Pulumi.
- Develop and maintain CI/CD pipelines to streamline deployment velocity.
- Implement proactive monitoring and observability strategies using Prometheus, Grafana, and ELK.
- Drive incident response and conduct blameless post-mortems to improve system reliability.
- Mentor junior engineers and promote best practices in Infrastructure-as-Code (IaC).
- Collaborate with cross-functional product teams to optimize system performance and cost.
- Manage container orchestration platforms with high-traffic Kubernetes clusters.
Qualifications
- 5+ years of experience in Cloud Engineering or Site Reliability Engineering.
- Expert-level proficiency with AWS or GCP services and IaC tooling (Terraform/CloudFormation).
- Strong coding skills in Python, Go, or Bash for automation and tooling.
- Deep understanding of Kubernetes architecture, networking, and service mesh.
- Experience with high-scale distributed systems and microservices architectures.
- Solid grasp of security best practices, IAM, and compliance standards (SOC2/GDPR).
- Excellent problem-solving skills and the ability to thrive in a fast-paced environment.