Title: CloudOps Lead
We are seeking a CloudOps Lead to oversee and drive the management, optimization, and scaling of our cloud infrastructure. The ideal candidate will bring strong expertise in AWS, IaC, Security, migration and networking, along with proven experience leading cloud operations teams. You will play a key role in ensuring security, reliability, and cost-efficiency of our cloud environments while guiding best practices and mentoring team members.
Key Responsibilities
Cloud Infrastructure Leadership
- Lead the design, deployment, and optimization of AWS-based cloud infrastructure.
- Provide technical direction for cloud operations, ensuring scalable, secure, and cost-effective environments.
- Develop and maintain cloud architecture roadmaps aligned with business objectives.
Kubernetes & Container Management
- Oversee the management of Kubernetes clusters, ensuring availability, scalability, and security.
- Support deployments, monitoring, and troubleshooting of containerized applications.
Infrastructure as Code (IaC)
- Lead infrastructure provisioning and automation using Terraform.
- Establish and enforce standards for IaC practices across teams.
Networking & Security
- Manage and troubleshoot networking solutions (routing, firewalls, load balancing, VPNs).
- Ensure cloud operations adhere to security standards and compliance policies.
- Implement and monitor identity/access management, encryption, logging, and auditing controls.
Cloud Services & Applications
- Manage AWS services including EC2, S3, RDS, VPCs, and Route 53.
- Leverage AWS application services (Lambda, API Gateway, SQS, SNS, Kinesis).
- Support hybrid and multi-cloud integration and migration projects.
Operations & Optimization
- Monitor and report on system performance, reliability, and cost metrics.
- Drive continuous improvements in operational efficiency through automation and best practices.
- Ensure high availability of web hosting solutions.
Team Leadership & Collaboration
- Mentor and guide CloudOps engineers.
- Collaborate with development, security, and PMO teams to align cloud operations with organizational goals.
- Lead incident response and troubleshooting efforts.
Key Requirements
- 5+ years of hands-on experience in cloud operations, engineering, or architecture roles.
- Bachelor’s Degree in Computer Science, Information Technology, Engineering, or related fields.
- Strong expertise with AWS infrastructure and application services.
- Proficiency in Kubernetes (2+ years managing production clusters).
- Skilled in Terraform and equivalent for infrastructure as code.
- Strong networking background (routing, firewalls, VPN, load balancing).
- Experience with Active Directory, DNS, and Group Policy management.
- Proficiency in scripting (Bash, PowerShell, Windows scripting).
- Ability to read and troubleshoot Python, Java, or JavaScript.
- Experience with on-premises to cloud migration projects.
Preferred Skills
- AWS Solutions Architect or similar certification.
- Familiarity with CI/CD pipelines and DevOps workflows.
- Experience with monitoring/logging tools (Prometheus, Grafana, CloudWatch).
- Strong problem-solving and analytical mindset.
- Excellent communication and leadership skills.