Lead Site Reliability Engineer - Bilingual
Concentrix.com
Full Time
Job Title:
Lead Site Reliability Engineer - Bilingual
Job Description:
As a Lead Site Reliability Engineer, you’ll play a strategic role in shaping and scaling our DevSecOps ecosystem. You’ll lead the design and implementation of automated CI/CD pipelines, enforce enterprise-grade security and compliance standards, and drive reliability across the entire software delivery lifecycle. Partnering closely with development and operations teams, you’ll define best practices, optimize deployment workflows, and ensure our applications are resilient, observable, and continuously improving. Your expertise will be key to accelerating innovation while maintaining the highest levels of quality and performance.
Key Responsibilities:
- Stakeholder Management: Work with key technology stakeholders to deliver SRE strategies and capabilities that drive and support the digital transformation agenda.
- Architect and Optimize CI/CD Pipelines: Design and maintain cloud-native CI/CD workflows using tools like GitHub Actions, Jenkins, or ArgoCD. Automate build, test, and deployment processes for microservices across Kubernetes clusters and multi-cloud environments.
- Implement DevSecOps Practices: Integrate security into every stage of the pipeline by automating vulnerability scans, secrets management, and policy enforcement using tools like Snyk, HashiCorp Vault, and OPA.
- Ensure High Availability and Resilience: Build fault-tolerant systems using cloud-native patterns (e.g., self-healing, auto-scaling, blue/green deployments). Leverage Kubernetes, service meshes, and distributed tracing to maintain performance and uptime.
- Monitor, Alert, and Respond: Deploy observability stacks (Prometheus, Grafana, ELK, OpenTelemetry) to monitor system health. Define SLOs/SLIs, set up intelligent alerting, and lead incident response and postmortems.
- Manage Infrastructure as Code (IaC): Use Terraform and cloud vendor tools to provision and manage cloud resources. Maintain version-controlled infrastructure and enforce change management practices.
- Enforce Compliance and Governance: Ensure systems meet regulatory and organizational standards (e.g., SOC 2, HIPAA, ISO 27001). Automate audit trails and implement continuous compliance checks.
- Collaborate Across Engineering Teams: Partner with developers, QA, and platform engineers to embed reliability and security into the SDLC. Advocate for cloud-native best practices and drive adoption of scalable patterns.
- Mentor and Lead by Example: Guide junior engineers, conduct technical reviews, and foster a culture of ownership, automation, and continuous learning.
- Continuously Improve Systems and Processes: Identify performance bottlenecks, reduce toil through automation, and evolve infrastructure to support rapid innovation and growth.
Skills Required:
- Cloud Platforms: Strong expertise in AWS, Azure, or GCP services (e.g., EC2, S3, IAM, Lambda, AKS, GKE)
- Experience: 3-5 years of related experience
- Containers & Orchestration: Proficiency with Docker, Kubernetes, Helm, LGTM, Harbor
- CI/CD Tools: Experience with GitHub Actions, GitLab CI, Azure DevOps, or similar
- Infrastructure as Code: Skilled in Terraform, Pulumi, or CloudFormation
- Monitoring & Observability: Familiarity with Prometheus, Grafana, ELK stack, OpenTelemetry
- Security & Compliance: Knowledge of DevSecOps tools and practices (e.g., Vault, Snyk, OPA, CIS benchmarks)
- Programming/Scripting: Strong skills in Python, Go, Bash, or similar languages
- Version Control & Collaboration: Proficient in Git, GitOps workflows, and agile development practices
Location:
COL Work-at-Home
Language Requirements:
Time Type:
Full time
2026-02-10
If you are a California resident, by submitting your information, you acknowledge that you have read and have access to the Job Applicant Privacy Notice for California Residents.
