Site Reliability Engineer (SRE) - TS/SCI Clearance
Location: Washington, D.C.
Type: Full-Time, Mid-Level
Department: IT
Our client is looking for a skilled Site Reliability Engineer (SRE) to join their team in Washington, D.C., playing a key role in enhancing observability, performance, and system reliability to support federal government operations. The successful candidate will drive continuous improvements, ensuring dependable and high-performance technology services.
About the Client
With over 20 years of commitment to building top-tier teams, our client believes exceptional technology services come from dedicated, skilled individuals. Their mission centers on investing in their people and delivering value-driven results to their clients.
Responsibilities
- System Monitoring: Oversee platform and containerized applications to maintain optimal performance and high availability.
- Risk Mitigation: Proactively identify and address performance and availability risks.
- Infrastructure Optimization: Contribute to developing and refining core platform functions for a resilient infrastructure.
- Collaborative Engagement: Work closely with both internal teams and customers for ongoing system enhancements.
Qualifications
- Experience: 8+ years in Site Reliability Engineering, with demonstrated expertise in building scalable, reliable systems.
- Technical Skills:
  - Kubernetes (K8s) expertise.
  - Strong understanding of DevSecOps, including experience with source code repositories and CI/CD tools such as Team Foundation Server/Azure DevOps, Bitbucket, and GitHub.
  - Proficiency in Infrastructure as Code (IaC), containerization, and CI/CD automation.
  - Familiarity with container orchestration platforms such as Rancher and OpenShift.
- Education: Bachelor's degree required.
- Security Clearance: Active TS/SCI clearance.
- Work Style: Able to work collaboratively and independently.
- Onsite Requirement: Available to work onsite in Washington, D.C. (JBAB) at least three days per week.
Salary Range: $180,000 - $200,000
Schedule: 3 days onsite, 2 days remote