Send this job to your inbox!
This role requires candidates who are currently authorized to work in the U.S. without sponsorship, and C2C arrangements are not accepted.
Overview:The Cloud Operations Engineer will be part of a high-performing IT infrastructure team in a fast-paced, cloud-first environment. This role is responsible for the day-to-day support, build, and architecture of cloud-based systems and services across all phases of the project and systems lifecycle. The engineer plays a key role in ensuring the reliability, scalability, and performance of enterprise-wide cloud infrastructure, applications, and services.
Key Responsibilities:
Participate in cloud architecture design discussions, reviewing business and technical use cases to ensure scalable and supportable solutions.
Support the build, configuration, and deployment of cloud-native and hybrid systems across multiple environments (development, staging, production).
Identify, implement, and improve cloud infrastructure standards with a focus on reliability, sustainability, and future scalability.
Evaluate and implement automated solutions to streamline infrastructure deployment and configuration, using infrastructure-as-code (IaC) practices.
Develop and maintain documentation for system architecture, configurations, procedures, and operations.
Integrate cloud operations with legacy systems and workflows, ensuring smooth transitions and interoperability.
Provide hands-on support and administration of cloud and on-prem infrastructure across a multi-vendor, multi-platform environment.
Manage vendor relationships and escalate support cases as needed to resolve service issues.
Mentor team members on best practices in system administration, security, and cloud operations.
Monitor infrastructure performance and participate in capacity planning and high availability initiatives.
Ensure successful data backups, test restorations, and maintain business continuity procedures.
Continuously stay current with emerging technologies and trends in cloud computing, DevOps, and automation.
Contribute to cross-functional technology projects and other duties as assigned.
Required Qualifications:
Bachelor’s degree in Computer Science, Information Systems, or related field; equivalent work experience accepted.
4+ years of experience administering Linux-based systems.
4+ years of experience managing Windows Server environments in multi-site, multi-domain networks.
Strong experience with virtualization technologies and provisioning compute/storage resources.
2+ years of experience designing, building, and supporting public cloud environments (IaaS, PaaS, SaaS).
Proficient with AWS and/or GCP services and architecture (e.g., EC2, IAM, VPC, Lambda, CloudFormation, etc.).
Experience with automation and configuration management tools such as Ansible, Terraform, or Puppet.
Solid scripting experience (e.g., PowerShell, Bash).
Working knowledge of container orchestration platforms such as Kubernetes.
Strong understanding of networking and firewalls, particularly next-gen technologies.
Familiarity with DevOps principles, CI/CD pipelines, and SDLC practices.
Hands-on experience with enterprise backup solutions such as Veeam.
AWS Certified Solutions Architect certification or equivalent cloud certification.
Preferred Qualifications:
Certifications in cloud platforms (e.g., GCP, Azure), Kubernetes (CKA), or Microsoft technologies (MCSE).
Experience integrating identity platforms like Okta.
Familiarity with network diagnostic tools (e.g., Wireshark, ExtraHop).
Background in managing large-scale, high-availability production environments through cloud providers.
Exposure to agile workflows and DevOps toolchains.
Skills & Competencies:
Strong analytical and troubleshooting skills with a proactive approach to system monitoring and problem resolution.
Solid understanding of Active Directory, Windows domain architecture, and permission models.
Working knowledge of SQL service configuration and performance tuning.
Practical knowledge of SAN/NAS technologies, provisioning models, and enterprise storage systems.
Effective verbal and written communication skills with the ability to document and present technical topics clearly.
Ability to work independently and collaboratively in a dynamic team environment, including occasional evenings/weekends for maintenance or emergencies.
Understanding of compliance standards and ability to support regulatory requirements across infrastructure systems.
Phone
Job Type
Remote Status
Get notified about new listings!
Can't find the job you want?
Submit a general applicationLoading Jobs...