Senior DevOps Engineer
$100,000 USD/year Pay is set based on global value, not the local market. Most roles = hourly rate x 40 hrs x 50 weeks 

Worldwide
Fully-remote
full-time (40 hrs/week)
Flexible schedule
Long-term role

Senior DevOps Engineer   $100,000 USD/year

Description

You're the engineer who maintains uptime for 50+ SaaS products when no one else has the answers. We need DevOps engineers capable of diving into unknown AWS environments, restoring order from instability, and driving availability beyond 99.9% through disciplined monitoring, automation, and root cause analysis. You'll break down complex projects into one-day increments, deliver production-ready Python or JavaScript, and leverage AI as an accelerator.

Many organizations tout "cloud-native" credentials while manually nursing individual servers. We're building industrial-grade reliability across a portfolio of acquired products where original engineers have departed and documentation is incomplete. The challenge: you'll employ agents and contemporary tooling to explore unfamiliar systems 5–10x faster, document your findings, and automate solutions so recurring failures are eliminated. Rather than evaluating certifications and vendor logos, we'll observe how you troubleshoot in real time, author a rigorous 5-Whys that identifies one actionable root cause, and construct automations that endure in production environments.

This is not a tier-two "execute the runbook" position. Here, you author the runbooks, architect the deployment path from development through staging to 10% and full rollout with soak periods and rollback criteria, and implement monitoring that captures outlier scenarios. You reject risky changes before deployment. You distinguish infrastructure issues under your ownership from application bugs owned by Engineering, and you route permanent remediation to the appropriate team.

You'll operate at the engineering center of reliability, taking ownership of infrastructure initiatives, incident triage with RCAs, and change execution backed by copy-paste-ready runbooks. If you've already managed a significant SaaS platform and want to apply that rigor across an entire fleet, join us. Bring advanced AWS proficiency, production-quality coding skills, disciplined scope management, and daily, mission-critical use of AI tooling. If you're prepared to sustain operational excellence, please apply.

What you will be doing

  • Executing sophisticated infrastructure migrations, consolidations, production-grade automation, and monitoring enhancements
  • Responding to production incidents, deploying immediate remediation, and authoring root cause analyses with permanent fixes routed to accountable teams
  • Drafting, reviewing, and deploying production changes, including safety validation of proposed modifications before execution

What you will NOT be doing

  • Spending your days in Jira and recurring status calls—we reward people who deliver solutions, not those who merely document issues
  • Preserving legacy systems forever—you'll be authorized to implement substantive upgrades
  • Waiting on layered approval bureaucracies—you'll hold the mandate to deploy urgent fixes during active incidents

Key responsibilities

  • Advance reliability and standardization of cloud infrastructure across our expanding product portfolio by deploying comprehensive monitoring, automation, and AWS best practices.

Candidate requirements

  • Deep AWS infrastructure expertise (this is our primary platform—other cloud experience alone won't cut it)
  • Experience managing production infrastructure at a scale of 1,000+ containers
  • Experience scripting with Python and Bash for day-to-day administration operations
  • Experience managing and migrating production databases with multiple engines (including MySql, Postgres, Oracle, MS-SQL)
  • Experience with infrastructure automation (Terraform, Ansible, or CloudFormation)

Meet a successful candidate

Watch Interview
Anonymous
Anonymous  |  Elite Coder
Lebanon

Have you ever made so much money you had to remain anonymous to protect yourself? How about being able to fix an impossible coding problem i...

Meet Anonymous

Applying for a role? Here’s what to expect.

Crossover's skill assessment process combines innovative AI power with decades of human research, to take the guesswork, human bias, and pointless filters out of recruiting high-performing teams.

Chat-style
screening interview.
STEP 1

Chat-style
screening interview.

Cognitive 
aptitude test.
STEP 2

Cognitive 
aptitude test.

Prove real-world 
job skills.
STEP 3

Prove real-world 
job skills.

Interview with the hiring manager.
STEP 4

Interview with the hiring manager.

Pass
proctored test.
STEP 5

Pass
proctored test.

Accept job offer.
STEP 6

Accept job offer.

Frequently asked questions

About the role

About Crossover

Meet some people who've landed similar jobs

Why Crossover

Recruitment sucks. So we’re fixing it.

The Olympics of work

The Olympics of work

It’s super hard to qualify—extreme quality standards ensure every single team member is at the top of their game.

Premium pay for premium talent

Premium pay for premium talent

Over 50% of new hires double or triple their previous pay. Why? Because that’s what the best person in the world is worth.

Shortlist by skills, not bias

Shortlist by skills, not bias

We don’t care where you went to school, what color your hair is, or whether we can pronounce your name. Just prove you’ve got the skills.

Crossover Logo White
Follow us on
Have a question?

Get answers to common questions using our smart chatbot Crosby.

HELP AND FAQs

Join the world's largest community of  AI first Remote WorkersAI-first remote workers.