DevOps Engineer
$100,000 USD/year Pay is set based on global value, not the local market. Most roles = hourly rate x 40 hrs x 50 weeks 

Worldwide
Fully-remote
full-time (40 hrs/week)
Flexible schedule
Long-term role

DevOps Engineer   $100,000 USD/year

Description

You're the engineer who stabilizes production when everyone else is stuck guessing. We need DevOps professionals who can step into unfamiliar AWS environments, bring order to instability, and drive uptime beyond 99.9% through real monitoring, real automation, and real root cause analysis. You'll break down complex projects into executable one-day tasks, deliver production-quality Python or JavaScript, and leverage AI as your junior colleague.

Most organizations talk about "cloud native" while manually nursing infrastructure. We're building industrial-strength reliability across dozens of acquired SaaS products where original developers have left and documentation is incomplete. The challenge: you'll use agents and contemporary tooling to understand new systems 5–10x faster, document your findings in code, and automate solutions so recurring failures become impossible. Rather than evaluating you on certifications and vendor badges, we'll observe how you troubleshoot in real time, produce a genuine 5-Whys analysis that identifies one preventable root cause, and create automations that withstand production conditions.

This isn't an L2 "execute the playbook" position. Here, you author the playbooks, architect the deployment strategy from dev through staged to 10% to 100% with soak periods and rollback triggers, and create the monitoring that surfaces edge cases. You reject risky changes before they reach production. You distinguish infrastructure failures you're accountable for from application bugs Engineering owns, and you route permanent remediation to the correct team.

You'll operate at the engineering center of reliability, managing infrastructure initiatives, incident response with RCAs, and change requests accompanied by copy-paste-executable runbooks. If you've already owned a significant SaaS platform and want to extend that discipline across a fleet, join us. Bring expert-level AWS knowledge, production-grade development skills, ruthless scope discipline, and daily, critical application of AI tools. If you're prepared to keep the lights on, please apply.

What you will be doing

  • Leading complex infrastructure migrations, consolidations, production-grade automations, and monitoring enhancements
  • Triaging live production outages, deploying immediate remediations, and authoring root cause analyses with permanent fixes routed to responsible teams
  • Authoring, reviewing, and executing production changes, including validating the safety of proposed changes before execution

What you will NOT be doing

  • Drowning in Jira tickets and endless status meetings - we value engineers who can deliver solutions, not just document problems
  • Keeping legacy systems on life support forever - you'll be empowered to push meaningful improvements forward
  • Waiting for bureaucratic approval chains - you'll have the authority to implement immediate fixes during incidents

Key responsibilities

  • Drive reliability and standardization of cloud infrastructure across our growing product portfolio by implementing robust monitoring, automation, and AWS best practices.

Candidate requirements

  • Deep AWS infrastructure expertise (this is our primary platform - other cloud experience alone won't cut it)
  • Experience owning large production infrastructure and troubleshooting production outages independently (not just following a runbook)
  • Experience scripting with Python and Bash for day-to-day administration operations
  • Experience managing and migrating production databases with multiple engines (including MySql, Postgres, Oracle, MS-SQL)
  • Experience with infrastructure automation (Terraform, Ansible, or CloudFormation)
  • Linux systems administration expertise

Meet a successful candidate

Watch Interview
Anonymous
Anonymous  |  Elite Coder
Lebanon

Have you ever made so much money you had to remain anonymous to protect yourself? How about being able to fix an impossible coding problem i...

Meet Anonymous

Applying for a role? Here’s what to expect.

Crossover's skill assessment process combines innovative AI power with decades of human research, to take the guesswork, human bias, and pointless filters out of recruiting high-performing teams.

Chat-style
screening interview.
STEP 1

Chat-style
screening interview.

Cognitive 
aptitude test.
STEP 2

Cognitive 
aptitude test.

Prove real-world 
job skills.
STEP 3

Prove real-world 
job skills.

Interview with the hiring manager.
STEP 4

Interview with the hiring manager.

Pass
proctored test.
STEP 5

Pass
proctored test.

Accept job offer.
STEP 6

Accept job offer.

Frequently asked questions

About the role

About Crossover

Meet some people who've landed similar jobs

Why Crossover

Recruitment sucks. So we’re fixing it.

The Olympics of work

The Olympics of work

It’s super hard to qualify—extreme quality standards ensure every single team member is at the top of their game.

Premium pay for premium talent

Premium pay for premium talent

Over 50% of new hires double or triple their previous pay. Why? Because that’s what the best person in the world is worth.

Shortlist by skills, not bias

Shortlist by skills, not bias

We don’t care where you went to school, what color your hair is, or whether we can pronounce your name. Just prove you’ve got the skills.

Crossover Logo White
Follow us on
Have a question?

Get answers to common questions using our smart chatbot Crosby.

HELP AND FAQs

Join the world's largest community of AI first Remote WorkersAI-first remote workers.