You're the engineer who stabilizes production when everyone else is stuck guessing. We need DevOps professionals who can step into unfamiliar AWS environments, bring order to instability, and drive uptime beyond 99.9% through real monitoring, real automation, and real root cause analysis. You'll break down complex projects into executable one-day tasks, deliver production-quality Python or JavaScript, and leverage AI as your junior colleague.
Most organizations talk about "cloud native" while manually nursing infrastructure. We're building industrial-strength reliability across dozens of acquired SaaS products where original developers have left and documentation is incomplete. The challenge: you'll use agents and contemporary tooling to understand new systems 5–10x faster, document your findings in code, and automate solutions so recurring failures become impossible. Rather than evaluating you on certifications and vendor badges, we'll observe how you troubleshoot in real time, produce a genuine 5-Whys analysis that identifies one preventable root cause, and create automations that withstand production conditions.
This isn't an L2 "execute the playbook" position. Here, you author the playbooks, architect the deployment strategy from dev through staged to 10% to 100% with soak periods and rollback triggers, and create the monitoring that surfaces edge cases. You reject risky changes before they reach production. You distinguish infrastructure failures you're accountable for from application bugs Engineering owns, and you route permanent remediation to the correct team.
You'll operate at the engineering center of reliability, managing infrastructure initiatives, incident response with RCAs, and change requests accompanied by copy-paste-executable runbooks. If you've already owned a significant SaaS platform and want to extend that discipline across a fleet, join us. Bring expert-level AWS knowledge, production-grade development skills, ruthless scope discipline, and daily, critical application of AI tools. If you're prepared to keep the lights on, please apply.
Crossover's skill assessment process combines innovative AI power with decades of human research, to take the guesswork, human bias, and pointless filters out of recruiting high-performing teams.






It’s super hard to qualify—extreme quality standards ensure every single team member is at the top of their game.
Over 50% of new hires double or triple their previous pay. Why? Because that’s what the best person in the world is worth.
We don’t care where you went to school, what color your hair is, or whether we can pronounce your name. Just prove you’ve got the skills.
Join the world's largest community of AI-first remote workers.