You're the engineer who maintains uptime for 50+ SaaS products when no one else has the answers. We need DevOps engineers capable of diving into unknown AWS environments, restoring order from instability, and driving availability beyond 99.9% through disciplined monitoring, automation, and root cause analysis. You'll break down complex projects into one-day increments, deliver production-ready Python or JavaScript, and leverage AI as an accelerator.
Many organizations tout "cloud-native" credentials while manually nursing individual servers. We're building industrial-grade reliability across a portfolio of acquired products where original engineers have departed and documentation is incomplete. The challenge: you'll employ agents and contemporary tooling to explore unfamiliar systems 5–10x faster, document your findings, and automate solutions so recurring failures are eliminated. Rather than evaluating certifications and vendor logos, we'll observe how you troubleshoot in real time, author a rigorous 5-Whys that identifies one actionable root cause, and construct automations that endure in production environments.
This is not a tier-two "execute the runbook" position. Here, you author the runbooks, architect the deployment path from development through staging to 10% and full rollout with soak periods and rollback criteria, and implement monitoring that captures outlier scenarios. You reject risky changes before deployment. You distinguish infrastructure issues under your ownership from application bugs owned by Engineering, and you route permanent remediation to the appropriate team.
You'll operate at the engineering center of reliability, taking ownership of infrastructure initiatives, incident triage with RCAs, and change execution backed by copy-paste-ready runbooks. If you've already managed a significant SaaS platform and want to apply that rigor across an entire fleet, join us. Bring advanced AWS proficiency, production-quality coding skills, disciplined scope management, and daily, mission-critical use of AI tooling. If you're prepared to sustain operational excellence, please apply.
Crossover's skill assessment process combines innovative AI power with decades of human research, to take the guesswork, human bias, and pointless filters out of recruiting high-performing teams.






It’s super hard to qualify—extreme quality standards ensure every single team member is at the top of their game.
Over 50% of new hires double or triple their previous pay. Why? Because that’s what the best person in the world is worth.
We don’t care where you went to school, what color your hair is, or whether we can pronounce your name. Just prove you’ve got the skills.