AI Site Reliability Engineer
$100,000 USD/year Pay is set based on global value, not the local market. Most roles = hourly rate x 40 hrs x 50 weeks 

Fully-remote
Remote
full-time (40 hrs/week)
Flexible schedule
Long-term role

AI Site Reliability Engineer   $100,000 USD/year

Description

Conventional site reliability teams are known to struggle with manual monitoring, reactive incident response, and deployment processes that demand extensive human effort. AI offers a path forward, yet many organizations underutilize its capabilities, leading to suboptimal system performance and innovation roadblocks. Research indicates that 73% of organizations face deployment delays and operational downtime, largely due to legacy workflows and insufficient AI-driven automation.

IgniteTech is addressing these challenges directly by developing AI-first cloud solutions engineered to foresee and mitigate issues before they occur. We embed AI and machine learning across all aspects of cloud infrastructure management—from automated monitoring frameworks to intelligent CI/CD pipelines. This strategy produces environments that self-heal and adapt continuously, minimizing downtime, enhancing performance, and expanding the capabilities of cloud services.

This is not a conventional site reliability position focused on reacting to incidents and performing manual interventions. In this role, you will spearhead the development of AI-enhanced monitoring systems capable of detecting and resolving 95% of issues before end users are affected. You will also design and oversee AI-automated CI/CD pipelines that cut deployment times by 30% while minimizing manual effort. The successful candidate will thrive in AI-driven settings, embrace automation-first thinking, and take pride in advancing cloud infrastructure design.

You will be joining a global team of innovators who are transforming cloud infrastructure. Your contributions will be central to our objective of delivering next-generation, AI-powered operational excellence. We are looking for candidates who are passionate about AI and prepared to make a meaningful impact on the future of cloud services. If this resonates with you, we invite you to apply and become part of a transformative effort.

What you will be doing

  • Deploying AI-based monitoring services that automatically detect, predict, and resolve operational issues before they affect performance
  • Overseeing CI/CD pipelines with AI-driven automation to improve deployment efficiency and minimize manual intervention

What you will NOT be doing

  • Concentrating exclusively on manual monitoring, troubleshooting, and system maintenance; your objective will be to enable AI to handle these functions

Key responsibilities

  • Deliver seamless scalability and optimize performance for AI-powered cloud services, maintaining 99.99% uptime while executing AI-enhanced software upgrades and customizations aligned with clients' changing requirements

Candidate requirements

  • AI-First Mindset (if your default approach is to write code first and then use AI tools to verify or enhance your code, rather than the reverse, please do not apply)
  • Minimum of 3 years of DevOps experience, including automation of CI/CD pipelines and infrastructure management
  • Minimum of 2 years of experience with Amazon Web Services (AWS) or Google Cloud Platform (GCP)
  • Proficiency in AI and machine learning tools applied to monitoring, automation, and predictive analytics (or demonstrated willingness to learn and adapt to AI-driven technologies)
  • Strong programming and scripting capabilities, with experience automating tasks and developing AI-driven processes

Meet a successful candidate

Watch Interview
Anonymous
Anonymous  |  Elite Coder
Lebanon  

Have you ever made so much money you had to remain anonymous to protect yourself? How about being able to fix an impossible coding problem i...

Meet Anonymous

Applying for a role? Here’s what to expect.

Crossover's skill assessment process combines innovative AI power with decades of human research, to take the guesswork, human bias, and pointless filters out of recruiting high-performing teams.

Chat-style
screening interview.
STEP 1

Chat-style
screening interview.

Cognitive 
aptitude test.
STEP 2

Cognitive 
aptitude test.

Prove real-world 
job skills.
STEP 3

Prove real-world 
job skills.

Interview with the hiring manager.
STEP 4

Interview with the hiring manager.

Accept Job Offer.
STEP 5

Accept Job Offer.

Pass 
proctored test.
ONE LAST THING

Pass 
proctored test.

Frequently asked questions

About Crossover

Meet some people who've landed similar jobs

Why Crossover

Recruitment sucks. So we’re fixing it.

The Olympics of work

The Olympics of work

It’s super hard to qualify—extreme quality standards ensure every single team member is at the top of their game.

Premium pay for premium talent

Premium pay for premium talent

Over 50% of new hires double or triple their previous pay. Why? Because that’s what the best person in the world is worth.

Shortlist by skills, not bias

Shortlist by skills, not bias

We don’t care where you went to school, what color your hair is, or whether we can pronounce your name. Just prove you’ve got the skills.

Crossover Logo White
Follow us on
Have a question?

Get answers to common questions using our smart chatbot Crosby.

HELP AND FAQs

Join the world's largest community of AI first Remote WorkersAI-first remote workers.