Site Reliability Engineer

Are you looking for a hands-on, CTO-like opportunity to own the infrastructure architecture and operations of enterprise software products that generate multi-million-dollar annual revenue? If so, then this role is for you.

$100k/Year ($50/Hour for 40 hours of productive work per week), Remote Position, Long-term

The Site Reliability Engineer combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems.

You will have the opportunity to optimize existing systems, build infrastructure and eliminate work through automation.

As an SRE, you will ensure that systems, both internally critical and externally visible, have reliability and uptime appropriate to users' needs and a fast rate of improvement, while keeping an ever-watchful eye on capacity, performance, and cost.

Crossover recruits and builds world-class, high-performing teams to power the fastest-growing portfolio of software products in the world. No other company provides the training and the opportunities to test yourself on the depth and diversity of projects that we do. All roles are location independent, so you are guaranteed to work with the best in the world. Challenge yourself. Be part of the change.

SREs are responsible for the big picture of how our systems relate to each other. We use a breadth of tools and approaches to solve a broad spectrum of problems.

Practices such as limiting time spent on operational work, cost optimization, blameless postmortems, and proactive identification of potential outages factor into the iterative improvement that is key both to product quality and to interesting, dynamic day-to-day work.

You will also be the driver for technical simplification initiatives throughout the organization.


To apply for a role at Crossover, you will go through a series of online tests, usually during the online hiring event. If you pass these tests, you will be offered the opportunity to participate in our four-week, full-time Bootcamp training program. This is elite training taught by our top instructors.

Here’s what our graduates have to say about Bootcamp:

"I am very pleased to say that because of Crossover's unorthodox and unique way of transferring the knowledge through (Paired sessions, coaching sessions with CSMs), I have never been more confident in my technical skills and abilities for my role."
-Mikael F

"The CTO Bootcamp was another thing that motivated me. I wanted to see how CTOs across the globe work and learn from them."
-Javed Z

"I've been with the company since Aug (been part of the second Bootcamp) and since then I've learned SQL, databases, servers, tapes, other content management systems etc- and that's only been in 3 months. Usually when I'm in a new company, I learned a lot about their platform, their tools etc during the length of my time with them but never at this speed!"
-Monnaliza T

Bootcamp is offered as soon as you want to get started. You will be compensated for 40 hrs/week at the hourly rate for the role you are applying to. Bootcamp training is an excellent opportunity to learn about our culture, expectations, tools, processes, and procedures. It's an intensive and demanding program, but every graduate is guaranteed a job at the end of it.

A hiring event is a scheduled online event where all the testing relevant to a role is conducted on the same day. Submissions received during the event are graded the following week, and successful candidates are notified that they have progressed to the next round, which is an online interview with a Hiring Manager.


Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation, and refinement

Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews

Maintain products/applications/services once they are in production, by measuring and monitoring availability, latency, and overall system health

Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity

Practice sustainable incident response and blameless postmortems

Eliminate complexity from both architecture and processes

Optimize cloud computing costs


Bachelor's degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience

3+ years of demonstrated experience managing and maintaining large-scale SaaS applications on one of the major cloud platforms (Azure, GCP, AWS, IBM Cloud) and with cloud orchestration tools (Kubernetes, Marathon, VMware, etc.)

2+ years of experience with, and a strong understanding of, the Linux operating system

3+ years of experience in at least one programming language: Java, C, C++, Python, Go, Perl or Ruby

A systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive

Ability to debug and optimize code and automate routine tasks

(Desired) Experience with declarative configuration management and provisioning tools like Ansible, Puppet or Chef


As a Site Reliability Engineer at Crossover, you will get an opportunity to learn and work on cutting-edge monitoring tools, which allow you to operate at a scale of thousands of servers for 70+ different software companies. You will be exposed to our metrics-driven culture, which is the foundation of our success in measuring and improving every engineering process and product we deliver.

Working with top 1% talent will help you, as a Site Reliability Engineer, master managerial as well as technical skills. Ultimately, Crossover’s dynamic environment will allow you to move fast and to set and achieve aggressive goals.


Relevant Files and Links

How ESW Capital & Crossover Build Infrastructure to Handle 100s of Software Products

This is an interview with one of our SaaSOps Engineering Managers about the decision to build a central repository to host all Docker containers of all companies and products owned by ESW Capital.


Questions and Answers

  • Are there any Cloud certifications required in the role of Site Reliability Engineer?

    At Crossover, we always encourage our team to learn continuously. We understand that some Cloud certifications may help you demonstrate your knowledge and skills, but there is no particular certification required for the role of SRE.

  • What would be the priorities for an SRE on a daily basis?

    The top drivers are uptime, cost and performance (or architectural simplicity).

  • What are the expectations of an SRE during an outage?

    You have to be able to dive deep into every outage and keep track of them in order to improve reliability over time. Additionally, you perform Root Cause Analysis (RCA) and then prepare, implement, and measure the effectiveness of an action plan to prevent further outages.

  • What are the challenges of having a team of people spread across different countries, and how does Crossover conduct the performance management process?

    The main challenges are related to multiple time zones and productivity reviews. On a weekly basis, you get goals and performance reviews that accelerate your career path in this very fast-paced environment.

  • What is Crossover's approach to acquiring and integrating a new software company into the portfolio so aggressively?

    Before acquiring a company, we conduct four weeks of due diligence. After the buying decision is made, we use our standard model to import the software company into Crossover's centralized environment within 90 days. During this time, we take whatever the company has created and re-engineer or restructure it, improving it to match our standardized model for software products.