Cloud Operations Engineer


Fully remote Cloud Operations Engineer position for leading enterprise software company pioneering the future of work.

Cloud Operations Engineer

$ 60k/Year   Flexible   Long-term
Good fit for: System Engineer, Devops Engineer, Cloud Support Engineer

Interested in utilizing your years of systems administration expertise to help enhance our cloud-based SaaS systems? Do you have the passion and ambition to grow your sysadmin skills, make decisions based on metrics and challenge yourself on a variety of products? If you can adapt to our roles, and learn new ideas to solve business problems, then this role may be for you.

The ideal candidate will have hands-on experience in managing and maintaining large-scale SaaS applications in one of the major platforms, such as AWS, VMWare or IBM Cloud. You will learn how to ‘lift & shift’ acquired company infrastructures into a modern Kubernetes/Docker/VMware cluster.

“The best achievement was improving my decision-making skills. I am an SEM, so I get the benefits of learning new technologies and at the same time being in touch with the executives - what better models can you have?.”
-Madalina S, Software Engineering Manager
, we have  full time partners from your country,   Let’s make it !


 
 
 
 
WHY CROSSOVER?
Crossover recruits and builds world class high performing teams to power the fastest growing portfolio of software products in the world. No other company provides the training and the opportunities to test yourself on the depth and diversity of projects that we do. All roles are location independent so you are guaranteed to work with the best in the world. Challenge yourself. Be part of the change.
 
 
 
WHAT YOU'LL BE DOING

As a Cloud Operations Engineer ($60K/year), you will be responsible for providing seasoned and experienced support to drive our infrastructure availability up for both internal and external clients while resolving incidents and service requests.

This role requires balancing availability, customer experience and the need to enhance the systems continually. We expect you to deep dive on all outages, perform root cause analysis and drive continuous product improvement.

There’s a breadth of opportunities for Cloud Operations Engineer in our organization. Starting with our infrastructure teams that manage and continuously improve our Kubernetes, Docker & VMware clusters or going all the way to our SaaS operations which will ensure great up-time and customer experience from our myriad of more than 100 products.

We are seeking for hands-on professionals who are comfortable reverse-engineering legacy products with manual build/deploy processes, ancient dependencies, fragile architecture, weak security and lack of scalability and performance. All this with the support and collaboration of highly experienced professionals in a truly international environment.

 
BOOTCAMP PROGRAM

To apply for a role at Crossover, you will go through a series of online tests, usually during the online hiring event. If you pass these tests, you will be offered the opportunity to participate in our four-full-time week Crossover University “Bootcamp” training program. This is elite training taught by our top instructors.

Here’s what our graduates have to say about Crossover University:

"I am very pleased to say that because of Crossover's unorthodox and unique way of transferring the knowledge through (Paired sessions, coaching sessions with CSMs), I have never been more confident in my technical skills and abilities for my role."
-Mikael F

"The CTO Bootcamp was another thing that motivated me. I wanted to see how CTOs across the globe work and learn from them."
-Javed Z

"I've been with the company since Aug (been part of the second Bootcamp) and since then I've learned SQL, databases, servers, tapes, other content management systems etc- and that's only been in 3 months. Usually when I'm in a new company, I learned a lot about their platform, their tools etc during the length of my time with them but never at this speed!."
-Monnaliza T

It is offered as soon as you want to get started. You will be compensated for 40 hrs/ week at the hourly rate for the role you are applying to. Crossover University is an excellent opportunity to learn about our culture, expectations, tools, processes, and procedures. It's an intensive and demanding program, but every graduate is guaranteed a job at the end of it.

 
ONLINE HIRING EVENT



A hiring event is a scheduled online event where all our relevant testing relating to a role is conducted on the same day. Submissions received during the event are graded the following week, and successful candidates notified if they have progressed to the next round which is an online interview with a Hiring Manager.

 
 
KEY RESPONSIBILITIES

Ensure that our multi-tenant infrastructure running more than 100 different products yields four nines

Eliminate complexity from both architecture and processes

Optimize our public cloud computing costs

Be proactive and work closely with the engineering teams to enhance our design and improve our platforms offering

Perform capacity planning

Employ modern instrumentation to enable production applications and infrastructure observability and then act upon the results

Practice sustainable incident response and blameless postmortems

 
CANDIDATE REQUIREMENTS

Bachelor's degree in Computer Science or related technical field involving IT or equivalent practical experience

3+ years of demonstrated experience managing and maintaining large-scale SaaS applications in one of the major platforms (VMWare, GPC, AWS, IBM Cloud) and cloud orchestration tools (Kubernetes, Marathon, VMware, etc.)

3+ years of experience with Linux and/or Windows Server operating systems (strong understanding)

Experience building and maintaining production systems on AWS using EC2, RDS, S3, ELB, Cloud Formation, etc. and familiarity interacting with the AWS APIs or VMWare / Azure experience

Deep experience administering Linux (Centos, RHEL, Ubuntu) systems OR Windows Server systems

Experience with Docker

Excellent knowledge of web application technology, including IIS, Tomcat, Apache, elasticsearch, nginx, haproxy etc.

Good network and file system skills

Experience with monitoring tools

Familiarity with ITIL processes, especially Incident, Change and Problem Management

Ability to debug and optimize code and automate routine tasks

Good proficiency in the English language

Nice to have:
  • Experienced with declarative configuration management and provisioning tools like Ansible, Puppet or Chef
  • Databases experience: MySQL, MSSQL, Oracle, PostgreSQL
  • Demonstrate success as a problem solver
  • Be a results-oriented individual
  • Comfortable “working virtually” with teammates and customers around the world
 
 
WHAT YOU WILL LEARN
 

As a Cloud Operations Engineer in Crossover, you will learn how to have an obsessive focus on improving the quality of your work and the quality of products through the use of First Time Acceptance Rate. You will get an opportunity to learn and work on cutting-edge monitoring tools, which allow you to operate at a scale of thousands of servers for 70+ different software companies. 

The candidate will be exposed to our metrics-driven culture, which is the foundation of our success in measuring and improving every engineering process and product we deliver.

Working with the top 1% talent will help you master your technical skills and become a specialist in some of the parts of our software factory model, such as the development of automated unit test or bug fixes. Ultimately Crossover’s dynamic environment will allow the candidate to move fast, set and achieve aggressive goals.

 
 
CAREER PATH
 
Senior Site Reliability Engineer
RESPONSIBLE FOR A PRODUCTS UPTIME, COST OPTIMIZATIONS, CHANGE REQUEST AND AUTOMATIONS.

Ensure that our multi-tenant infrastructure running more than 100 different products yields four nines and more of availability


Use IaaC to automate and enable scaling of environments and systems


Eliminate complexity from both architecture and processes


Optimize our public cloud computing costs


Manage the uptime error budget of your product


50
Cloud Operations Engineer
INDIVIDUAL CONTRIBUTOR IN A TEAM RESPONSIBLE FOR A PRODUCTS UPTIME, COST OPTIMIZATIONS, CHANGE REQUEST AND AUTOMATIONS.

Ensure that our multi-tenant infrastructure running more than 100 different products yields four nines


Eliminate complexity from both architecture and processes


Optimize our public cloud computing costs


Be proactive and work closely with the engineering teams to enhance our design and improve our platforms offering


Perform capacity planning


30
Cloud Operations Junior Engineer
INDIVIDUAL CONTRIBUTOR IN A TEAM RESPONSIBLE FOR A PRODUCTS UPTIME, COST OPTIMIZATIONS, CHANGE REQUEST AND AUTOMATIONS.

Ensure that our multi-tenant infrastructure running more than 100 different products yields four nines


Resolving outages based on SaaSOps playbooks and/or product knowledge


Monitor incoming tickets queue and follow documented processes for troubleshooting, recovery and service restoration processes


Optimize our public cloud computing costs


15
 
 
 
Work Examples
Assets
blog
This is an interview with one of our SaaSOps Engineering Manager, about deciding to build a Central repository to host all docker containers of all companies and products owned by Trilogy.
https://medium.com/the-crossover-cast/how-esw-capital-crossover-build-infrastructure-to-handle-100s-of-software-products-83eb62642d9
Medium
Relevant files and links
External resources
url
Here you can find a link to Brendan Gregg blog. He published a relevant book that will help you refine your SaaSOps technical skills. His book: Systems Performance: Enterprise and the Cloud, will provide you details and insights on modern terminologies, concepts, techniques and methodologies for analyzing and improving system performance.
http://www.brendangregg.com/sysperfbook.html
BrendanGregg
 

Questions
and Answers

  • What would be the priorities for a Cloud Operations Engineer on a daily basis?

    The top drivers are to ensure our infrastructure availability and our public cloud costs optimizations through eliminating complexity from both architecture and processes.

  • What are the expectations on a Cloud Operations Engineer during an outage?

    You are expected to be personally involved and drive towards a timely resolution. Also, you have to be able to deep dive on all outages, keep track of the underlying root causes and push towards their resolution, to improve reliability over time.

  • Are there any Cloud certifications required in the role of Cloud Operations Engineer?

    At Crossover we are always fostering our team to learn continuously. We understand that some Cloud certifications may help you demonstrate your knowledge and skills, but there is no particular certification required for the role of Site Reliability Engineer.

  • What are the challenges to have a team of people spread across different countries?

    The main challenges are related to bring together people from multiple time-zones and ensure they are in sync with the work to be delivered. We have developed an online productivity tool that helps remote workers manage their time more efficiently and receive a fair working environment.

  • What is the Crossover approach to acquire and integrate a new software company into the portfolio so aggressively?

    Before acquiring a company, we conduct due diligence. Once a buying decision is made, we use our standard model to import the SaaS products into our centralized environment. We then enhance the products to match our standardized model.

 
 
 
WHAT CROSSOVER MEMBERS SAY ABOUT THE ROLE

 
ABOUT THE ROLE

 
FAQs