Cloud - SRE - Reliability


Elastic is a search company that powers enterprise search, observability, and security solutions built on one technology stack that can be deployed anywhere. From finding documents to monitoring infrastructure to hunting for threats, Elastic makes data usable in real time and at scale. Thousands of organizations worldwide, including Barclays, Cisco, eBay, Fairfax, ING, Goldman Sachs, Microsoft, The Mayo Clinic, NASA, The New York Times, Wikipedia, and Verizon, use Elastic to power mission-critical systems. Founded in 2012, Elastic is a distributed company with Elasticians around the globe. Learn more at

Thanks to our ongoing expansion, we have the opportunity to grow our Cloud SRE - Reliability team, the front-line owners of Incident Management, Investigation, and Response for the Elastic Cloud platform. We take a Site Reliability Engineering approach to addressing stability concerns, so we’re looking for people who are just as passionate about resolving distributed system issues as they are about coding and collaborating with others. In this role you’ll be responsible for the health of thousands of Elasticsearch clusters spread across all major cloud providers.

Who you are:

  • You have outstanding interpersonal skills, and can effectively coordinate incident response across globally distributed teams in a dynamic, growing environment
  • You are a software engineer at heart, with a compulsion to automate yourself out of a job
  • You have production-grade experience operating Linux systems, with the ability to methodically diagnose system, network, and application issues
  • Experience with GovCloud is welcome

What you’ll do:

In this role you will:

  • participate in a weekly on-call rotation, using a follow-the-sun model; on-call shifts are aligned with local business hours
  • provide low-latency response to incidents and service instability, coordinating with internal and external teams as needed
  • contribute to tooling, automation, and system engineering efforts, freeing yourself and others from day-to-day toil
  • lead blameless post-mortems, ensuring preventative actions are prioritised appropriately
  • be an advocate for Elastic Cloud customers, sharing your deep insight into our production systems with other engineering teams

What you’ve done:

You don't need to have done all of these, but they represent the types of work you will do at Elastic Cloud.

  • You have operated a SaaS product in a public cloud (AWS, GCP, Azure, or SoftLayer preferred), and have some stories to share
  • You are adept at writing software to automate orchestration tasks at scale; we commonly use Python, Go, and Shell scripting
  • You can use metrics systems (e.g. Elastic, Graphite, Prometheus, Influx) effectively to diagnose issues and quantify impacts
  • You have worked with cloud infrastructure-as-code tooling such as Terraform, CloudFormation, or others
  • You've diagnosed and resolved Elastic Stack cluster issues
  • You are familiar with containerisation and container orchestration concepts

Additional Information - We Take Care of Our People

As a distributed company, diversity drives our identity. Whether you’re looking to launch a new career or grow an existing one, Elastic is the type of company where you can balance great work with great life. Your age is only a number. It doesn’t matter if you’re just out of college or your children are; we need you for what you can do.

We strive to have parity of benefits across regions, and while regulations differ from place to place, we believe taking care of our people is the right thing to do.

  • Competitive pay based on the work you do here and not your previous salary
  • Health coverage for you and your family in many locations
  • Ability to craft your calendar with flexible locations and schedules for many roles
  • Generous number of vacation days each year
  • Double your charitable giving — we match up to 1% of your salary
  • Up to 40 hours each year to use toward volunteer projects you love
  • Embracing parenthood with a minimum of 16 weeks of parental leave

Elastic is an Equal Employment employer committed to the principles of equal employment opportunity and affirmative action for all applicants and employees. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status or any other basis protected by federal, state or local law, ordinance or regulation. Elastic also makes reasonable accommodations for disabled employees consistent with applicable law.


When you apply to a job on this site, the personal data contained in your application will be collected by Elasticsearch, Inc. (“Elastic”) which is located at 800 W. El Camino Real, Suite 350 Mountain View, CA 94040 USA, and can be contacted by emailing Your personal data will be processed for the purposes of managing Elastic’s recruitment related activities, which include setting up and conducting interviews and tests for applicants, evaluating and assessing the results thereto, and as is otherwise needed in the recruitment and hiring processes. Such processing is legally permissible under Art. 6(1)(f) of Regulation (EU) 2016/679 (General Data Protection Regulation) as necessary for the purposes of the legitimate interests pursued by Elastic, which are the solicitation, evaluation, and selection of applicants for employment. Your personal data will be shared with Greenhouse Software, Inc., a cloud services provider located in the United States of America and engaged by Elastic to help manage its recruitment and hiring process on Elastic’s behalf. Accordingly, if you are located outside of the United States, your personal data will be transferred to the United States once you submit it through this site. Because the European Union Commission has determined that United States data privacy laws do not ensure an adequate level of protection for personal data collected from EU data subjects, the transfer will be subject to appropriate additional safeguards under the standard contractual clauses. You can obtain a copy of the standard contractual clauses by contacting us at Elastic’s data protection officer is Daniela Duda, who can be contacted at We plan to keep your data until our open role is filled. We cannot estimate the exact time period, but we will consider this period ended when a candidate accepts our job offer for the position for which we are considering you. 
When that period is over, we may keep your data for an additional period no longer than 3 years in case additional opportunities present themselves for which your skills might be better suited. For additional details, please see our Elastic Privacy Statement.

