Site Reliability Engineer Resume Example & Writing Guide

Use this Site Reliability Engineer resume example and guide to improve your career and write a powerful resume that will separate you from the competition.

Site reliability engineers are responsible for maintaining the stability, reliability, and performance of their company’s websites and web applications. They work with other engineers to identify and fix issues that arise, and they’re often the first point of contact when something breaks.

Because site reliability engineers work behind the scenes, you might not know they exist until something goes wrong; when everything is running smoothly, you probably don’t notice them at all. Their work matters because poor performance can drive away customers.

If you want to work in an environment that’s fast-paced, challenging, and rewarding, consider a career as a site reliability engineer. Here are some tips plus an example to help you write a stellar site reliability engineer resume that hiring managers will love.

Mary Thompson
New York City, NY | (123) 456-7891 | [email protected]
Summary

Driven and innovative site reliability engineer with experience in high-pressure environments. Proven ability to maintain uptime and optimize systems for maximum performance. Excels at quickly diagnosing and resolving issues.

Education
Columbia University Jun '10
M.S. in Computer Science
Columbia University Jun '06
B.S. in Computer Science
Experience
Company A, Site Reliability Engineer Jan '17 – Current
  • Managed the deployment of over 100 Kubernetes clusters across multiple environments, including staging and production.
  • Developed a tool to automate the creation of Kubernetes clusters using Terraform and implemented it into our CI/CD pipeline for use by all engineering teams.
  • Implemented an automated health check system for Kubernetes clusters that identified issues before they affected customers and reduced mean time to repair from 5 hours to 30 minutes.
  • Built a dashboard in Grafana to monitor cluster resource usage and performance metrics, which SREs and engineers across the company used to troubleshoot problems with their applications or infrastructure.
  • Created documentation on deploying Kubernetes clusters via Helm charts and on best practices for deploying applications onto Kubernetes; it is used by other SREs at the company as well as by others in the industry.
Company B, Site Reliability Engineer Jan '12 – Dec '16
  • Worked with the DevOps team to implement a monitoring system that improved uptime by 15%.
  • Created an automation framework for deploying new servers and configuring them based on environment variables.
  • Improved server utilization by identifying unnecessary resources and decommissioning them when possible.
  • Reduced average ticket response time from 3 days to 1 day through better communication with customers.
  • Implemented a logging system that tracked all actions performed on production machines, increasing accountability 10x.
Company C, System Administrator Jan '09 – Dec '11
  • Installed, configured, and maintained servers, workstations, and other network devices.
  • Monitored system performance and took corrective action when necessary to ensure optimal performance.
  • Performed regular backups and recovery procedures to ensure data integrity in the event of a system failure.
Certifications
  • Google Cloud Platform Certification
  • Linux Foundation Certified System Administrator
  • AWS Certified Solutions Architect – Associate
Skills

Industry Knowledge: Linux, Python, Puppet, AWS, GCP, DevOps, Kubernetes, Docker, Jenkins, Ansible, Terraform
Technical Skills: AWS, GCP, Terraform, Ansible, Jenkins, Bash Scripting, Red Hat
Soft Skills: Communication, Leadership, Problem Solving, Teamwork, Attention to Detail, Self-Motivation

How to Write a Site Reliability Engineer Resume

Here’s how to write a site reliability engineer resume of your own.

Write Compelling Bullet Points

The best way to make your resume stand out is to use strong, specific language. Rather than saying you “managed server infrastructure,” you could say you “managed server infrastructure for a high-traffic website, resulting in 100% uptime during peak holiday shopping season.”

The second version is much more impressive because it provides specific details about what you did and includes a quantifiable result of your work (100% uptime).

Identify and Include Relevant Keywords

When you submit your resume for a site reliability engineer role, it’s likely that it will be scanned by an applicant tracking system (ATS) for certain keywords. ATS programs rank resumes based on how many of the desired keywords are found in the resume. If your resume doesn’t have enough of the right terms, the ATS might filter out your application.

One way to make sure your resume makes it past the ATS is to include keywords that are commonly found on site reliability engineer job postings. Here are a few examples:

  • Ansible
  • Jenkins
  • Git
  • Amazon Web Services (AWS)
  • Docker Products
  • Kubernetes
  • Go (Programming Language)
  • Bash
  • Continuous Integration
  • Python (Programming Language)
  • Site Reliability Engineering
  • Google Cloud Platform (GCP)
  • Puppet
  • Linux
  • Kubernetes Engine
  • Google Container Engine (Kubernetes)
  • Terraform
  • Jenkins Job Builder
  • Splunk
  • Docker Swarm
  • GCP products
  • MongoDB
  • C#
  • Amazon EC2
  • AWS Lambda
  • MySQL
  • GoLang
  • Microsoft Azure
  • Jenkins Blue Ocean
  • Java
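
To get a rough sense of how keyword matching might treat your resume, you can check which terms from a posting already appear in it. The snippet below is only a minimal sketch: real ATS products differ and their scoring is not public, and the KEYWORDS list, the keyword_coverage helper, and the sample resume text are all hypothetical names and data chosen for illustration.

```python
import re

# Hypothetical keyword list, drawn from the examples above.
KEYWORDS = ["Kubernetes", "Terraform", "Ansible", "Jenkins", "Python", "Linux", "Splunk"]


def keyword_coverage(resume_text, keywords):
    """Split keywords into those found in the resume and those missing.

    Matching is case-insensitive and requires the keyword to appear as a
    whole term (so "Go" would not match inside "Google").
    """
    matched, missing = [], []
    for kw in keywords:
        # Lookarounds instead of \b so terms such as "C#" also work.
        pattern = r"(?<!\w)" + re.escape(kw) + r"(?!\w)"
        if re.search(pattern, resume_text, flags=re.IGNORECASE):
            matched.append(kw)
        else:
            missing.append(kw)
    return matched, missing


# Illustrative resume excerpt, not taken from a real resume.
resume = (
    "Automated Kubernetes cluster creation with Terraform; "
    "wrote Python and Bash tooling on Linux."
)

matched, missing = keyword_coverage(resume, KEYWORDS)
print(f"{len(matched)}/{len(keywords) if (keywords := KEYWORDS) else 0} keywords found")
print("Missing:", ", ".join(missing))
```

Terms reported as missing are candidates to add, but only where they truthfully describe your experience.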

Showcase Your Technical Skills

Site reliability engineers (SREs) are responsible for ensuring that the websites and applications that their company relies on are always up and running. They do this by constantly monitoring the system and troubleshooting any issues that may arise. SREs rely heavily on technology to do their job, so it is important to list your technical skills prominently on your resume.

SREs are typically expected to be proficient in monitoring and logging tools, configuration management tools, source control systems, and scripting languages. Additionally, SREs should have a working knowledge of web application architectures and how they function.
