DE Jobs


Job Information

OverDrive, Inc. Site Reliability Engineer in Cleveland, Ohio

This position requires you to be in Cleveland, OH. We work a hybrid schedule: 2 days on campus and 3 days working from home.

The Site Reliability Engineer's (SRE) responsibilities include availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning for existing and future services. SREs are expected to apply software development principles to the operational work they perform. They predict performance issues and correct them before they affect end users. They work with application developers to ensure each application meets its uptime requirements. SREs participate in an on-call rotation that sometimes requires incident response during off hours.

Responsibilities:

  • Work on small projects and individual tasks with regular guidance from more senior developers.
  • Provide day-to-day support for development teams by building and running deployments, answering questions, and similar tasks.
  • Monitor Service-Level Indicators for applications and systems.
  • Independently train in the systems and technologies that the team uses. 
  • Provide feedback to application developers from a system perspective to help meet application performance objectives.
  • Participate in on-call rotations to provide first-line support during incidents.
  • Work with applications in various languages in a Linux environment.

Requirements:

  • 2+ years of experience in software development or system administration; 1+ years of experience working in Linux environments.
  • Proficient understanding of how modern networks and the Internet function.
  • Experience identifying and resolving outages and performance issues in complex, networked applications.
  • Experience working with large cloud providers; Amazon Web Services (AWS) experience a plus.
  • Experience using a scripting language (e.g., Ruby or Python) to automate tasks.
  • Experience with configuration automation tools; Ansible experience a plus.
  • Experience with logging and monitoring frameworks.
  • Able to participate in on-call rotations that require responding to incidents outside of business hours.
  • Able to work with a geographically distributed team, with infrequent in-person communication.

What's Next:

As you've probably guessed, OverDrive is a place that values individuality and variety. We don't want you to be like everyone else; we don't even want you to be like us. We want you to be like you! So, if you're interested in joining the OverDrive team, apply below and tell us what inspires you about OverDrive and why you think you are perfect for our team.

OverDrive values diversity and is proud to be an equal opportunity employer.
