Senior Site Reliability Engineer
Curology
Software Engineering
United States · Remote
Posted on Sep 7, 2024
Mission of the Role:
Architect and lead the delivery of high-quality, reliable solutions to our business problems on a frequent and regular cadence, applying creative problem-solving and technical expertise. Write software to automate and scale the operations of our engineering organization. Evangelize reliability-as-a-feature through monitoring, service-level objectives, automation, everything-as-code, and testing.
Essential Functions and Impact Areas:
- Architects and leads the delivery of high-quality, reliable solutions that address our business problems on a frequent and regular cadence.
- Continuously builds domain understanding to help shape and build solutions within their expertise.
- Champions automation to reduce toil and increase development velocity.
- Builds positive and collaborative relationships across the company.
- Helps define and instrument Service-Level Objectives to ensure an excellent customer experience.
- Applies everything-as-code methodologies across configuration, infrastructure, orchestration, and elsewhere.
- Hosts blameless postmortems to share learnings, discover gaps, embrace transparency, and improve reliability across our services.
- Leads projects from inception to completion.
- Participates in an on-call rotation to help resolve incidents.
Minimum Skills & Requirements:
- 5+ years of experience building infrastructure solutions in AWS using Infrastructure-as-Code technologies such as Terraform or CloudFormation.
- 5+ years of experience working with Docker containers and related orchestration technologies (such as Kubernetes or ECS).
- 5+ years of experience building and deploying CI/CD pipelines.
- Experience with AWS, Docker, Kubernetes, Terraform, Python, and PHP.
- Experience with architectural patterns of large, high-scale applications, such as well-designed APIs and database schemas.
- Experience working collaboratively in cross-functional teams with engineers in product and data groups.
- Deep technical expertise: Writes, debugs, and refactors code while being mindful of tradeoffs, scalability, architecture, and code cleanliness. Demonstrates mastery of their craft to solve problems in automation, infrastructure, and/or developer tooling.
- Reliability & Quality: Experience leveraging observability tooling and practices such as SLOs to help engineering teams own the reliability and quality of the software they build.
- Leadership: Defines and delivers well-scoped milestones for a project and is beginning to show up as a leader in their area of expertise on key projects. Senior SREs are able to make autonomous decisions relevant to our overall strategy.
Why You'll Love Working at Curology:
- Competitive salary and equity packages
- Comprehensive benefits: medical, dental, and vision insurance for employees; flexible spending account; 401(k); mental health & wellness programs
- Home office setup stipend
- Minimum Time Off policy (unlimited PTO, with at least 3 weeks off)
- 11 company-observed holidays
- Additional holidays: Curology days off (1 per quarter), 1 annual floating holiday (employee’s choice), and Gratitude Week (employees take the full week of Thanksgiving off; business-critical teams observe different days)
- Paid parental leave
- Employee donation matching program
- Company-sponsored events
- Free subscription to Curology or Agency