Jobs / Jobgether
Site Reliability Engineer
Jobgether · United States
Visa: unknownSalary: unknownWork mode: unknown
Skills
ansibleargocdawsci/cdcirclecigithub actionskubernetesterraform
Description
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Site Reliability Engineer in the United States.
This role offers the opportunity to maintain and scale mission-critical infrastructure for a fast-growing, fully remote technology company. You will work with diverse systems, including multi-terabyte databases, Kubernetes clusters, telemetry pipelines, and CI/CD workflows, ensuring high availability, resilience, and performance. The position combines hands-on engineering with strategic problem-solving, allowing you to design and implement scalable, automated, and maintainable systems. You will collaborate across engineering teams to optimize infrastructure, improve disaster recovery, and enhance operational reliability. Success in this role directly impacts system uptime, customer satisfaction, and overall business performance. This is ideal for engineers who value autonomy, ownership, and building high-quality infrastructure in a remote-first environment.
Accountabilities
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Why Apply Through Jobgether?
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
This role offers the opportunity to maintain and scale mission-critical infrastructure for a fast-growing, fully remote technology company. You will work with diverse systems, including multi-terabyte databases, Kubernetes clusters, telemetry pipelines, and CI/CD workflows, ensuring high availability, resilience, and performance. The position combines hands-on engineering with strategic problem-solving, allowing you to design and implement scalable, automated, and maintainable systems. You will collaborate across engineering teams to optimize infrastructure, improve disaster recovery, and enhance operational reliability. Success in this role directly impacts system uptime, customer satisfaction, and overall business performance. This is ideal for engineers who value autonomy, ownership, and building high-quality infrastructure in a remote-first environment.
Accountabilities
- Build, maintain, and optimize infrastructure systems across databases, cloud platforms, and telemetry pipelines.
- Ensure high availability, stability, and resilience of production systems and applications.
- Automate operational processes, including CI/CD workflows, database lifecycles, and deployment pipelines.
- Monitor and respond to incidents, providing escalation support and root cause analysis.
- Collaborate with engineering teams to design and implement scalable solutions and multi-region architectures.
- Contribute to infrastructure improvements, disaster recovery strategies, and operational documentation.
- Mentor and support team members in best practices for infrastructure management and reliability.
- Senior-level candidates: 5+ years of experience in building modern infrastructure systems; Staff-level candidates: 8+ years.
- Proven expertise supporting mission-critical production systems, serving as a final escalation point.
- Strong knowledge of cloud computing (AWS), container orchestration (Kubernetes), and CI/CD tools (GitHub Actions, ArgoCD, CircleCI).
- Experience with configuration management and automation tools such as Terraform and Ansible.
- Proficiency with databases including MongoDB, PostgreSQL, Elasticsearch, and data analytics/telemetry systems.
- Solid understanding of networking and data transfer protocols (DNS, HTTP, TCP).
- Excellent communication skills and ability to work effectively in a fully remote, collaborative team.
- US-based candidates only. Bonus points for open source contributions, multi-region architecture experience, MLOps exposure, or experience scaling Temporal.
- Competitive compensation with organization-wide goal-based bonus.
- Paid Time Off: ~5 weeks plus Winter and Summer holidays; additional PTO accrual each year.
- Flexible 80% work option: choose between standard 5-day weeks or 4-day weeks at 80% pay.
- Paid parental leave for primary and secondary caregivers.
- Sabbatical: 1 month paid leave after 5 years of service.
- Healthcare (US residents): medical, dental, vision, HSA, dependent care FSA.
- 401k (US residents): 6% matching contributions with immediate vesting.
- Remote-first flexibility, professional development support, and opportunities to contribute to open-source projects.
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Why Apply Through Jobgether?
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.