Remote
$112k–$188k
senior
23 days ago
full-time
quality 8.7/10
What you’ll be doing:
- Own the reliability, monitoring, and incident response lifecycle for AI infrastructure services, including on-call support for AWS deployment pipelines, root cause analysis, and blameless retros.
- Build automation and tooling to streamline operational IT workflows, eliminate manual tasks, and improve deployment velocity across CI/CD frameworks and Kubernetes environments.
- Partner with the Coinbase Infrastructure team to extend CI/CD frameworks supporting IT services and enterprise network platforms, and with Security and Compliance to integrate surveillance tooling into deployment pipelines.
- Strengthen observability and documentation standards across IT engineering by defining metrics, implementing monitoring solutions, and maintaining technical documentation that sets a standard of excellence.
- Develop full-stack applications that power internal AI products and infrastructure with Go or Python.
What we look for in you:
- 5+ years of experience automating and supporting cloud infrastructure (AWS) and network environments, with hands-on use of infrastructure-as-code tools (Terraform, Ansible, Chef, Puppet, or Salt).
- Proven experience deploying, managing, and troubleshooting containerized workloads using Docker and Kubernetes in production environments.
- Proficiency in at least one scripting or programming language (Python, Bash, Ruby, or Go) and version control workflows using Git-based CI/CD pipelines.
- Track record of leading incident response in environments with strict SLAs, including root cause analysis, blameless retros, and measurable reliability improvements.
- Utilizes generative AI responsibly, maintaining human oversight to deliver business-ready outputs and drive measurable improvements in workflow efficiency, cost, and quality.
What we offer:
- Base salary varies by location (see range below). Total compensation may also include equity and bonus eligibility, and benefits (medical, dental, vision, 401(k)).
- Annual base salary range (excluding equity and bonus): $186,065 — $218,900 USD.
Similar jobs
Senior Site Reliability Engineer, Workforce Identity
Coinbase · Remote
$112k–$188k
23 days ago
View →
Senior Site Reliability Engineer
Manychat · Remote
$88k–$130k
5 days ago
View →
(Senior) DevOps Engineer (f/m/d)
adjoe · Remote
$90k–$135k
13 days ago
View →
Senior DevOps Engineer
TradingView · Remote
$80k–$130k
13 days ago
View →
Senior DevOps Engineer
Incode · Remote
$115k–$196k
23 days ago
View →
Senior DevOps Engineer
Incode · Remote
$98k–$162k
1 month ago
View →