Remote
$115k–$196k
senior
2 months ago
full-time
Responsibilities:
- Ensuring storage is reliable, predictable, and not a bottleneck for any critical workloads across the company
- Owning performance and stability of storage systems, and continuously improving them as data volumes and workloads grow
- Designing and evolving data placement, resiliency, and lifecycle strategies to balance performance, cost, and reliability
- Ensuring the platform behaves predictably during failures, maintenance, and scaling events
- Improving how storage integrates with compute environments (GPU/HPC, Kubernetes, data pipelines)
- Driving faster and more reliable incident detection, resolution, and prevention
- Improving capacity planning to avoid emergency scaling and unexpected degradation
- Continuously improving tooling, automation, and operational practices to make the platform easier to operate and scale
Requirements:
- Experience operating large-scale storage systems in production (distributed or vendor-based)
- Strong understanding of Linux, storage performance, and system behavior under load
- Ability to troubleshoot complex issues and drive them to resolution
- Practical approach to automation and system reliability
- Ownership mindset — ability to take responsibility for critical systems and improve them over time
Nice-to-have:
- Experience working with high-performance or distributed storage systems
- Understanding of networking in high-throughput environments
- Experience in environments with high reliability and performance requirements (finance, HFT, etc.)
What we offer:
- Great challenges with many opportunities to prove yourself
- A welcoming group of highly qualified international professionals
- Cutting-edge hardware and technology
- Work remotely from anywhere in the world
- Access to any of our global offices at any time
- Flexible schedule
- 40 paid days off
- Competitive salary