Role in brief
Gauntlet is seeking an Infrastructure Engineer to build and maintain cloud-native platforms for onchain finance. This role involves supporting application teams, owning CI/CD, managing Kubernetes, and enhancing security and resilience. Candidates with strong software engineering skills, particularly in Python, and experience with GCP, Kubernetes, and Terraform should apply to help manage over $1.5B in client assets.
About the role
This Infrastructure Engineer role at Gauntlet focuses on building and maintaining the core infrastructure that supports their financial systems. The position involves direct support for application teams by handling infrastructure requests, ensuring product engineers can focus on development. A key responsibility is to manage and extend CI/CD pipelines, specifically GitHub Actions, with a goal of achieving fully automated and secure deployments without direct human access to production environments.
The successful candidate will be responsible for infrastructure as code, authoring and updating Terraform modules across various GCP environments. Managing Kubernetes deployments using Helm and maintaining asynchronous workloads on Dagster are also central to the role. A significant initial project will likely be unifying observability across teams, consolidating alerting into a single view with proper incident routing.
Long-term objectives for this position include moving Gauntlet's infrastructure toward a region- and cloud-agnostic posture so that services can relocate and recover when something fails. The role also covers strengthening security through IAM, secrets management, least privilege, and auditability, and contributing to SOC 2 readiness. Finally, it involves building agent skills so that routine tasks, such as provisioning access or making simple changes, can be handled by an AI agent rather than by engineers.
The national pay range for this full-time Infrastructure Engineer role is $150,000 to $175,000 base, with additional potential for On Target Earnings and equity.
Skills that matter here
- GCP: This role requires hands-on experience with Google Cloud Platform, as it is the primary cloud environment for Gauntlet's infrastructure.
- Kubernetes: The engineer will be responsible for operating large-scale Kubernetes production systems and managing service deployments.
- Terraform: Building and maintaining infrastructure as code is a core duty, specifically authoring and updating Terraform modules.
- GitHub Actions: The role involves owning and extending CI/CD workflows, with a focus on GitHub Actions.
- Dagster: The engineer will be responsible for keeping async workloads healthy on Dagster.
- Python: Strong software engineering fundamentals in Python are highly valued, along with comfort in scripting.
Who this role suits
- A person who thrives on supporting product engineers by quickly addressing infrastructure needs.
- Someone who is meticulous about security and access control, advocating for principles like least privilege.
- An individual who enjoys unifying disparate systems, such as consolidating observability and alerting.
- A candidate who is proactive in advancing system resilience and exploring automation with AI.
From the employer
- Support the application teams: turn around infra requests (permissions, roles, service setup, project peering) so product engineers stay focused on shipping.
- Own CI/CD and deployments: maintain and extend our GitHub Actions workflows and help migrate toward a dedicated CD tool with proper permissioning — the goal is fully automated, locked-down deploys via service accounts, no direct engineer access to production.
- Build and maintain infrastructure as code: author and update Terraform modules for new and existing services across GCP environments.
- Run Kubernetes the right way: manage service deployments via Helm (we're on Helm 4) and keep async workloads healthy on Dagster.
- Unify observability (likely first project): consolidate today's per-team alerting into a single view — system-to-system dashboards plus incident alerting that routes upstream service/vendor failures to the right impacted teams and on-call rotations.
- Advance resilience: help move us toward a fully region- and cloud-agnostic posture so services can pick up and move if something fails.
- Strengthen security & access: apply IAM, secrets management, least privilege, and auditability; contribute to SOC 2 readiness.
- Automate with AI: build agent skills / `agents.md` so routine tasks (provisioning access, simple changes) can be handled by an agent instead of human engineering hours, and use AI to reason through bigger problems.
- Strong software-engineering fundamentals in at least one production language (Python, Go, TypeScript, or Rust); Python especially valued, plus comfort scripting and working in the shell.
- Hands-on experience with cloud infrastructure and core cloud services, especially GCP (AWS/Azure transferable).
- Experience operating large-scale Kubernetes production systems.
- Experience with Infrastructure as Code, especially Terraform.
- Familiarity with CI/CD systems, especially GitHub Actions or Octopus Deploy.
- Ability to debug production issues using logs, metrics, traces, shell tools, and source code.
- Security and access-control fundamentals: IAM, secrets management, least privilege, and auditability.
- Clear written communication around incidents, design decisions, and operational procedures.
- The national pay range for this role is $150,000 - $175,000 base plus additional On Target Earnings potential by level and equity in the company.
- Our salary ranges are based on paying competitively for a company of our size and industry, and are one part of many compensation, benefits and other reward opportunities we provide.
- Individual pay rate decisions are based on a number of factors, including qualifications for the role, experience level, skill set, and balancing internal equity relative to peers at the company.
Questions about this role
What is the remote work policy for this role?
This is a fully remote position.
What level of experience is required for this role?
The role requires strong software engineering fundamentals and hands-on experience with cloud infrastructure, large-scale Kubernetes, and Infrastructure as Code.
How do I apply for this position?
The job posting does not specify an application method. Applications are typically submitted through the company's website or a designated job portal.