Infrastructure Engineer

Remote $82k–$138k 7 days ago full-time quality 8.6/10

Role in brief

Gauntlet is seeking an Infrastructure Engineer to build and maintain cloud-native solutions for their onchain finance platform. This role involves supporting application teams, owning CI/CD, managing Kubernetes, and enhancing system resilience and security. Candidates with strong software engineering skills in Python, Go, TypeScript, or Rust, and experience with GCP, Kubernetes, and Terraform are encouraged to apply.

PythonGoTypeScriptRustGCPKubernetesTerraformGitHub ActionsHelm 3+DagsterIAMSecrets Management

About the role

This role focuses on building and maintaining the core infrastructure that powers Gauntlet's financial systems in onchain finance. The Infrastructure Engineer will be responsible for supporting application teams by handling infrastructure requests, ensuring product engineers can focus on development. Key tasks include managing permissions, roles, and service setups to streamline product delivery.

A significant part of the work involves owning and improving the CI/CD pipeline, maintaining GitHub Actions workflows, and migrating towards a dedicated CD tool for automated and secure deployments. The engineer will also author and update Terraform modules for services across GCP environments, ensuring infrastructure is defined as code. Managing Kubernetes deployments via Helm and maintaining async workloads on Dagster are also core responsibilities.

Success in this position means enhancing system resilience, security, and observability. This includes consolidating alerting systems, moving towards a region- and cloud-agnostic posture, and strengthening security through IAM, secrets management, and least privilege principles. The role also involves contributing to SOC 2 readiness and exploring the use of AI for automating routine operational tasks.

The salary for this Infrastructure Engineer role is between $82,000 and $138,000 USD.

Skills that matter here

  • Python: This role requires strong software engineering fundamentals in Python, especially for scripting and shell work.
  • GCP: The position requires hands-on experience with cloud infrastructure and core cloud services, specifically GCP, though AWS/Azure experience is transferable.
  • Kubernetes: The engineer will be responsible for operating large-scale Kubernetes production systems and managing service deployments via Helm.
  • Terraform: This role involves building and maintaining infrastructure as code by authoring and updating Terraform modules for services across GCP environments.
  • GitHub Actions: The engineer will maintain and extend GitHub Actions workflows as part of owning CI/CD processes.
  • IAM: This role requires applying IAM, secrets management, and least privilege principles to strengthen security and access controls.

Who this role suits

  • A person with strong software engineering fundamentals in at least one production language (Python, Go, TypeScript, or Rust), with a particular value placed on Python proficiency.
  • Someone with practical experience in cloud infrastructure, especially GCP, and who has operated large-scale Kubernetes production systems.
  • An individual who values security and access control, with familiarity in IAM, secrets management, and auditability.
  • A clear communicator who can articulate incidents, design decisions, and operational procedures in writing.

From the employer

What you'll do;

  • Support the application teams: turn around infra requests (permissions, roles, service setup, project peering) so product engineers stay focused on shipping.
  • Own CI/CD and deployments: maintain and extend our GitHub Actions workflows and help migrate toward a dedicated CD tool with proper permissioning — the goal is fully automated, locked-down deploys via service accounts, no direct engineer access to production.
  • Build and maintain infrastructure as code: author and update Terraform modules for new and existing services across GCP environments.
  • Run Kubernetes the right way: manage service deployments via Helm (we're on Helm 4) keep async workloads healthy on Dagster.
  • Unify observability (likely first project): consolidate today's per-team alerting into a single view — system-to-system dashboards plus incident alerting that routes upstream service/vendor failures to the right impacted teams and on-call rotations.
  • Advance resilience: help move us toward a fully region- and cloud-agnostic posture so services can pick up and move if something fails.
  • Strengthen security & access: apply IAM, secrets management, least privilege, and auditability; contribute to SOC 2 readiness.
  • Automate with AI: build agent skills / agents.md so routine tasks (provisioning access, simple changes) can be handled by an agent instead of human engineering hours, and use AI to reason through bigger problems.

What you bring;

  • Strong software-engineering fundamentals in at least one production language (Python, Go, TypeScript, or Rust); Python especially valued, plus comfort scripting and working in the shell.
  • Hands-on experience with cloud infrastructure and core cloud services, especially GCP (AWS/Azure transferable).
  • Experience operating large-scale Kubernetes production systems.
  • Experience with Infrastructure as Code, especially Terraform.
  • Familiarity with CI/CD systems, especially GitHub Actions or Octopus Deploy.
  • Ability to debug production issues using logs, metrics, traces, shell tools, and source code.
  • Security and access-control fundamentals: IAM, secrets management, least privilege, and auditability.
  • Clear written communication around incidents, design decisions, and operational procedures.

Bonus points

  • Supporting SOC 2 controls - evidence collection, access reviews, change management, or audit readiness.
  • Observability with Datadog, Prometheus, Grafana, OpenTelemetry, Honeycomb, or similar.
  • Improving developer experience through internal tooling, templates, scripts, or platform APIs.
  • Incident response experience, including postmortems and follow-up remediation.
  • Experience with Dagster, Helm 3+, high-scale CD tooling (Bazel, Octopus), or AI/agent-assisted ops.
  • Basic web3 / DeFi literacy (transactions, wallets) and genuine curiosity about onchain — the role doesn't touch chain directly, but the business is onchain.

Questions about this role

What is the remote work policy for this role?

This is a fully remote position.

What is the salary range for this position?

The salary for this role ranges from $82,000 to $138,000 USD.

What are the core technical skills required for this role?

Candidates should have strong software engineering fundamentals in Python, Go, TypeScript, or Rust, hands-on experience with GCP, Kubernetes, Terraform, and familiarity with CI/CD systems like GitHub Actions.

Similar jobs

Before you apply

  • Legitimate employers never ask you to pay anything to apply or get hired.
  • Never share seed phrases or private keys. No real job needs them.
  • Do not install software ("test tasks", "trading tools", "video call clients") sent during hiring.
  • Check that the application page's domain really belongs to Gauntlet.