Senior Infrastructure Engineer
Role in brief
Cointracker seeks a Senior Infrastructure Engineer to manage and scale the platform supporting its crypto tax and AI-driven personal finance products. This role involves operating Kubernetes clusters, database infrastructure, and caching layers, while also securing the platform and implementing observability. Ideal for an experienced engineer with deep GCP and Kubernetes skills who thrives in a fully remote environment.
About the role
This role focuses on the operational stability and scalability of Cointracker's core platform, which underpins its crypto tax and AI-powered financial planning tools. The Senior Infrastructure Engineer will be responsible for managing GKE clusters, ensuring they can handle fluctuating loads through KEDA-driven autoscaling. This includes maintaining critical database infrastructure like PostgreSQL and AlloyDB, along with Redis caching layers, to ensure reliable performance, especially during peak periods like tax season.
A key aspect of the position involves developing and maintaining infrastructure as code using Terraform and Helm charts, facilitating consistent deployments and GitOps pipelines. The engineer will also implement comprehensive observability solutions with OpenTelemetry, covering monitoring, dashboards, and metric export, while ensuring data privacy. Platform security is another core responsibility, utilizing tools like Cloudflare, Cloud Armor, and GCP Secret Manager.
Success in this role means providing a robust, scalable, and secure platform that allows backend and AI teams to develop and deploy new features efficiently. The engineer will partner closely with these teams, setting up staging environments and ensuring that infrastructure needs are met without delays. This requires a proactive approach to identifying and addressing technical challenges, with a focus on high availability and zero-downtime operations for a high-traffic web application.
The starting salary for this position ranges from $165,750 to $195,000 USD, and the candidate's location does not affect this range.
Skills that matter here
- GCP: Extensive hands-on experience with GCP services like GKE, AlloyDB/CloudSQL, Redis, Pub/Sub, GCS, Secret Manager, and IAM is required for managing the cloud infrastructure.
- Kubernetes: Proficiency in Kubernetes is essential for cluster management, autoscaling, workload identity, and node pool operations for the consumer web application.
- Terraform: Strong Terraform skills are needed for developing modules, managing state, and maintaining infrastructure-as-code discipline across environments.
- PostgreSQL: Experience with PostgreSQL and AlloyDB is critical for managing database operations at scale, including read pools, connection pooling with PgBouncer, and capacity planning.
- Redis: Managing Redis caching layers is a core task for handling session and data caching at scale within the platform.
- OpenTelemetry: Implementing observability with OpenTelemetry involves setting up monitors, dashboards, and metric routing, including PII stripping and Pub/Sub export.
Who this role suits
- This person has a deep conviction about the mission or the technical challenges involved.
- They are proactive in identifying problems, taking ownership, and driving outcomes.
- They excel at building collaborative relationships and are sought after for complex projects.
- This individual demonstrates strong critical thinking, able to uncover underlying issues and make sound judgments under uncertainty.
From the employer
- Operate and scale the GKE clusters powering the consumer webapp, with KEDA-driven autoscaling tied to product-dependent load metrics.
- Manage the database infrastructure that keeps tax season survivable: primary instances, read pools, connection pooling, and PSC networking.
- Maintain Redis caching layers for session and data caching at scale.
- Operate GitOps pipelines and image-driven promotion workflows from staging to production.
- Develop and maintain Helm charts for consistent deployments across environments.
- Implement observability with OpenTelemetry — monitors, dashboards, metric routing with PII stripping, and Pub/Sub export.
- Secure the platform with Cloudflare integration, Cloud Armor WAF, CloudOrigin CA TLS certificates, and GCP Secret Manager via External Secrets.
- Partner with backend and AI teams to stand up staging environments so they can run experiments and ship new features without waiting on infra.
Role-agnostic
- Opportunity Alignment: Genuine conviction about our mission, the opportunity we're going after, or the technical problems we're solving.
- High Agency: You identify what matters, take it on, and own outcomes.
- Exceptional Collaboration: Build high-trust partnerships that make team members want you on hard projects.
- Critical Thinking: Find the real question beneath the surface one, weigh alternatives, and show clear judgment under uncertainty.
Required
- Deep hands-on experience with GCP — GKE, AlloyDB/CloudSQL, Redis, Pub/Sub, GCS, Secret Manager, IAM
- Strong Terraform skills — module development, state management, infrastructure-as-code discipline
- Kubernetes proficiency — cluster management, Workload Identity, autoscaling, PodDisruptionBudgets, node pools
- Database operations at scale — PostgreSQL/AlloyDB read pools, connection pooling (PgBouncer), capacity planning
- High-traffic webapp experience — handling seasonal traffic spikes, zero-downtime deploys, rapid rollback
- Monitoring and observability — alerting strategies, SLO/SLI definition
- The starting salary for this position is between $165,750 - $195,000 USD.
- Candidate's location does not impact the posted salary range.
- The actual compensation offered will be based on a variety of job-related factors including relevant skills, education, and experience.
- This position also includes equity in CoinTracker and benefits like medical, dental, vision, 401(k), life insurance, and PTO.
Questions about this role
What is the remote work policy for this position?
This is a fully remote position, and the company operates as a fully distributed team.
What is the seniority level of this role?
This is a senior-level position.
What are the key technical skills required for this role?
Key technical skills include deep experience with GCP, strong Terraform skills, Kubernetes proficiency, database operations at scale with PostgreSQL/AlloyDB, and expertise in monitoring and observability.