Staff Software Engineer, Cloud Data Storage
Apply now →Summary
Cloud Data Store (CDS) owns the storage, retrieval, and lifecycle of all workflow data at planet scale. We design the persistence APIs, build storage abstractions that run across cloud vendors, and deliver the observability that lets customers trust their state machines for years. [To see more detail re: the Temporal CDS Eng team, click here]
As a Staff Engineer, you will get the chance to design, build, and maintain significant portions of our backend functionality for highly scalable, multi-tenant services. You’ll own the custom persistence stack for Temporal Cloud which includes a Write Ahead Log, various metadata stores (Cassandra, etcd), multi-level caches, tiered storage etc.
What You'll Do
Design & build distributed data systems – craft APIs, schemas, and replication paths that keep petabytes of workflow history durable and query-able. Clearly document design choices and operational knowledge to successfully deploy and run service with those features.
Drive reliability & performance – own SLOs, create chaos-test plans, profile hot paths, and lead incident reviews.
Technical leadership – break down roadmap epics, mentor mid-level engineers, steward design docs through RFC.
Cross-team collaboration – partner with the Server, Cloud, and DX teams to land features end-to-end.
What You'll Bring
5 or more years of experience as an "Arranger" and/or "Builder/Enhancer" of highly scalable distributed systems. see HERE for more info re: "Arranger" and/or "Builder/Enhancer"
Solid computer science fundamentals in distributed systems concepts including multi-threading and concurrency.
Experience writing concurrent code in production with languages like Go or Java or other applicable languages with skill level as "high end of Intermediate" and/or "Advanced" or "Expert" levels. see HERE for more info re: "high end of Intermediate" and/or "Advanced" or "Expert levels"
Experience building and running services on AWS. Bonus: Microsoft Azure and Google GCP experience.
Experience with Elastic Search and/or Clickhouse
Nice to Have
Prior contributions to Temporal, Cadence, or other workflow engines.
Deep expertise in a storage domain (LSM trees, columnar stores, transactional logs, etc.).
Operated multi-region, ≥99.99 % uptime services.
Experience working with Open Source Systems.
Experience in building K8s controllers and/or CRDs is a plus
Compensation
The estimated pay range for this role is $212,000 - $286,000 depending on experience and location.
Additionally, this role is eligible to participate in Temporal's equity plan.