{"jobs":[{"absolute_url":"https://job-boards.greenhouse.io/foresitelabs/jobs/8544870002","data_compliance":[{"type":"gdpr","requires_consent":false,"requires_processing_consent":false,"requires_retention_consent":false,"retention_period":null,"demographic_data_consent_applies":false}],"internal_job_id":6410643002,"location":{"name":"San Francisco, CA"},"metadata":null,"id":8544870002,"updated_at":"2026-05-12T18:37:23-04:00","requisition_id":"144","title":"Senior Data Engineer (5+ years) ","company_name":"Foresite Labs","first_published":"2026-05-12T18:37:23-04:00","language":"en","application_deadline":null,"content":"\u0026lt;p\u0026gt;Foresite Labs is a translational R\u0026amp;amp;D team that derives insights from precision measurement and population-scale biology and genetics to address unmet clinical needs. We use human genetics to systematically dissect and understand human disease biology and develop and critically evaluate therapeutic hypotheses. We engage in translational research, transforming basic insights into therapeutic opportunities. Our work supports drug discovery and company formation, and provides the core around which new ideas are realized and incubated. We offer competitive salaries, excellent benefits, a flexible work environment, and the opportunity to learn from top thinkers in various disciplines. Foresite Labs is headquartered in San Francisco and Boston.\u0026amp;nbsp;\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;\u0026lt;strong\u0026gt;What You’ll Do\u0026lt;/strong\u0026gt;\u0026lt;/p\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Build and own production data infrastructure. 
Design, implement, and operate deterministic data pipelines that feed intelligence layers; ingest clinical, financial, scientific, and commercial data from REST APIs, XML feeds, and file-based sources into clean, queryable analytical layers; own the full lifecycle: pagination, rate limiting, auth, schema drift, idempotency, retries, monitoring, and alerting.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Model and curate high-quality data assets. Perform entity resolution, schema design, and quality enforcement across disparate and heterogeneous data sources; ensure downstream models, agents, and dashboards operate on clean and trustworthy data.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Support AI-native workflows. Build vector infrastructure (e.g., embeddings, indexing, retrieval) and structured data interfaces that GenAI agents and LLM orchestrators depend on; ensure AI layers have the right data in the right shape at the right time.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Uphold high engineering standards and collaborate broadly. 
Lead code and design reviews, establish testing and observability best practices, and mentor peers; partner with ML engineers, computational biologists, and company founders to translate scientific and business goals into maintainable and scalable technical solutions.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Leverage agentic coding tools (Claude Code, Codex, or similar) to accelerate prototyping, refactoring, and debugging.\u0026amp;nbsp;\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;p\u0026gt;\u0026lt;strong\u0026gt;What You’ll Bring\u0026lt;/strong\u0026gt;\u0026lt;/p\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;5+ years of professional data engineering experience designing, building, and operating production pipelines end-to-end - including schema design for analytical workloads, entity resolution across messy real-world sources, and data quality enforcement at the pipeline level.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Deep fluency in Python and SQL, writing performant, well-tested data transformation code (dbt or similar), with production experience in pipeline orchestration (e.g., Airflow, Prefect) covering DAG design, scheduling, dependency management, retry/backfill patterns, and alerting.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Hands-on cloud data stack experience across AWS or GCP (e.g., managed Postgres, object storage, query engines, serverless ETL) and working knowledge of IAM, networking, and infrastructure patterns; comfortable with Spark or Trino at scale over Parquet/Iceberg. 
Terraform, CDK, or Pulumi experience is a plus.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Exposure to vector databases (Pinecone, pgvector, Weaviate, or similar) with an understanding of how embedding-based retrieval fits into LLM-powered applications.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Comfortable building and navigating Unix environments, containers (Docker), and CI/CD pipelines (GitHub Actions or similar).\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Mindset for rapid and early‑stage execution: bias for action, ownership of ambiguous problem spaces, and demonstrated ability to prioritize and wear multiple hats.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Strong written and verbal communication skills - comfortable explaining complex ideas clearly to technical and non‑technical partners.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;p\u0026gt;\u0026lt;strong\u0026gt;Nice to Have\u0026lt;/strong\u0026gt;\u0026lt;/p\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Familiarity with biomedical or life sciences data (e.g., clinical trials, genomics, drug discovery, pharma commercial data).\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Familiarity with LLM application patterns: prompting, context engineering, tool use, structured outputs, retrieval-augmented generation (RAG).\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Experience with dashboard/BI tooling (Streamlit, Retool, Metabase, or similar).\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;p\u0026gt;\u0026lt;strong\u0026gt;Why Foresite Labs\u0026lt;/strong\u0026gt;\u0026lt;/p\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Impact. Your code will accelerate discovery and bring novel therapeutics closer to patients and will help AI driven scientific innovation.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Autonomy. Small, senior teams give you room to architect and own end‑to‑end solutions without heavy bureaucracy.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Learning. 
Work at the intersection of software, cloud infrastructure, and biology alongside experts in each domain.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Flexibility \u0026amp;amp; Benefits. Competitive salary and equity, comprehensive healthcare, generous PTO, and hybrid/remote options aligned to our SF or Boston hubs.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Ready to build? Apply today and help us create the software foundation that powers the next generation of companies advancing biotech and scientific innovation.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;p\u0026gt;Location: San Francisco, CA\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;Salary range: $185,000 - $221,400\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;\u0026lt;em\u0026gt;Foresite Labs is an equal opportunity employer. We thrive on diversity and collaboration.\u0026lt;/em\u0026gt;\u0026lt;/p\u0026gt;","departments":[{"id":4035591002,"name":"Platform Team","child_ids":[],"parent_id":null}],"offices":[{"id":4018930002,"name":"Labs - SF","location":"San Francisco, California, United States","child_ids":[],"parent_id":null}]},{"absolute_url":"https://job-boards.greenhouse.io/foresitelabs/jobs/8533284002","data_compliance":[{"type":"gdpr","requires_consent":false,"requires_processing_consent":false,"requires_retention_consent":false,"retention_period":null,"demographic_data_consent_applies":false}],"education":"education_required","internal_job_id":6407865002,"location":{"name":"San Francisco, CA"},"metadata":null,"id":8533284002,"updated_at":"2026-05-30T12:15:54-04:00","requisition_id":"143","title":"Senior Software Engineer (5+ years)","company_name":"Foresite Labs","first_published":"2026-05-01T14:25:06-04:00","language":"en","application_deadline":null,"content":"\u0026lt;p\u0026gt;\u0026lt;strong\u0026gt;About Foresite Labs\u0026lt;/strong\u0026gt;\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;Foresite Labs is a translational R\u0026amp;amp;D team that derives insights from precision measurement and 
population-scale biology and genetics to address unmet clinical needs. We use human genetics to systematically dissect and understand human disease biology and develop and critically evaluate therapeutic hypotheses. We engage in translational research, transforming basic insights into therapeutic opportunities. Our work supports drug discovery and company formation, and provides the core around which new ideas are realized and incubated. We offer competitive salaries, excellent benefits, a flexible work environment, and the opportunity to learn from top thinkers in various disciplines. Foresite Labs is headquartered in San Francisco and Boston.\u0026amp;nbsp;\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;\u0026amp;nbsp;\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;\u0026lt;strong\u0026gt;What You’ll Do\u0026lt;/strong\u0026gt;\u0026lt;/p\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Build from scratch. Design, implement, and own production‑grade software systems, services, data pipelines, internal tools, often starting from scratch and evolving them through MVP, scale‑up, and sustained operation.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Ship features end-to-end. Work across the stack, with meaningful ownership of frontend applications built in React and Tailwind CSS as well as backend services and APIs, to deliver product value quickly and iterate with feedback.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Create polished user experiences. Translate ambiguous user and business needs into intuitive, performant interfaces that make complex scientific and operational workflows easier to use.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Use modern AI-native development workflows and agentic coding tools, such as Claude Code or Codex, to accelerate prototyping, implementation, refactoring, and debugging\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Uphold high engineering standards. 
Lead code and design reviews, establish testing and monitoring best practices, and collaborate with and mentor peers in clean, maintainable software.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Collaborate broadly. Pair with machine learning engineers and scientists, computational biologists, and portfolio company founders to translate scientific or business goals into robust technical solutions.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;p\u0026gt;\u0026amp;nbsp;\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;\u0026lt;strong\u0026gt;What You’ll Bring\u0026lt;/strong\u0026gt;\u0026lt;/p\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;5+ years of professional software engineering experience, including shipping greenfield systems you personally bootstrapped and supported in production.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Fluency in TypeScript and at least one modern backend language (Python, Go, Java, etc.), with strong experience building production full-stack applications.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Working knowledge of AWS (or GCP/Azure) core services and patterns—VPC, IAM, ECS/EKS/Lambda, RDS, S3—and experience expressing infrastructure as code with Terraform, CDK, or Pulumi a plus.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Comfort navigating Unix environments, containers (Docker), and CI/CD pipelines (GitHub Actions, GitLab, or similar).\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Hands-on experience with LLM systems and agentic application frameworks, including familiarity with tools such as LangChain and Pydantic AI, and an understanding of how to build production-oriented workflows involving tool use, structured data, multi-step reasoning, iterative loops, testing, and orchestration.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Mindset for early‑stage execution: bias for action, ownership of ambiguous problem spaces, and ability to wear multiple hats without extensive support 
structures.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Excellent written and verbal communication skills; you explain complex ideas clearly to technical and non‑technical partners.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;p\u0026gt;\u0026lt;br\u0026gt;\u0026lt;strong\u0026gt;Why Foresite Labs\u0026lt;/strong\u0026gt;\u0026lt;/p\u0026gt;\n\u0026lt;ul\u0026gt;\n\u0026lt;li\u0026gt;Impact. Your code will accelerate discovery and bring novel therapeutics closer to patients and will help AI driven scientific innovation.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Autonomy. Small, senior teams give you room to architect and own end‑to‑end solutions without heavy bureaucracy.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Learning. Work at the intersection of software, cloud infrastructure, and biology alongside experts in each domain.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Flexibility \u0026amp;amp; Benefits. Competitive salary and equity, comprehensive healthcare, generous PTO, and hybrid/remote options aligned to our SF or Boston hubs.\u0026lt;/li\u0026gt;\n\u0026lt;li\u0026gt;Ready to build? Apply today and help us create the software foundation that powers the next generation of companies advancing biotech and scientific innovation.\u0026lt;/li\u0026gt;\n\u0026lt;/ul\u0026gt;\n\u0026lt;p\u0026gt;\u0026amp;nbsp;\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;Location: San Francisco, CA\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;Salary range: $185,000 - $221,400\u0026lt;/p\u0026gt;\n\u0026lt;p\u0026gt;\u0026lt;em\u0026gt;Foresite Labs is an equal opportunity employer. We thrive on diversity and collaboration.\u0026lt;/em\u0026gt;\u0026lt;/p\u0026gt;","departments":[{"id":4035591002,"name":"Platform Team","child_ids":[],"parent_id":null}],"offices":[{"id":4018930002,"name":"Labs - SF","location":"San Francisco, California, United States","child_ids":[],"parent_id":null}]}],"meta":{"total":2}}