Dandelion Health Inc Logo

Dandelion Health Inc

Software Engineer

Posted Yesterday
Remote
Hiring Remotely in USA
135K-150K Annually
Mid level
Remote
Hiring Remotely in USA
135K-150K Annually
Mid level
Build and optimize high-throughput de-identification pipelines for large clinical datasets, execute QA, run and tune pipelines in health system cloud environments, reduce errors and costs, and collaborate with privacy and clinical informatics stakeholders.
The summary above was generated by AI

Our Team

Dandelion Health was founded in 2020 by experts in health tech, hospital systems, academia, and clinical AI. We are building the world’s largest AI training and clinical development platform. Today, we pride ourselves on our ability to make data access as easy as possible for AI developers, pharma, and medical devices, while raising the bar for patient safety and data quality. Tomorrow, we will be the place where any healthcare organization can go to build a responsible clinical AI product. Our culture is all about learning from data and improving, so we can help our clients improve health through AI. Meet the rest of our team here.

Our Data

We partner with health systems to safely and ethically make their de-identified patient data available to AI developers. Currently, the data is acquired from Sharp HealthCare, Sanford Health, and Texas Health Resources – with two additional U.S. health systems joining soon.

We have clinical data dating back to July 1, 2016. This data represents over 10 million patients and includes but is not limited to:

  • Structured data (e.g., 100% of the EMR, including some claims)

  • Unstructured text (e.g., clinical notes, radiology reports)

  • Images (e.g., DICOM, pathology)

  • Video

  • Waveforms

  • Continuous streaming monitoring data

Your Role

Dandelion is constantly expanding the breadth, depth, and completeness of health system datasets while improving the speed and quality of our de-identification pipeline. As an engineer working on our de-identification pipelines, you will:

  • Design and implement software systems that perform these de-identification rules at high scale and throughput (we de-identify billions of rows of data and millions of images each month) while constraining costs.

  • Generate and execute quality assurance plans to validate our de-identification processes.

  • Run de-identification pipelines in health system cloud environments, and optimize these pipelines to minimize error rates, improve processing efficiency, and reduce manual effort and cost.

  • Partner with our Director of Privacy and Clinical Informaticists to define de-identification rules.

Required technical skills

  • 3+ years of development experience in Python or an equivalent language in a professional setting, across the full software development lifecycle (design, implementation, testing, deployment, maintenance);

  • Familiarity with one or more command languages (e.g. Bash) and SQL.

Required Non-technical skills

  • Demonstrated ability to design and improve workflows, including associated operating procedures, cost management, and quality assurance;

  • Strong analytical decision-making and organizational skills;

  • Perseverance and practical problem solving;

  • Humility and strong team collaboration;

  • Enthusiasm about protecting patients’ personal data.

We are an AWS and Python shop, and our datasets are stored in AWS Redshift, Snowflake, or Parquet files which are processed in Pandas DataFrames.

Preferred skills

  • Proficiency with data structures such as Pandas DataFrames;

  • Previous software deployment in a cloud computing environment (e.g., AWS, Azure);

  • Familiarity with virtualization and containerization (e.g., Docker, VMware);

  • Prior experience working with healthcare data;

  • Experience interacting with non-technical stakeholders to deploy software solutions.

Team Benefits

  • Remote work and flexible hours. Availability needed for meetings, which we try to keep to a healthy minimum

  • Complete wellness benefits including healthcare, dental, vision, PTO, sick days and more. Ask for details

  • Professional development days to build your skills

  • Collegial work environment

  • Academic bent towards inquiry and problem solving but start-up speed and flexibility

  • Great balance of focus time to work on projects but easy to access team members to discuss issues and work collaboratively

  • Dandelion is a mission-driven company that is focused on improving patient care

Similar Jobs

3 Hours Ago
In-Office or Remote
118K-232K Annually
Senior level
118K-232K Annually
Senior level
Aerospace • Information Technology • Software • Cybersecurity • Design • Defense • Manufacturing
Develop, deploy, and maintain RTOS and Linux (Yocto) configurations and hypervisor (Xen/KVM) integrations for safety-critical systems. Lead technical design, testing, performance analysis, and verification activities. Provide subject-matter expertise, tooling/process improvements, and support software lifecycle from requirements through deployment.
Top Skills: Arinc 653BazelBootloadersBuildrootCC++ClangDevice DriversDockerGccGitGitlab CiKvmLinux KernelPodmanPosixPythonRtosRustXenYoctoYocto Linux
2 Days Ago
Easy Apply
Remote
United States
Easy Apply
142K-210K Annually
Junior
142K-210K Annually
Junior
Big Data • Fintech • Mobile • Payments • Financial Services
Build and operate ML training and serving infrastructure by designing, developing, and launching backend systems. Collaborate across teams to decompose projects, support operations and on-call, create monitoring and metrics, perform code reviews, and contribute to developer velocity and platform reliability.
Top Skills: AWSKotlinKubernetesMySQLPython
2 Days Ago
Remote or Hybrid
201K-352K Annually
Senior level
201K-352K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Design, build, and operate production-grade agentic AI systems: multi-agent orchestration, enterprise-grounded reasoning using CMDB/Knowledge Graph, retrieval/RAG pipelines, model integration with frontier SDKs, and trust/safety/governance. Lead architecture, code quality, and mentor engineers for scalable, safe autonomous agents.
Top Skills: AnthropicC++CmdbGoGoogle (Ai Sdks)Hybrid SearchInference OptimizationJavaKnowledge GraphLlm Fine-TuningMlopsModel ObservabilityOpenaiPrompt EngineeringPythonRagRe-RankingRetrieval Evaluation MetricsVector StoresWorkflow Data Fabric

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account