Cogstate

Data Engineer

Posted Yesterday
Remote
Hiring Remotely in United States
120K-135K Annually
Mid level
The Data Engineer will build and maintain data infrastructure, design data pipelines, and develop reusable frameworks, leveraging technologies like Azure Databricks and PySpark for data processing and transformation.
The summary above was generated by AI

At Cogstate, we’re advancing the science of brain health - making it faster, easier, and more accurate to assess cognition across clinical trials, healthcare settings, and everyday life.

Our digital cognitive assessments are trusted by researchers, clinicians, and pharmaceutical partners around the world, helping to drive breakthroughs in neuroscience and improve outcomes for people living with neurological conditions. Founded on decades of cognitive science and backed by rigorous validation, Cogstate’s assessments are used in more than 150 countries and over 2,000 clinical trials.

Our global team of experts - spanning psychology, data science, operations, and technology - works together to solve complex challenges in brain health assessment, always with a patient-first mindset. Whether we’re supporting a multinational Alzheimer’s trial or developing tools to bring cognitive testing into routine care, our work is meaningful, collaborative, and constantly evolving. 

At Cogstate, we’re not just imagining the future of brain health - we’re building it.

That’s why we’re seeking a Data Engineer responsible for building and maintaining Cogstate’s data infrastructure using best-practice engineering approaches. The position will play a central role in establishing and maintaining data pipelines and reporting tables using Azure Databricks, working closely with other members of the data and scientific services team and serving as a point of contact for the data platform and associated data reporting.

Core Responsibilities

  • Understand Cogstate data sources and develop data pipelines using Databricks to bring all data into the data lake. 
  • Design, develop, implement, and tune large-scale distributed systems and pipelines that process large volumes of data, focusing on scalability, low latency, and fault tolerance in every system.
  • Develop scalable and reusable frameworks for ingesting data into Azure Databricks, incorporating standards and best practices into engineering solutions.
  • Perform Databricks engineering: query tuning, performance tuning, troubleshooting, and debugging pipelines.
  • Apply a deep understanding of ETL/ELT design methodologies, architecture, strategy, and tactics to complex ETL solutions, including CI/CD.
  • Develop high-performance PySpark scripts to meet enterprise data, BI, data visualization, and analytics needs.
  • Process and transform data using technologies such as Apache Spark, SQL, Python/Scala, and Azure cloud services.
  • Manage code versions in source control and coordinate changes across teams using GitHub.
  • Participate in architecture design and discussions, provide logical and physical data design, and database modelling 
  • Be part of the Agile team to ensure availability of data to internal and external users.
  • Organize and manage data shares.
  • Solve complex data issues around data integration, data quality, and other data processing incidents 
  • Work with business system owners to resolve source data issues and refine transformation rules 

Qualifications

  • BS/BA in Computer Science, Data Science, or a related field, or equivalent relevant experience
  • 2+ years implementing data engineering solutions with PySpark in Databricks
  • Knowledge of relational databases and Apache Spark.
  • Strong knowledge of Databricks configuration, troubleshooting and performance tuning.
  • Experience with testing, automation, and orchestration, including GitHub and Azure Functions.
  • Experience with development tools for CI/CD.
  • Deep expertise in programming languages for data processes (PySpark, Python, Scala).
  • Experience writing complex SQL transformations against relational databases such as SQL Server

What’s In It For You

  • Remote Work Practices: Cogstate is a virtual-first company. Cogstate employees can work from anywhere Cogstate is registered to do business within the United States, Australia, or the United Kingdom!
  • Generous Paid Time-off: Cogstate employees receive 20 days of vacation leave, 10 days of personal leave and 10 paid public holidays.
  • 401(k) Matching: As you invest in yourself and your future, Cogstate invests in you too: we match up to 3% of your yearly salary in Cogstate’s 401(k) program.
  • Competitive Salary: We offer competitive base salaries plus additional earning opportunities based on the position.
  • Health, Dental & Vision Coverage: We've invested in comprehensive health & dental insurance options with competitive company contributions to help when you need it most. We also offer free vision insurance for all full-time employees.
  • Short-Term & Long-Term Disability and Life Insurance: 100% employer sponsored
  • Pre-Tax Benefits: Healthcare and Dependent Care Flexible Spending Accounts
  • Learning & Development Opportunities: Cogstate offers a robust learning program from mentorships to assistance with programs to improve knowledge or obtain certifications in applicable areas of interest.

Wage Range
$120,000-$135,000 USD

Our Culture
We bring our whole selves to work every day. We’re courageous and we deliver together. We’re passionate individuals who enjoy working together. We’re brave enough and care enough to have the right conversations to get the best outcome and are famous for our can-do attitude. We see challenges as opportunities and move with pace to achieve our goals.

If you’re ready to help us in our journey to optimize the measurement of brain health around the world, please apply now!

Applicants with disabilities may be entitled to reasonable accommodation under the terms of the Americans with Disabilities Act and certain state or local laws. A reasonable accommodation is a change in the way things are normally done which will ensure an equal employment opportunity without imposing undue hardship on the company. If you need assistance in applying please email [email protected].

Privacy Notice for Job Applicants

Cogstate is committed to protecting your personal data. We collect and process your information for recruitment purposes in compliance with applicable laws, including the Australian Privacy Principles (APPs), the UK General Data Protection Regulation (UK GDPR), the California Consumer Privacy Act (CCPA), the Virginia Consumer Data Protection Act (VCDPA), the Colorado Privacy Act (CPA), and similar laws in other jurisdictions.

For more information on how we collect, use, and protect your data, and your rights under these laws, you can find Cogstate's privacy policy by clicking here.


Top Skills

Spark
Azure Cloud Services
Azure Databricks
Git
Pyspark
Python
Scala
SQL
