VoxelCloud Logo

VoxelCloud

Data Engineer Intern

Posted 8 Days Ago
In-Office
Los Angeles, CA, USA
Internship
In-Office
Los Angeles, CA, USA
Internship
Support the Data team by building and maintaining scalable ETL pipelines and data infrastructure for multi-modal medical datasets, automate data QA, optimize data delivery on cloud platforms, and provide analytics tools and secure cross-border data storage.
The summary above was generated by AI
Company Description

Founded in 2016, VoxelCloud is a Los Angeles-based leader worldwide in artificial intelligence (AI) analysis of medical images.  Backed by Sequoia and Tencent.  We help healthcare providers make better/earlier diagnoses and other clinical decisions.  http://www.voxelcloud.ai

Job Description

The Data team at VoxelCloud (Westwood, Los Angeles, CA) manages and maintains large-scale medical and healthcare data at the core of all our R&D activities. Reporting to Data Team Lead, the Data Engineer intern will participate in the acquisition and manipulation of massive datasets in multi-modal formats (medical images, text(EMR), etc.) on cloud storage. The ideal candidate is an experienced data pipeline builder and data wrangler who enjoys optimizing data systems and building them from the ground up. The Data Engineer intern will support our software developers and machine learning engineers on product/research initiatives and will create an optimal data delivery pipeline that is consistent across ongoing projects. They must be self-directed and comfortable supporting the data needs of multiple teams, systems, and products. The right candidate will be excited by the prospect of optimizing or even re-designing our company’s data architecture to support our next generation of products and data initiatives.

Responsibilities:

  • Create and maintain optimal data pipelines to support machine learning research and development
  • Identify, design, and implement internal process improvements: automating data QA, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
  • Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS/AliCloud big data technologies.
  • Build analytics tools that utilize the data pipeline to provide actionable insights into product utilization and operational efficiency.
  • Keep our data separated and secure across national boundaries both locally and on cloud storage.

Qualifications

  • Proficient with at least one object-oriented/object function scripting languages: Python, Java, C++, Scala, etc
  • Working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases (Postgres).
  • Experience building and optimizing ‘big data’ data pipelines, architectures and data sets.
  • Solid understanding of information retrieval, statistics and machine learning. Experience with Computer Vision and NLP is a plus.
  • Prefer 1+ years in big data and related technology (e.g. DFS); experience with high-performance and scalable distributed system.
  • Prefer experience with AWS cloud services: EC2, EMR, RDS, Redshift
  • Skillful with automation tasks, but willing to get hands dirty for quality control.
  • Detail-oriented, well organized and self-motivated with a continuous drive to learn, explore and challenge; good communication skills and team player. 
  • Experience supporting and working with cross-functional teams in a dynamic environment.
  • MS, BA/BS degree in computer science, statistics or related field.

Additional Information

We Offer…   

  • An outstanding start-up culture;  
  • Transparent, collaborative work environment; 
  • Competitive compensation
  • Excellent Medical, Dental, and Vision coverage
  • 401k, paid Vacation and Holiday

All your information will be kept confidential according to EEO guidelines.

Similar Jobs

12 Days Ago
In-Office
Internship
Internship
Software
The Software Engineer Intern will enhance an event mining framework for data insights, focusing on automating monitoring, CI/CD integration, and data visualization.
Top Skills: Ci/CdDatadogElk StackGitGrafanaJavaScriptPrometheusPythonSpark
12 Days Ago
In-Office
Internship
Internship
Software
Contribute to the development of metrics dashboards by optimizing image/video search indexing, building iterative search features, and creating smart sampling strategies.
Top Skills: C++GoJavaPython
8 Hours Ago
In-Office
102K-163K Annually
Senior level
102K-163K Annually
Senior level
Cloud • Fintech • Food • Information Technology • Software • Hospitality
The Senior Data Analyst will conduct analyses, automate reporting, and provide insights to inform strategic decisions and business performance across teams.
Top Skills: Bi Tools Like HexLookerNumpyPandasPythonSigmaSQLTableau

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account