Sumble

Data Engineer

Posted 2 Hours Ago
Remote
Hiring Remotely in United States
Mid level
The Data Engineer will build scalable data pipelines, develop data warehouse solutions, and define data access standards while managing data integrity and access patterns.
The summary above was generated by AI

Sumble is building a knowledge graph from web data with a first focus on data for go-to-market teams. We use sources such as job posts and resume data to identify org structure, tech stack, and key projects (e.g., GenAI initiatives, cloud migrations). Our product already has strong product-market fit, early revenue, and happy customers, and now we’re ready to accelerate.

Our long-term vision is to become the primary destination for accessing high-quality web data. Try the product at sumble.com.

Our Team

We are a team of 15, including 10 engineers with experience at companies such as Google, Meta, Stack Overflow, and Kaggle.

What we are looking for:

  • Building mission-critical, flexible and scalable data pipelines to move and analyze data at scale, with a focus on reliability, data consistency, and data recovery
  • Exploring and defining standards for data access and analytics patterns
  • Developing data warehouse and data lake solutions that evolve with the size and scale of the company
  • Proficiency in SQL, orchestrators, and data modeling

Our Tech Stack:

  • Languages & Frameworks: Python, FastAPI, React, TypeScript
  • Cloud Platform: Google Cloud Platform (GCP)
  • Databases: PostgreSQL, AlloyDB
  • ML/Data: PyTorch, Hugging Face, vLLM, SkyPilot, Marimo, Prefect
  • Infrastructure: Cloud Run
  • Design: Figma, Vercel V0


Challenges We Tackle:

  • Transforming noisy datasets into high-quality data products
  • Running expensive analytics computations efficiently
  • Managing the complexity of a growing number of data sources, machine learning models, and large data operations
  • Creating a user experience that supports both powerful high-level aggregations and drill-down into the granular underlying source data

Requirements
  • Located within Americas time zones

Benefits
  • Medical, dental, and vision (US)
  • 401k (US)
  • Target 4 weeks PTO
