Socket (socket.dev) Logo

Socket (socket.dev)

Senior Data Engineer

Reposted 15 Days Ago
Remote
Hiring Remotely in United States
Senior level
Remote
Hiring Remotely in United States
Senior level
Design and build data pipelines and infrastructure. Ensure data flows reliably for real-time analytics, optimize storage and query performance, and collaborate across teams.
The summary above was generated by AI

About Us

Socket helps devs and security teams ship faster by cutting out security busywork. Thousands of orgs use Socket to safely find, audit, and manage open source code. Our customers — from Anthropic to xAI, and Figma to Vercel — love Socket (just check out their tweets to see for yourself!)


Founded by Feross Aboukhadijeh, a long-time open source maintainer with software downloaded over a billion times a month, Socket has raised $65M in funding from top angels, operators, and security leaders.

About the Role

We're looking for a Data Engineer to join our Data Platform team and build the infrastructure that powers Socket's data ecosystem. You'll design and maintain systems that handle billions of records, enable real-time analytics, and power the insights our customers rely on to secure their software supply chain.

This is a high-impact role where you'll work across the stack- from ingestion pipelines to analytics APIs - ensuring data flows reliably and is accessible when teams need it.

What You'll Do

  • Design and build scalable data pipelines that ingest, process, and transform high-volume event streams and historical data

  • Develop and maintain APIs that deliver analytics, trend reports, and drill-down capabilities to internal teams and external customers

  • Build robust infrastructure for data quality monitoring, ensuring accuracy and completeness across customer and artifact datasets

  • Optimize data storage and query performance using systems like ClickHouse, Kafka, NATS, and PostgreSQL to support real-time and batch use cases

  • Implement usage tracking, auditing, and event processing systems that provide visibility into platform behavior

  • Create reliable data ingestion systems for security scan results, SBOM data, and artifact metadata

  • Build infrastructure for outbound integrations that deliver Socket data to customer systems

  • Collaborate with product, security research, and engineering teams to understand data needs and deliver solutions that scale

What You'll Bring

  • 5+ years of professional software engineering experience

  • 3+ years of experience building data pipelines and infrastructure in production environments

  • Strong proficiency in Node.js and TypeScript for backend development

  • Experience with Kafka or other streaming platforms like NATS, RabbitMQ, or Kinesis in event-driven architectures

  • Hands-on experience with ClickHouse or other columnar/OLAP databases like BigQuery, Snowflake, DuckDB, or similar

  • Solid understanding of data modeling, schema design, and query optimization

  • Familiarity with Parquet or other cloud data lake technologies like Delta Lake or Iceberg

  • Experience building REST APIs and data access layers for analytics use cases

  • Comfort working with large-scale distributed systems and debugging performance bottlenecks

  • Strong ownership mindset - you take responsibility for the systems you build and ensure they're reliable

  • Clear communication skills; you can explain technical trade-offs to both engineers and non-technical stakeholders

Nice to Have

  • Experience with time-series data and real-time analytics

  • Familiarity with security or DevOps tooling ecosystems

  • Background working with SBOM formats or supply chain security concepts

  • Experience with data quality frameworks and observability tools

  • Understanding of multi-tenant architectures and data isolation patterns

We know how important clarity is when looking for a new role, so we've put together a read-me about the Interview Process at Socket.

Benefits: Our benefits are crafted to support you and your family, so you can take care of what matters most and thrive in and outside of work. We offer:

  • Market competitive salary bands

  • Meaningful equity program

  • Comprehensive health benefits for you and your family

  • Flexible time-off, holidays, and winter shutdown to rest & recharge

  • Paid parental leave

  • Remote-first, with quarterly team off-sites

At Socket, we

  1. Pursue Excellence: We set ourselves apart by consistently delivering work of exceptional quality and distinction.

  2. Move with urgency and focus: We prioritize swift, decisive action.

  3. Think rigorously: We care about being right and it often takes reasoning from first principles to get there. We value alternative perspectives and have constructive discussions.

  4. Trust and amplify: We overtrust, always assume good intent, and give specific feedback to help each other improve.

  5. Feel a strong sense of ownership: We wear many hats and feel a strong sense of overall ownership of the company and we're non-territorial regarding our nominal domains.

  6. Are customer obsessed: We relentlessly prioritize the needs of our customers, striving to exceed their expectations and delight them at every interaction.

Top Skills

Clickhouse
Delta Lake
Iceberg
Kafka
Nats
Node.js
Parquet
Postgres
Typescript

Similar Jobs

Yesterday
Remote or Hybrid
New York, NY, USA
140K-180K Annually
Senior level
140K-180K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Lead engineering and automation for a data collaboration ecosystem: design secure, scalable Snowflake/Databricks clean room architectures, build ELT pipelines (Snowpark/PySpark), implement MLOps and AI/LLM integrations, enforce RBAC and privacy controls, drive observability, cost optimization, onboarding, and operational excellence for partner-facing data products.
Top Skills: Python,Sql,Snowflake,Snowpark,Databricks,Pyspark,Liveramp,Airflow,Dbt,Great Expectations,Langchain,Llamaindex,Vector Databases,Snowflake Cortex,Rag,Llms
3 Days Ago
In-Office or Remote
Minnetonka, MN, USA
92K-164K Annually
Senior level
92K-164K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Design, build, and maintain scalable batch and streaming data pipelines using Python/PySpark on Databricks or Snowflake. Implement data models (OLTP/OLAP), develop advanced SQL, orchestrate workflows with Airflow, enforce governance via Microsoft Purview, build data quality frameworks, support cloud/DevOps practices, monitor production pipelines, and lead technical design and code reviews.
Top Skills: Python,Pyspark,Databricks,Snowflake,Sql,Apache Airflow,Microsoft Purview,Azure,Aws,Gcp
4 Days Ago
Easy Apply
Remote or Hybrid
USA
Easy Apply
130K-165K Annually
Senior level
130K-165K Annually
Senior level
Artificial Intelligence • Insurance • Machine Learning • Software • Analytics
Lead design and implementation of scalable, HIPAA-compliant data pipelines and platforms for healthcare ML. Build ETL, orchestration, and tooling for processing EHR, claims, pharmacy, and bioinformatics data; collaborate with data scientists to produce modeling-ready datasets and ensure data quality, reliability, and operational excellence.
Top Skills: Python,Sql,Apache Spark (Pyspark),Databricks,Snowflake,Airflow,Dagster,Prefect,Terraform,Docker,Kubernetes,Aws,Dbt,Ci/Cd

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account