Snapdocs, Inc. Logo

Snapdocs, Inc.

Data Scientist

Posted 6 Days Ago
Remote
Hiring Remotely in United States
Mid level
Remote
Hiring Remotely in United States
Mid level
As a Data Scientist at Snapdocs, you will enhance Document Quality Control services via NLP and Generative AI while collaborating with operations and engineering teams.
The summary above was generated by AI

Snapdocs is a rapidly growing company that is disrupting the residential mortgage market, bringing scalable and sophisticated software to a pillar of the US economy that still relies on fax machines and manila envelopes. Today, 20% of real estate transactions are processed through our platform. Our products rely on carefully designed workflows, AI-based automations, and empathetic user experiences to deliver best-in-class customer experiences. We are backed by investors like Sequoia, Y Combinator, and F-Prime. 

We are an innovative team. As we expand our product offering to serve more customers in more ways, we need to grow our team with smart, hungry, and curious people. That’s where you come in…

About the Role
Snapdocs is looking for a Data Scientist to help us improve and scale our Document Quality Control (QC) services. You’ll partner with our lead data scientists to develop and optimize solutions for document classification, information extraction, and annotation detection—across hundreds of document types.

This is a high-impact role where you’ll apply the latest in NLP and Generative AI to real-world use cases, helping us build reliable, intelligent systems that make sense of messy, unstructured document data.

What You’ll Do

  • Improve the performance and generalizability of Document QC services, including classification, extraction, and annotation detection
  • Apply and test cutting-edge generative AI methods to improve outcomes across varied document types
  • Optimize model performance for new customer document sets
  • Partner with DS Operations to ensure high-quality training and evaluation data
  • Contribute to service repositories and help productionize models in collaboration with Engineering

Initial Priorities
Your first project will be either:

  • Testing, implementing, and optimizing a general classification methodology on a new customer dataset
    or
  • Deploying and refining an information extraction pipeline for a new customer use case

What We’re Looking For:

  • 3+ years in data science, preferably working on ML-driven products
  • Experience with a wide range of machine learning approaches, including LLMs and traditional supervised/unsupervised learning
  • Background in document classification or information extraction (text and images)
  • Experience collaborating with Product and Engineering to ship models into production

Technical Skills:

  • Expert in Python and ML libraries (e.g., Scikit-learn, HuggingFace)
  • Comfortable with SQL and working in production data pipelines
  • Hands-on experience with cloud platforms (AWS, GCP, or Azure)
  • Understanding of service architecture and working with APIs
  • Familiar with CI/CD practices and software engineering fundamentals

Nice to Have:

  • Experience with semantic similarity techniques (e.g., BERT, RAG) for classification
  • Background in prompt engineering and LLM observability
  • Experience monitoring model drift and performance post-deployment

Who You Are

  • A collaborative, thoughtful problem solver
  • Curious and passionate about ML, NLP, and Generative AI
  • Comfortable working on ambiguous problems with imperfect data
  • Driven to turn research into scalable, production-ready systems

Apply Now
If you're excited to build next-gen AI systems that operate at real-world scale, we’d love to hear from you.


At Snapdocs, we believe our differences make us stronger. We’re building a team of curious, driven people from all backgrounds who are united by a shared desire to solve meaningful problems and build something that matters. We value trust, autonomy, and the kind of collaboration that brings out the best ideas—and the best in each other.

To support our team, we offer a comprehensive & thoughtful benefits package for all full-time employees, which includes:

  • Excellent medical, dental, and vision coverage
  • 401(k) with up to 4% company match
  • 16 weeks of paid parental leave
  • Flexible Paid Vacation Time Off + 10 Sick Days for exempt roles
  • Generous Accrued Paid Vacation Time Off + 10 sick days for non-exempt roles
  • Summer & Winter Break (~1-week each) + 9 Holidays per year
  • Healthcare and Dependent Care FSA
  • HSA Employer Contribution ($75-150 for individuals, $150-$250 for families)
  • $15K Family Building Benefit (lifetime limit)
  • Life and Disability Insurance
  • $1,500 Annual Lifestyle Stipend to support your well-being

Please note: Part-time employees are not eligible for benefits at this time

Snapdocs is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. If you have a disability or special need that requires accommodation, please let us know.

California residents applying for positions at Snapdocs are subject to our candidate privacy policy. (www.snapdocs.com/california-candidate-privacy)


Top Skills

AWS
Azure
GCP
Huggingface
Python
Scikit-Learn
SQL

Similar Jobs

6 Days Ago
Easy Apply
Remote or Hybrid
US
Easy Apply
168K-277K
Senior level
168K-277K
Senior level
Marketing Tech • Social Media • Software • Analytics • Business Intelligence
Lead end-to-end model development, drive business-focused data science solutions, mentor junior data scientists, and shape analytics strategy. Collaborate cross-functionally to solve complex business challenges.
Top Skills: AWSAzureGCPPythonSQLTableau
6 Days Ago
Remote or Hybrid
7 Locations
110K-180K Annually
Mid level
110K-180K Annually
Mid level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
As a Data Scientist III, you will apply machine learning and data science skills to enhance cybersecurity technologies, identify threats, and improve ML models.
Top Skills: AWSMachine LearningPython
6 Days Ago
Remote
United States
145K-196K Annually
Mid level
145K-196K Annually
Mid level
Artificial Intelligence • Cloud • Consumer Web • Productivity • Software • App development • Data Privacy
The Data Scientist will analyze user behavior, develop segmentation strategies, and create dashboards to optimize product engagement and revenue.
Top Skills: HadoopPythonRSQL

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account