Summa Linguae Technologies Logo

Summa Linguae Technologies

Technical Project Manager - AI Data Processing

Posted Yesterday
Remote
Hiring Remotely in United States
Mid level
Remote
Hiring Remotely in United States
Mid level
The Technical Project Manager will coordinate data deliveries, automate validation processes, and work closely with engineering and operations teams, ensuring quality compliance of datasets.
The summary above was generated by AI

Technical Project Manager - AI Data Processing 

Work model: Remote (covering PST hours) - Based in PST timezone
Employment type: Full-time (Contract or Employment)

About DATAmundi 

DATAmundi builds advanced software solutions that power our localization and data services. We support AI companies and research teams by delivering high-quality datasets, validation workflows, and scalable data processing. Our R&D initiatives explore how modern AI systems — including LLMs, speech models, and multimodal systems — can be evaluated, improved, and safely deployed through structured data and validation methodologies. 

We are expanding our R&D activities and seeking researchers to collaborate on applied research and technical outreach within the AI ecosystem. 

About the role 

We’re hiring a Technical Project Manager with a hands-on, data engineering skillset to support Data Processing for AI. You’ll translate client requirements into executable validation logic for data processing workflowssupport data post processing, and help ensure delivery data is consistent, measurable, and within client metrics and guidelines. 

This role sits at the intersection of technical delivery, data QA, and automationYou’ll work closely with development teams and support the Operations Team while also writing/maintaining lightweight scripts and queries that turn requirements into quality assurance checks. 

 

What you’ll do 

  • Own the end-to-end coordination of data deliveries, from intake to validation and handoff. 
  • Work with client data delivered via S3 buckets or direct uploads; ensure correct structure, completeness, and readiness for downstream use. 
  • Translate client guidelineinto automated validation using SQL, regex, and supporting scripts. 
  • Create and  to compute quality and consistency metrics such as: 
  • WER (Word Error Rate) maintain Python utilities
  • IAA (Inter-Annotator Agreement) 
  • Additional dataset-level metrics as required 
  • Use Windows Command Prompt for bulk file operations (creating/moving/downloading folders and files) to support processing and delivery workflows. 
  • Partner with internal development teams by writing Jira tickets for platform improvements and bug fixes (requirements, steps to reproduce, acceptance criteria). 
  • Quickly ramp on internal platforms and configuration logic (e.g., worktypes / templates), advising on setup patterns and tradeoffs. 
  • Investigate issues by querying datasets through database tools and producing clear summaries of findings and next steps. 

 

What we’re looking for 

  • years experience in technical delivery / project coordination in a data environment (data, analytics, ML, QA automation, or platform operations. 
  • Practical comfort with: 
  • Python (scripting for metrics and data validation workflows) 
  • SQL (queries used in automated checks) 
  • Regex (pattern-based validation) 
  • Command line / Windows CMD (bulk file operations) 
  • Strong written communication and the ability to convert fuzzy requirements into precise, testable checks. 
  • Experience working with engineering teams and using tools like Jira to drive execution. 
  • Ability to evaluate options and recommend an approach based on pros/cons, timelines, and maintainability. 

 

Nice to have 

  • Experience with speech/audio or text datasets (given WER and annotation agreement use cases). 
  • Familiarity with cloud data workflows (especially AWS S3 concepts like buckets, prefixes, access patterns). 
  • Experience with data labeling/annotation workflows and quality frameworks. 

Top Skills

Aws S3
JIRA
Python
Regex
SQL
Windows Command Prompt

Similar Jobs

4 Minutes Ago
Easy Apply
Remote
United States of America
Easy Apply
155K-220K Annually
Expert/Leader
155K-220K Annually
Expert/Leader
Cloud • Information Technology
The Principal Software Engineer will tackle platform-level challenges, assist team members, write secure and scalable code, and develop features for cloud storage solutions.
Top Skills: AIAnsibleBashCephContainerdDhcpDnsGoGrafanaHelmHelmfileKubernetesLdapLokiPostgresPrometheusPxeTerraform
4 Minutes Ago
Easy Apply
Remote
United States of America
Easy Apply
Senior level
Senior level
Cloud • Information Technology
The Senior Industry Marketing Manager will drive marketing strategy for AI/ML, develop initiatives, create content, and collaborate with sales teams.
Top Skills: AICloud StorageMarketing
5 Minutes Ago
Remote
USA
120K-210K Annually
Senior level
120K-210K Annually
Senior level
Healthtech • Pet
As a Senior Product Manager, own product strategy and roadmap, lead cross-functional initiatives, analyze performance metrics, and mentor junior team members while improving user experience and operational efficiency.
Top Skills: Data AnalyticsProduct ManagementSaaS

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account