Datadog Logo

Datadog

Staff Applied Scientist - Agentic Interfaces

Posted 5 Hours Ago
Be an Early Applicant
Easy Apply
Hybrid
New York, NY
276K-345K Annually
Expert/Leader
Easy Apply
Hybrid
New York, NY
276K-345K Annually
Expert/Leader
The Staff Applied Scientist will lead the evaluation strategy for AI agent integrations at Datadog, defining metrics and building datasets for quality improvements in agent performance through applied research and technical leadership.
The summary above was generated by AI

Team description

At Datadog, AI agents are becoming first-class consumers of observability, security, and software delivery data — from third-party coding agents like Claude Code, Cursor, and Copilot, to our own Bits SRE, Bits Assistant, and Bits Dev Agent. The Agentic Interfaces team owns the platform that connects these agents to Datadog: the MCP Server, the tools and retrieval surfaces agents call into, and — critically — the evaluation systems that tell us whether an agent's experience on Datadog data is actually getting better over time.

This role is about that last piece. We're hiring a Staff Applied Scientist to define what "good" means for an Agentic interface at Datadog and to build the measurement systems that make it true. "Good" isn't one number — it spans answer quality, tool-selection accuracy, retrieval relevance, latency, token cost, and end-to-end agent success on real customer workflows. You'll design the evals, build the datasets, define the metrics, and partner with the AI engineers on the team to land the platform that lets every product group at Datadog ship integrations that are demonstrably better release over release.

The space is full of open research questions. How do you evaluate an agent end-to-end when the trajectory is non-deterministic? How do you score tool selection when the tool catalog has hundreds of entries and grows weekly? How do you build a measurement system that catches regressions across first-party and third-party agents at once, without each team writing their own harness? If those are the problems you want to spend your time on, come build this with us.

Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. That's okay. If you’re passionate about technology and want to grow your skills, we encourage you to apply.


What You’ll Do:

  • Own the evaluation strategy for Datadog's AI agent integrations. Define the metrics — offline and online, quality and cost, single-turn and trajectory-level — that the team and the broader organization optimize against.

  • Build the eval datasets, golden traces, and regression harnesses that catch quality changes before they hit customers, and make those assets reusable by every team contributing tools to the platform.

  • Drive measurable improvements to retrieval relevance, tool-selection accuracy, and context efficiency, partnering closely with the AI engineers on the team who build the underlying platform.

  • Run applied research on the open problems in agent–data interaction: tool selection under large catalogs, multi-turn agent evaluation, grounding and hallucination control on live telemetry, cost/quality tradeoffs at scale.

  • Partner with the Bits SRE, Bits Assistant, and Bits Dev Agent teams so first-party agents benefit from the same measurement substrate as third-party integrations, and so learnings move freely in both directions.

  • Provide technical leadership across the Agentic Interfaces team and the broader organization through design reviews, working groups, and mentorship, and represent the team externally through talks, blog posts, and contributions to the open agent ecosystem.

Who You Are:

  • You have a BS/MS/PhD in a scientific field, or equivalent experience.

  • 10+ years of relevant engineering or applied science experience, including time as a technical lead.

  • Proven track record of leading ML or GenAI initiatives in a product-driven environment, from research through production.

  • Significant experience with evaluation, experimentation, or measurement of ML systems at scale.

  • You bring a strong product mindset and are comfortable driving initiatives across cross-functional teams.

  • You thrive in ambiguity and can make sound technical calls when the path isn’t yet defined.

Benefits and Growth:

  • New hire stock equity (RSUs) and employee stock purchase plan (ESPP)

  • Continuous professional development, product training, and career pathing

  • An inclusive company culture, giving programs, and the ability to join our Community Guilds (Datadog employee resource groups)

  • Competitive global benefits and global Spring Health benefits for employees and dependents age 6+

#LI-Onsite

Datadog offers a competitive salary and equity package, and may include variable compensation. Actual compensation is based on factors such as the candidate's skills, qualifications, and experience. In addition, Datadog offers a wide range of best in class, comprehensive and inclusive employee benefits for this role including healthcare, dental, parental planning, and mental health benefits, a 401(k) plan and match, paid time off, fitness reimbursements, and a discounted employee stock purchase plan.

The reasonably estimated yearly salary for this role at Datadog is:
$276,000$345,000 USD

About Datadog: 

Datadog is the leading observability and security platform for the AI era, providing businesses with unified visibility across the technology stack to manage complexity at scale. It brings applications, infrastructure, data, models, and security into one place, using AI to detect and resolve issues before they impact customers. Trusted globally by Fortune 500 companies and high-growth AI leaders, Datadog enables businesses to move faster with clarity and confidence. Learn more about #DatadogLife on Instagram, LinkedIn, and Datadog Learning Center.

Equal Opportunity at Datadog:

Datadog is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and other characteristics protected by law. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. Here are our Candidate Legal Notices for your reference. 

Datadog endeavors to make our Careers Page accessible to all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please complete this form. This form is for accommodation requests only and cannot be used to inquire about the status of applications. 

Privacy and AI Guidelines:

Any information you submit to Datadog as part of your application will be processed in accordance with Datadog’s Applicant and Candidate Privacy Notice. For information on our AI policy, please visit Interviewing at Datadog AI Guidelines.

Similar Jobs at Datadog

An Hour Ago
Easy Apply
Hybrid
Easy Apply
211K-309K Annually
Senior level
211K-309K Annually
Senior level
Artificial Intelligence • Cloud • Security • Software • Cybersecurity
The Senior Manager, Product Solutions Architecture at Datadog oversees onboarding and enabling partners, collaborates on technical initiatives, and advocates for partners' needs while enhancing service offerings using Datadog features.
Top Skills: AlibabaAnsibleAppdAWSAzureChefCloud FoundryDatadogDockerDynatraceGCPGitlabGoJavaScriptJenkinsKubernetesNew RelicOpenshiftPerlPHPPuppetPythonRubySplunkTerraform
An Hour Ago
Easy Apply
Hybrid
Easy Apply
300K-484K Annually
Expert/Leader
300K-484K Annually
Expert/Leader
Artificial Intelligence • Cloud • Security • Software • Cybersecurity
The Distinguished Architect will provide technical leadership in AI, guide infrastructure discussions, and collaborate with stakeholders on AI observability solutions, requiring extensive experience and strategic thinking.
Top Skills: Ai/LlmDistributed Systems ArchitectureGpusHigh-Performance ComputingOrchestration FrameworksTpus
19 Hours Ago
Easy Apply
Hybrid
Easy Apply
135K-150K Annually
Senior level
135K-150K Annually
Senior level
Artificial Intelligence • Cloud • Security • Software • Cybersecurity
The Strategic Account Executive is responsible for targeting and closing new business with Datadog's largest customers, maintaining relationships, and understanding their business needs while negotiating favorable terms.

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account