xAI Logo

xAI

RL Environments Specialist

Reposted 20 Days Ago
Be an Early Applicant
Easy Apply
Remote
Hiring Remotely in USA
100-200 Hourly
Mid level
Easy Apply
Remote
Hiring Remotely in USA
100-200 Hourly
Mid level
Create full reinforcement learning environments, including UI and backend, and manage task creation and validation processes for training AI agents.
The summary above was generated by AI
About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.


About the Role

We need talented engineers that will create full RL environments (UI, backend, programmatically generate tasks and validation) for training computer use agents. This means that we need you to take ownership of the entire task creation process for a given environment.

In this role, you will
  • Build sandbox UIs that our agents and RL actors will interact with.
  • Create tasks for built environments and programmatically validate task completion.
  • Enjoys working remotely
Qualifications
  • Strong professional experience with React.js (hooks, modern state management, TypeScript preferred) — required
  • Strong professional experience building backend services in Python (FastAPI, Flask, or Django) — required
  • Hands-on experience with containerization (Docker required; Docker Compose/Kubernetes a plus)
  • Strong front-end design skills and exceptionally high taste in UI/UX, polish, and visual detail
  • Proven ability to design a relational database schema in Python and populate it with large-scale, realistic mock data
  • Experience creating and exposing clean, well-documented API endpoints (REST or GraphQL)
  • Exceedingly high standards for code quality, readability, testing, and front-end craftsmanship
  • Extensive day-to-day experience using coding agents / AI assistants as a power user (Cursor, Claude, Copilot, Grok, Aider, etc.)
  • Good understanding of the Reinforcement Learning paradigm (RLHF, PPO, DPO, reward modeling, etc.)
Preferred Qualifications
  • Posses strong logical reasoning skills, is detail-oriented, and thrives in a fast-paced work environment.
  • Eager to teach to and learn from teammates.
  • Enthusiasm to collaboratively build the best truth-seeking AI out there!
Interview Process
  1. Technical hands-on live coding round
  2. Hiring Manager / Final interview round
Location & Other Expectations
  • Tutor roles may be offered as full-time, part-time, or contractor positions, depending on role needs and candidate fit. They may be performed remotely from any location worldwide, subject to legal eligibility, time-zone compatibility, and role specific needs.
  • For US based candidates, please note we are unable to hire in the states of Wyoming and Illinois at this time.
  • We are unable to provide visa sponsorship.
  • For those who will be working from a personal device, your computer must be a Chromebook, Mac with MacOS 11.0 or later, or Windows 10 or later.
Compensation

US based candidates: $35/hour - $100/hour depending on factors including relevant experience, skills, education, geographic location, and qualifications. 

International candidates: Information will be provided to you during the recruitment process.

Benefits

Benefits vary based on employment type, location and jurisdiction. Benefits for eligible U.S. based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role specific information will be provided to you during the interview process.

xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.

Top Skills

Containerization
Python

Similar Jobs

An Hour Ago
Remote or Hybrid
2 Locations
84K-138K Annually
Senior level
84K-138K Annually
Senior level
Automotive • Hardware • Internet of Things • Mobile • Software • App development • PropTech
The Sr. Field Sales Representative is responsible for driving sales growth, managing accounts, training customers, and maintaining strong client relationships across assigned territories.
An Hour Ago
Remote or Hybrid
Florida, USA
Senior level
Senior level
AdTech • Digital Media • Marketing Tech
The role involves providing technical solutions, mentoring sales teams, collaborating on product modifications, conducting product demos, and maintaining product knowledge.
Top Skills: Amazon Web Services (Aws)
An Hour Ago
In-Office or Remote
San Francisco, CA, USA
135K-150K Annually
Senior level
135K-150K Annually
Senior level
Big Data • Information Technology • Software • Analytics • Energy
The Account Director will enhance relationships, sell software solutions, manage accounts, and achieve revenue objectives within the energy sector.
Top Skills: MS OfficeSalesforce

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account