The role involves developing RAG workflows using LLMs, managing vector databases, and optimizing AWS-based microservices for high-volume applications.
Job description:
We are looking for a Senior Software Engineer specializing in Retrieval-Augmented Generation (RAG) systems, with experience in large language models (LLMs), vector databases, and cloud-based microservices. Your role will focus on building, integrating, and optimizing LLM workflows using LangChain and managing complex infrastructure with AWS services like Lambda and ECS. You'll bring expertise in containerized environments, using Docker, and work with vector databases to power data-driven applications. You will report to a Staff Software Engineer and work remote in the United States or hybrid based on proximity to our office.
You'll Have Opportunity To
Required Skills:
We are looking for a Senior Software Engineer specializing in Retrieval-Augmented Generation (RAG) systems, with experience in large language models (LLMs), vector databases, and cloud-based microservices. Your role will focus on building, integrating, and optimizing LLM workflows using LangChain and managing complex infrastructure with AWS services like Lambda and ECS. You'll bring expertise in containerized environments, using Docker, and work with vector databases to power data-driven applications. You will report to a Staff Software Engineer and work remote in the United States or hybrid based on proximity to our office.
You'll Have Opportunity To
- RAG Workflow Development: Design and deploy LLM-driven RAG workflows using LangChain and vector databases to provide high-accuracy data retrieval and enhanced content generation.
- Vector Database Management: Integrate and manage vector databases like Qdrant for optimized, high-speed vector searches and data retrieval.
- Cloud Computing: Use AWS services, including Lambda and ECS, to build serverless architectures and scalable containerized applications.
- API & Backend Development: Build APIs with FastAPI and Uvicorn to support low-latency interactions and handle high traffic volumes.
- Monitoring & Observability: Implement observability best practices using Datadog, ddtrace, and logging tools to maintain performance and troubleshoot complex workflows.
Required Skills:
- Proficiency in LLM and RAG Workflows: experience with LangChain and vector databases, applying RAG techniques for intelligent data retrieval and generation.
- Proficient in AWS environment
- Understanding of MCP Servers
Top Skills
AWS
Datadog
Docker
Ecs
Fastapi
Lambda
Langchain
Python
Uvicorn
Vector Databases
Similar Jobs
Artificial Intelligence • Cloud • Mobile • Security • Software
Manage post-sale customer relationships, ensuring adoption and expansion of Hiya’s SaaS solutions while driving strategic initiatives and revenue growth.
Top Skills:
Data-Driven InsightsSaas Solutions
Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
The Manager, Sales Enablement will develop frameworks and training to maximize revenue growth, manage multiple projects, and collaborate across teams to enhance client success.
Cloud • Fintech • Software • Business Intelligence • Consulting • Financial Services
The Senior Consultant will lead project delivery, manage healthcare strategies for FQHCs, perform analyses, and engage with clients within guidelines.
Top Skills:
ExcelOutlookPowerPointWord
What you need to know about the Los Angeles Tech Scene
Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.
Key Facts About Los Angeles Tech
- Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
- Key Industries: Artificial intelligence, adtech, media, software, game development
- Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
- Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering