Vichara Technologies Logo

Vichara Technologies

AI Engineering Lead

Reposted 18 Days Ago
In-Office or Remote
Hiring Remotely in Ridgewood, NJ
Senior level
In-Office or Remote
Hiring Remotely in Ridgewood, NJ
Senior level
Lead the architecture and design of AI systems, integrate and fine-tune models, manage multi-agent frameworks, and collaborate with teams for deployment and observability.
The summary above was generated by AI
Company Description

Vichara is a Financial Services focused products and services firm headquartered in NY and building systems for some of the largest i-banks and hedge funds in the world.

Job Description

Key Responsibilities

πŸ”Ή Architecture & System Design

  • Architect, design, and lead multi-agent LLM systems using LangGraph, LangChain, and Promptfoo for prompt lifecycle management and benchmarking.

  • Build Retrieval-Augmented Generation (RAG) pipelines leveraging hybrid vector search (dense + keyword) using LanceDB, Pinecone, or Elasticsearch.

  • Define system workflows for summarization, query routing, retrieval, and response generation, ensuring minimal latency and high precision.

  • Develop RAG evaluation frameworks combining retrieval precision/recall, hallucination detection, and latency metrics β€” aligned with analyst and business use cases.

πŸ”Ή AI Model Integration & Fine-Tuning

  • Integrate GPT-4o, PaLM 2, and open-weight models (LLaMA, Mistral) for task-specific contextual Q&A.

  • Fine-tune transformer models (BERT, SentenceTransformers) for document classification, summarization, and sentiment analysis.

  • Manage prompt routing and variant testing using Promptfoo or equivalent tools.

πŸ”Ή Agentic AI & Orchestration

  • Implement multi-agent architectures with modular flows β€” enabling task-specific agents for summarization, retrieval, classification, and reasoning.

  • Design fallback and recovery behaviors to ensure robustness in production.

  • Employ LangGraph for parallel and stateful agent orchestration, error recovery, and deterministic flow control.

πŸ”Ή Data Engineering & RAG Infrastructure

  • Architect ingestion pipelines for structured and unstructured data β€” including financial statements, filings, and PDF documents.

  • Leverage MongoDB for metadata storage and Redis Streams for async task execution and caching.

  • Implement vector-based search and retrieval layers for high-throughput and low-latency AI systems.

πŸ”Ή Observability & Production Deployment

  • Deploy end-to-end AI systems on AWS EKS / Azure Kubernetes Service, integrated with CI/CD pipelines (Azure DevOps).

  • Build comprehensive monitoring dashboards using OpenTelemetry and Signoz, tracking latency, retrieval precision, and application health.

  • Enforce testing and regression validation using golden datasets and structured assertion checks for all LLM responses.

πŸ”Ή Cross-functional Collaboration

  • Collaborate with DevOps, MLOps, and application development teams to integrate AI APIs with React / FastAPI-based user interfaces.

  • Work with business analysts to translate credit, compliance, and customer-support requirements into actionable AI agent workflows.

  • Mentor a small team of GenAI developers and data engineers in RAG, embeddings, and orchestration techniques.

Qualifications

  • Experience:
    • 5+ years as an AI or ML Engineer
  • Required Skills & Experience

  • LLMs & GenAI: GPT-4o, PaLM 2, LangGraph, LangChain, Promptfoo, SentenceTransformers

  • RAG Frameworks: LanceDB, Pinecone, ElasticSearch, FAISS, MongoDB

  • Agentic AI: LangGraph multi-agent orchestration, routing logic, task decomposition

  • Fine-Tuning: BERT / domain-specific transformer tuning, evaluation framework design

  • Infra & MLOps: FastAPI, Docker, Kubernetes (EKS/AKS), Redis Streams, Azure DevOps CI/CD

  • Monitoring: OpenTelemetry, Signoz, Prometheus

  • Languages & Tools: Python, SQL, REST APIs, Git, Pandas, NumPy

  • 🧠 Nice-to-Have Skills

  • Knowledge of Reranker-based retrieval (MiniLM / CrossEncoder)

  • Familiarity with Prompt evaluation and scoring (BLEU, ROUGE, Faithfulness)

  • Domain exposure to Credit Risk, Banking, and Investment Analytics

  • Experience with RAG benchmark automation and model evaluation dashboards

Additional Information

 

 

 

Top Skills

Aws Eks
Azure Devops
Azure Kubernetes Service
Elasticsearch
Git
Lancedb
Langchain
Langgraph
MongoDB
Numpy
Opentelemetry
Pandas
Pinecone
Promptfoo
Python
Redis Streams
Rest Apis
Signoz
SQL

Similar Jobs

Yesterday
Remote
United States
Senior level
Senior level
Artificial Intelligence • Software • Analytics
Lead the development of generative AI tools for fashion, guiding ML engineers, and implementing innovative AI systems and workflows.
Top Skills: AWSAzureClip-Like ArchitecturesCloud Gpu PlatformsComputer VisionDiffusion ModelsGCPGenerative AiLlmsMachine LearningMultimodal AiMultimodal Embeddings
18 Days Ago
Remote
United States
195K-260K Annually
Senior level
195K-260K Annually
Senior level
Computer Vision • Virtual Reality
Lead a cloud-based Platform Engineering team, overseeing backend services, collaborating with international teams, and managing technical roadmaps for the IKEA Kreativ app.
Top Skills: DjangoDockerGCPInfrastructure As CodeKubernetesPostgresPulumiPythonSQLTerraform
24 Days Ago
In-Office or Remote
14 Locations
Senior level
Senior level
Food
Lead the development and deployment of AI and ML solutions, oversee MLOps strategy, and collaborate with data scientists to enhance data-driven products.
Top Skills: AIAzure Ai FoundryAzuremlDatabricksLlmopsMachine LearningAzureMlops

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent β€” not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account