Design and build scalable data lakes, warehouses, and lakehouses; implement Python ETL/ELT pipelines and Airflow orchestration; ingest data from third-party APIs; optimize columnar storage and SQL performance; support ML/AI initiatives; consult with stakeholders to translate business goals into data architecture and infrastructure using cloud and IaC tools.
AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI/ML, and our people-first culture has earned us multiple Best Place to Work awards.
WHY JOIN US
If you're looking for a place to grow, make an impact, and work with people who care, we'd love to meet you!
ABOUT THE ROLE
We are looking for a Senior Data Engineer to design and build scalable data lakes, warehouses, and lakehouse architectures supporting a thematic research platform that processes large volumes of financial data daily. You will implement Python-based ETL/ELT pipelines, orchestrate workflows with Airflow, develop ingestion workflows from third-party APIs, and work with Snowflake, Spark, and AWS to deliver high-performance data infrastructure. The role combines hands-on engineering with technical consulting responsibilities, translating business goals into data architecture roadmaps.
WHAT YOU WILL DO
- Design and implement Python Data Engineering solutions;
- Design and build scalable Data Lakes, Data Warehouses, and Data Lakehouses;
- Design and implement robust ETL/ELT processes at scale using Python, incorporating modern pipeline orchestration tools like Airflow;
- Develop sophisticated ingestion workflows from diverse 3rd party APIs and data sources;
- Manage and optimize various file formats (Parquet, Avro, ORC) and columnar storage to ensure high-performance data retrieval;
- Work with AI development tools to support and accelerate ongoing development, machine learning initiatives and advanced analytics;
- Act as a technical consultant for stakeholders and leadership to gather requirements, understand business goals, and translate them into technical roadmaps;
- Work with Terraform and other tools to build AWS and on-prem infrastructure.
MUST HAVES
- You must be authorized to work for ANY employer in the US (e.g., Green card holders, TN visa holders, GC EAD, H4 EAD, U4U with EAD), as we are unable to sponsor or take over employment visa sponsorship at this time;
- Bachelor’s degree in computer science/engineering or other technical field, or equivalent experience;
- 5+ years of experience with Python;
- 5+ years of experience with data processing, manipulation, and analytics libraries like Pandas, Polars, PySpark or DuckDB;
- 2+ years of experience with Big Data technologies (Spark, Snowflake);
- Expert-level knowledge of pipeline orchestration using Airflow or similar industry-standard tools;
- Deep understanding of Medallion Architecture, columnar file formats, and diverse database technologies (SQL, NoSQL, and Lakehouse architectures);
- Proven ability to work with 3rd party APIs for complex data ingestion tasks;
- Proficiency with modern Cloud platforms (AWS, GCP, Snowflake) and advanced SQL optimization;
- Exceptional soft skills with a proven ability to gather requirements from leadership and collaborate effectively across cross-functional teams;
- Excellence in optimizing complex data pipelines and troubleshooting data latency or consistency issues in massive datasets;
- A self-starter mindset, regularly investigating more efficient data architectures and AI development tools to improve pipeline performance;
- Taking pride in data integrity and the accuracy of the end-to-end pipelines and architectures you build;
- Strong communication skills for seamless global collaboration with stakeholders and distributed teams;
- Upper-intermediate English level.
NICE TO HAVES
- Familiarity with the fintech industry, understanding of financial data, regulatory requirements, and business processes specific to the domain;
- Documentation skills to document data pipelines, architecture designs, and best practices for knowledge sharing and future reference;
- OpenSearch, Elasticsearch;
- AWS Sagemaker Studio, Jupyter for analyze data;
- Terraform;
- Scala.
PERKS AND BENEFITS
- Professional growth: Mentorship, TechTalks, and personalized growth roadmaps.
- Competitive compensation: USD-based pay with education, fitness, and team activity budgets.
- Exciting projects: Modern solutions with Fortune 500 and top product companies.
- Flextime: Flexible schedule with remote and office options.
Similar Jobs
Software
Design and build scalable data lakes, warehouses, and lakehouses; implement Python ETL/ELT pipelines and Airflow orchestration; ingest from third-party APIs; optimize columnar file formats and SQL performance; support ML initiatives; consult with stakeholders to translate business goals into data architecture roadmaps; work with Spark, Snowflake, and cloud infrastructure (AWS/GCP).
Top Skills:
AirflowAvroAWSDuckdbGCPNoSQLOrcPandasParquetPolarsPysparkPythonSnowflakeSparkSQL
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Designs and delivers onboarding for global enterprise sellers via the Sales Academy. Facilitates training, coaches sellers, reinforces sales methodology and AI-native tools, mentors trainers, leverages data/AI to personalize learning, manages learner progression and readiness reporting, and partners with GTM teams to align launch and readiness across time zones.
Top Skills:
Ai ToolsDynamicsIntrepidLms PlatformsMicrosoftMicrosoft TeamsSalesforceSeismicZoom
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Design, build, deploy, and operate high-scale, multi-tenant Kubernetes services. Write clean, testable code, collaborate with product owners, integrate AI tools, mentor engineers, and improve performance, observability, and data processing pipelines.
Top Skills:
Build ToolsClaudecodeDebuggerDevinFlinkGoIdeJavaKafkaKubernetesMicroservicesMulti-Tenant ArchitectureObservability StackProfilersServicenowSource Control (Git)Unix
What you need to know about the Los Angeles Tech Scene
Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.
Key Facts About Los Angeles Tech
- Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
- Key Industries: Artificial intelligence, adtech, media, software, game development
- Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
- Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

