Do you want to work on cutting-edge projects with the world’s best IT engineers? Do you wish you could control which projects to work on and choose your own pay rate? Are you interested in the future of work and how the cloud will form teams? If so - the Gigster Talent Network is for you.
Our clients rely on our Network for two main areas, Software Development and Cloud Services. In some cases, they need help building great new products, in others they want our expertise in migrating, maintaining, and optimizing their cloud solutions.
At Gigster, whether working with entrepreneurs to realize ‘the next great vision’ or with Fortune 500 companies to deliver a big product launch, we build really cool enterprise software on cutting-edge technology.
The Role:
We are seeking an experienced Data Engineer with deep expertise in data transformation at scale, particularly in integrating and processing data from third-party public APIs. This role is critical to enhancing and maintaining data pipelines that feed into Natural Language Processing (NLP) models.
What you’ll do:
Design, build, and optimize scalable ETL/ELT data pipelines using Apache Spark, Apache Kafka, and orchestration tools such as Prefect or Airflow
Integrate external data sources and public APIs with internal data systems
Work with large-scale datasets to support NLP model training and inference
Analyze existing pipelines and recommend enhancements for performance, reliability, and scalability
Collaborate with cross-functional teams, including data scientists and ML engineers
Own the end-to-end engineering process—from planning and technical design to implementation
Regularly report progress and outcomes to client stakeholders
Proficiency in Python and experience with data transformation and data engineering best practices
Strong experience with Apache Spark, Apache Kafka, and Google Cloud Platform (GCP)
Hands-on experience with workflow orchestration tools (e.g., Prefect, Airflow)
Demonstrated experience working with large datasets and real-time data processing
Experience building and maintaining ETL/ELT pipelines for analytical or machine learning use cases
Self-motivated, with excellent communication and project ownership skills
Preferred Qualifications:
Familiarity with financial services data or regulated data environments
Experience with Snowflake or Google BigQuery
- Experience with PostgreSQL and GCS (Google Cloud Storage)
Exposure to NLP workflows and data requirements for machine learning models
Logistics:
- This is a part-time, short term, 4 to 6 weeks contract
- Preferred location: Remote US
Top Skills
Similar Jobs
What you need to know about the Los Angeles Tech Scene
Key Facts About Los Angeles Tech
- Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
- Key Industries: Artificial intelligence, adtech, media, software, game development
- Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
- Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering