Maintain and monitor web scraping configurations, ingest and transform scraped data into the data lake, detect and resolve pipeline issues, produce stakeholder reports, and build scalable ETL pipelines and data architectures to support analytics.
We are growing! We are currently looking to hire a Data Engineer to work with us remotely.
Who we are:
Founded in 2006, we’re proud to be a global business. From Shanghai to Paris, we have 12 offices and operate across four continents in 70 countries. We are home to over 250 professionals from around the world, working together to serve more than 230 luxury clients.
At CXG, we love to evolve, elevate, and transform experiences while bringing brand promises to life. We offer strategic solutions that impact performance and elevate the customer experience of some of the world’s most iconic premium and luxury brands.
Your duties will also involve:
- Maintain and manage website scraping configurations using Python
- Monitor scraping configurations for errors and potential crashes.
- Oversee retrieved data to detect potential issues and blockages.
- Coordinate with stakeholders to understand scraping task requirements and report issues.
- Prepare and share periodic reports on scraping activities with stakeholders.
- Develop necessary pipelines to ingest data into the Datalake and perform required transformations.
Requirements
What you will bring along:
- Minimum 2 years of experience in a similar role.
- Proven experience in data engineering with expertise in designing and implementing scalable data architectures.
- Strong experience with ETL processes, data modeling, and data warehousing (Airflow & DBT preferred).
- Expertise in database technologies, both relational (SQL) and NoSQL.
- Knowledge of cloud platforms, particularly Azure.
- Solid understanding of data security measures and compliance standards.
- Excellent Python experience for data engineering and automation.
- Strong collaboration skills to work closely with data scientists and analysts.
- Ability to optimize data pipelines for performance and efficiency.
- Ability to build, test, and maintain tasks and projects.
- Experience with version control systems, such as Git.
- Hands-on experience with Airflow and/or DBT.
- Experience with Terraform for infrastructure management.
- Strong academic background in a relevant field.
- Fluent in English (French is a plus).
Top Skills
Airflow
Azure
Data Lake
Data Modeling
Data Warehousing
Dbt
ETL
Git
NoSQL
Python
SQL
Terraform
Web Scraping
Similar Jobs
Mobile • Software
The Senior Data Engineer designs and maintains data infrastructure, collaborates with teams, and builds data models and APIs to support data-driven decisions.
Top Skills:
AWSAzureAzure Data FactoryCi/CdCloud FormationData Modeling MethodologiesDatabricksEltETLGitlabPythonSQLTerraform
Insurance
Design and develop scalable data pipelines using PySpark and Databricks, focusing on data ingestion, transformation, validation, and performance optimization.
Top Skills:
DatabricksPysparkPythonSparkSQL
Insurance
The Lead Data Engineer will design and build scalable data pipelines on Databricks, develop ETL/ELT pipelines with PySpark, and mentor engineering teams for data transformation and analytics.
Top Skills:
DatabricksPysparkPythonSQL
What you need to know about the Los Angeles Tech Scene
Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.
Key Facts About Los Angeles Tech
- Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
- Key Industries: Artificial intelligence, adtech, media, software, game development
- Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
- Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering


