BJAK Logo

BJAK

Data Engineer

Posted 8 Days Ago
Remote
Hiring Remotely in United States
Mid level
Remote
Hiring Remotely in United States
Mid level
As a Data Engineer, you will prepare datasets for machine learning, manage data labeling workflows, and ensure high-quality data for AI applications.
The summary above was generated by AI

Transform Language Models into Real-World Applications

We’re building AI systems for a global audience. We are living in an era of AI transition - this new project team will be focusing on building applications to enable more real world impact and highest usage for the world.
This role is a global role with hybrid work arrangement - combining flexible remote work with in-office collaboration at our HQ. You’ll work closely with regional teams across product, engineering, operations, infrastructure and data to build and scale impactful AI solutions.

Why This Role Matters

You’ll fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production. Your work ensures our models are not only intelligent, but also safe, trustworthy, and impactful at scale.

What You’ll Do
  • Collect, clean, and preprocess user-generated text and image data for fine-tuning large models

  • Design and manage scalable data labeling pipelines, leveraging both crowdsourcing and in-house labeling teams

  • Build and maintain automated datasets for content moderation (e.g., safe vs unsafe content)

  • Collaborate with researchers and engineers to ensure datasets are high-quality, diverse, and aligned with model training needs

What Is It Like
  • Likes ownership and independence

  • Believe clarity comes from action - prototype, test, and iterate without waiting for perfect plans.

  • Stay calm and effective in startup chaos - shifting priorities and building from zero doesn’t faze you.

  • Bias for speed - you believe it’s better to deliver something valuable now than a perfect version much later.

  • See feedback and failure as part of growth - you’re here to level up.

  • Possess humility, hunger, and hustle, and lift others up as you go.

Requirements
  • Proven experience preparing datasets for machine learning or fine-tuning large models

  • Strong skills in data cleaning, preprocessing, and transformation for both text and image data

  • Hands-on experience with data labeling workflows and quality assurance for labeled data

  • Familiarity with building and maintaining moderation datasets (safety, compliance, and filtering)

  • Proficiency in scripting (Python, SQL) and working with large-scale data pipelines

What You’ll Get
  • Flat structure & real ownership

  • Full involvement in direction and consensus decision making

  • Flexibility in work arrangement

  • High-impact role with visibility across product, data, and engineering

  • Top-of-market compensation and performance-based bonuses

  • Global exposure to product development

  • Lots of perks - housing rental subsidies, a quality company cafeteria, and overtime meals

  • Health, dental & vision insurance

  • Global travel insurance (for you & your dependents)

  • Unlimited, flexible time off

Our Team & Culture

We’re a densed, high-performance team focused on high quality work and global impact. We behave like owners. We value speed, clarity, and relentless ownership. If you’re hungry to grow and care deeply about excellence, join us.

About Bjak

BJAK is Southeast Asia’s #1 insurance aggregator with 8M+ users, fully owned by its employees. Headquartered in Malaysia and operating in Thailand, Taiwan, and Japan, we help millions of users access transparent and affordable financial protection through Bjak.com. We simplify complex financial products through cutting-edge technologies, including APIs, automation, and AI, to build the next generation of intelligent financial systems.
If you're excited to build real-world AI systems and grow fast in a high-impact environment, we’d love to hear from you.

Top Skills

Python
SQL

Similar Jobs

3 Days Ago
Remote or Hybrid
Orlando, FL, USA
5-5
Senior level
5-5
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
As a Data Engineer II, you will design, build, and maintain data pipelines, collaborate with product managers, and troubleshoot production issues.
Top Skills: Apache AirflowAWSDynamoDBGlueJavaPysparkPythonRedshiftS3ScalaSQLSQL
7 Days Ago
Easy Apply
Remote or Hybrid
3 Locations
Easy Apply
Senior level
Senior level
Real Estate • Security • Software • Cybersecurity • PropTech
The Data Engineer will build and maintain data pipelines, contribute to architecture decisions, support data governance, and mentor peers, focusing on scalability and reliability.
Top Skills: AirflowBigQueryDbtKafkaPythonRedshiftScalaSnowflakeSpark StreamingSQL
7 Days Ago
Remote
United States
180K-220K Annually
Senior level
180K-220K Annually
Senior level
Big Data • Cloud • Software • Generative AI • Big Data Analytics
As a Data Engineer at Monte Carlo, you will build and maintain data pipelines, collaborate with data science teams, and enhance pipeline reliability using the company's own product.
Top Skills: AirflowAWSBigQueryCloudFormationPysparkPythonRedshiftSnowflakeSparkSQLTerraform

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account