Skip to main content
Back to jobs

Senior Data Scientist NLP/GenAI - Catalog

External
mirakllabs logoMirakllabs ยท Paris, France
Full-timeOn-site1mo ago
AirflowAWSComputer VisionCore MLData AnalysisGenerative AI
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Responsibilities

  • Build and deploy ML algorithms to production that power 500+ e-commerce and marketplace sites across 40 countries, directly impacting revenue growth, operational efficiency, and transaction safety
  • Tackle real-world catalog challenges including automatic content rewriting, product attribute extraction from images and text, variant detection, product categorization, seller onboarding automation, and trending product prediction
  • Work with cutting-edge AI techniques including multimodal models and LLM fine-tuning-Mirakl is one of the few French players with fine-tuned LLMs in large-scale production
  • Own your projects end-to-end : from data analysis and prototyping to production deployment with Data Engineers and dev teams, plus building dashboards to monitor algorithm performance
  • Collaborate across teams to refine use cases, user experience, and integration paths while presenting results at weekly data science meetings
  • What You'll Bring to the Role

Requirements

  • 4+ years of experience as a Data Scientist with strong hands-on NLP and applied ML in industry
  • Proven track record of deploying Machine Learning algorithms to production
  • Experience with Spark development for large-scale data processing
  • Expertise in NLP and Computer Vision algorithms and state-of-the-art architectures (e.g., Transformers)
  • Proficiency in Python and TensorFlow and/or PyTorch
  • Knowledge of the latest LLMs and fine-tuning techniques
  • Data-driven, pragmatic, and business-oriented approach
  • Strong ownership and autonomy with excellent team collaboration
  • Tech Stack
  • Core ML/AI:
  • Python, TensorFlow, PyTorch, Hugging Face
  • LLM-specific:
  • Autotrain, Unsloth, Galileo, LangChain, Anyscale
  • Data infrastructure:
  • Databricks, Spark, AWS (Amazon Redshift, S3), SQL, Airflow, Delta Lake
  • Our Hiring Process
  • We warmly encourage you to apply to any of our roles, even if you think you're not an exact match.
  • Hiring steps:
  • A 30/45-minute phone call with one of our Tech recruiters to discuss your background, expectations, and the role
  • A 30-minute technical call with someone from the Data Science team to dive into concrete aspects of your expertise and how it fits our projects
  • A take-home assignment to demonstrate your technical skills
  • A 75-minute technical debrief and discussion with the Data Science Team managers
  • 2x45mn STAR interviews with future Mirakl colleagues to discuss our values and culture
  • We welcome collaborators with their diverse perspectives and experiences to power us forward. These often far exceed conventional job requirements and help us create a culture of continuous learning. If you're ready to join a global leader powering digital transformation for 450+ of

Benefits

Vision insuranceRemote work options

Additional Information

About Mirakl: Founded in 2012, Mirakl has been at the forefront of marketplace innovation, empowering every business to compete in the platform economy. Today, Mirakl's operating system combines an enterprise marketplace solution (Mirakl Platform) that enables retailers and B2B organizations to launch, scale, and operate marketplaces and dropship, AI-powered multichannel selling (Mirakl Connect), retail media (Mirakl Ads) and an agentic commerce infrastructure (Mirakl Nexus). With dual headquarters in Boston and Paris, Mirakl helps a global ecosystem of 450+ marketplaces (B2C and B2B) and a network of over 100k third-party marketplace sellers. Brands like Macy's, Decathlon, Carrefour, Asos, and Airbus Helicopters use Mirakl to grow their businesses in new and remarkable ways. For more information: www.mirakl.com . Mirakl in Numbers: ๐Ÿ—“๏ธ Founded in 2012 | Member of French Tech Next40 ๐Ÿ‘ฅ 750+ employees in 9 offices worldwide: Paris, Barcelona, Bordeaux, Boston, London, Munich, New York, Sydney, Tokyo ๐Ÿ‡ซ๐Ÿ‡ท 350+ Mirakl Tech teams members mainly based in France โš™๏ธ 5 Saas Solutions Our Values: Working at Mirakl means accelerating your career alongside ambitious, passionate, and supportive colleagues. We're proud of the diversity of backgrounds, perspectives, and experiences that make our teams unique. Our 5 values guide how we collaborate: ๐Ÿ’ก Work Hard Together: Teamwork and collaboration are the foundation of our success ๐Ÿ† Get Things Done: We prioritize action and efficiency for impactful results ๐Ÿš€ Go Above & Beyond: We tackle challenges proactively and always aim for excellence ๐ŸŽ“ Succeed Through Expertise: Knowledge sharing and continuous learning are core to our culture ๐Ÿค Satisfy & Empower Clients: We're committed to our clients' success The Team You'll Join You'll be part of our Catalog Data Science team led by Arthur Delaitre and Adrien Morvan . As part of our broader Data team (60+ people), you'll be prototyping, iterating, and shipping algorithms to production that directly impact marketplace catalog challenges-from NLP to large-scale Generative AI with custom LLMs. Our opportunity can be located in Paris or Bordeaux and requires 4 days onsite per week, 1 day remote. Meet Arthur Delaitre , Data Science Manager for the team:


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at mirakllabs? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect