Skip to main content
Back to jobs

AI & LLM Data Analyst (Customer-Facing)

External
similarweb logoSimilarweb · Tel Aviv-yafo, Israel
Full-timeOn-site1mo ago
AWSBigQueryData AnalysisHugging FaceLangChainLean
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

We're looking for a Data Analyst to join the Data for AI team. This is a hands-on, customer-facing role focused on working with leading AI companies to turn real-world data into inputs that support model development and evaluation. You'll collaborate closely with external AI teams and internal engineering and product partners to deliver data-driven solutions for specific AI use cases. The work is fast-paced, technical, and often open-ended, requiring comfort with large datasets, ambiguous requirements, and end-to-end ownership. What does the day-to-day looks like: Own end-to-end delivery of data solutions for AI use cases, from understanding model and product requirements to analysis, implementation, quality, and automation Work hands-on with large, raw datasets to create high-quality data inputs that support model training, evaluation, and iteration Apply strong quantitative analysis and data exploration skills to assess coverage, quality, and behavior of data used in AI systems Build scripts, analyses, and reusable components in Python and SQL to support scalable and repeatable workflows Collaborate closely with Engineering to ensure solutions are reliable, scalable, and production-ready Partner directly with external AI teams and internal stakeholders to translate open-ended questions into concrete data outputs This role is a good fit if you have: 4+ years of hands-on experience working with large-scale data using SQL and Spark or BigQuery Strong Python skills for data analysis, scripting, and building reusable workflows Experience working with raw, imperfect data and turning it into reliable, high-quality outputs Strong analytical and problem-solving skills, with the ability to break down open-ended or ambiguous requirements Ability to take end-to-end ownership of data projects, from exploration to delivery Some hands-on experience with LLM-based systems, such as running inference via APIs, experimenting with prompts, or participating in basic evaluation or testing workflows Clear communication skills in English and experience working directly with external stakeholders

Requirements

  • Deeper hands-on experience with LLMs in production or experimentation, for example prompt engineering, batch inference, or structured evaluation using APIs such as OpenAI, Anthropic, or similar providers
  • Familiarity with agent frameworks or orchestration layers (for example LangChain, LlamaIndex)
  • Experience with LLM evaluation or monitoring workflows, including offline evals, prompt regression testing, or tools such as LangSmith, Weights & Biases, TruLens, or Ragas
  • Experience experimenting with open-source or local models (for example via Ollama, vLLM, or Hugging Face tooling)
  • Familiarity with cloud-based data infrastructure, including AWS
  • Why you'll love being a Similarwebber:
  • You'll find a home for your big ideas : We encourage an open dialogue and empower employees to bring their ideas to the table. You'll find the resources you need to take initiative and create meaningful change within the organization.
  • We offer competitive perks & benefits: We take your well-being seriously, and offer competitive compensation packages to all employees. We also put a strong emphasis on community, with regular team outings and happy hours.

Additional Information

At Similarweb, we build some of the most comprehensive and unique views of how the digital world actually works. The Data for AI team is a small, specialized group within Similarweb, working closely with a select set of the world's leading AI organizations (mostly foundational model companies). The team's mission is to enable these companies to improve their models and AI assistants by applying Similarweb's data to real-world AI use cases. The work involves deep collaboration with AI teams and a strong focus on data quality, scale, and applicability to modern machine learning systems. The team operates in a lean, high-ownership environment and plays a direct role in shaping how Similarweb's data is used in advanced AI products.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at similarweb? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect
AI & LLM Data Analyst (Customer-Facing) at Similarweb