RE/RS, Data Understanding - Foundations

OpenAI · San Francisco
Full-time · On-site · 2w ago
Compliance · Deep Learning

About the role

The Data Understanding team is responsible for creating the high-quality datasets and their quantized representations for OpenAI. This includes synthesizing data and building VQ representations, as well as processing, filtering, deduplicating, quality-controlling, and tokenizing data so that it can be used effectively in large model training runs.

We're looking to advance how OpenAI builds and understands pretraining data at scale. You'll treat data quality and curation as core research problems: developing new methods to select, combine, and transform data; creating datasets that improve model capabilities; and designing rigorous experiments to understand how data choices and interventions affect model learning and downstream behavior. You'll work closely with frontier models and web-scale data to build evidence for which approaches work and why, then translate successful research into scalable data processing pipelines.

We Expect You To

  • Have a strong track record of new or improved ML ideas, through publications, projects, or applied research.
  • Own and drive a research agenda, from choosing the right problems to carrying long-running work through to impact.
  • Be excited by OpenAI's empirical, collaborative approach to research.

Requirements

  • Thoughtfulness about AI's impact, including privacy, provenance, and data quality.
  • Experience building high-performance deep learning or large-scale data processing systems.

About OpenAI

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

