Skip to main content
Back to jobs

Senior MLOps Engineer

External
crunchyroll logoCrunchyroll · Hyderabad, India
Full-timeOn-site4d ago
AirflowAWSCI/CDCloudFormationComplianceDocker
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

Crunchyroll is growing and changing, presenting unique challenges and opportunities to support millions of anime fans around the world. The AI/ML team provides seamless help to our internal stakeholders, ensuring an exceptional experience for all Crunchyroll fans. The AI/ML team relies on strong MLOps practices to ensure models are reliable, scalable, and impactful in production. Design, build, and maintain end-to-end ML infrastructure and pipelines to support model training, deployment, and monitoring. Develop and manage CI/CD pipelines for ML to enable fast, reliable, and automated delivery of ML models. Implement and manage model registry, experiment tracking, and versioning using tools like MLflow, SageMaker Model Registry, or equivalent. Establish monitoring, observability, and alerting frameworks to detect drift, degradation, and anomalies in real-time. Partner with data scientists to productionize ML models, ensuring seamless transition from research to production. Optimize ML workflows for performance, scalability, and cost-effectiveness across training and inference. Leverage platforms such as AWS SageMaker, Databricks, Kinesis, Lambda, Kubernetes (EKS), and Docker for ML operations. Collaborate with data engineering and software engineering teams to integrate ML services into large-scale distributed systems. Drive best practices for MLOps, including reproducibility, governance, compliance, and security of deployed models. How you'll work with Data Science Partner with ML Engineers to deploy and scale models built with frameworks like PyTorch, TensorFlow, and Scikit-learn. Help data scientists track experiments, compare runs, and promote models to production. Translate research notebooks into production-grade pipelines with reproducible training and inference workflows. Co-own model lifecycle management: data, training, validation, deployment, monitoring, retraining. Ensure ML models align with software engineering best practices for testing, automation, and observability. About You We get excited about candidates, like you, because... Bachelor's or Master's degree in Data Science, Computer Science, Statistics, or a related field. 8+ years of experience in MLOps, ML infrastructure, or DevOps for AI/ML systems. MLflow, SageMaker, Databricks ML for experiment tracking, model registry, and lifecycle management. Airflow, or Step Functions for workflow orchestration. MLFLow for monitoring ML models in production. Deep knowledge of CI/CD and automation frameworks (GitHub Actions, Terraform, CloudFormation). Hands-on experience with containerization (Docker) and orchestration (EKS). Proficiency in Python and scripting for ML integrations. Strong knowledge of cloud platforms (AWS preferred) and services relevant to ML (SageMaker, Lambda, S3, Kinesis, Step Functions). Understanding of security, compliance, and governance in ML production systems. Excellent problem-solving and communication skills, with a proven ability to work with cross-functional teams of data scientists The R&D team is dedicated to developing, testing, and validating robust and scalable machine learning models that drive business objectives. Our focus includes enhancing operational processes through AI/ML solutions, such as trend analysis, anomaly detection, and the deployment of large language models (LLMs) for tasks like querying system health. Another major focus area is preserving, and improving customer experience and retention. We closely work with our stakeholders to ensure AI/ML objectives are clearly defined. Why you will love working at Crunchyroll In addition to getting to work with fun, passionate and inspired colleagues, you will also enjoy the following benefits and perks: Best-in class medical, dental, and vision private insurance healthcare coverage Access to counseling & mental health sessions 24/7 through our Employee Assistance Program (EAP) Free premium access to Crunchyroll Professional Development Company's Paid Parental Leave up to 26 weeks for birthing parents up to 12 weeks for non-birthing parents Hybrid Work Schedule Paid Time Off Flex Time Off 5 Yasumi Days Half-Day Fridays during the summer Winter Break #LifeAtCrunchyroll ((select from the following job modalities for this role: #LI-Hybrid #LI-remote #LI-onsite)) About our Values We want to be everything for someone rather than something for everyone and we do this by living and modeling our values in al

Benefits

Health insuranceDental insuranceVision insuranceRemote work optionsParental leave

Additional Information

About Crunchyroll Founded by fans, Crunchyroll delivers the art and culture of anime to a passionate community. We super-serve over 100 million anime and manga fans across 200+ countries and territories, and help them connect with the stories and characters they crave. Whether that experience is online or in-person, streaming video, theatrical, games, merchandise, events and more, it's powered by the anime content we all love. Join our team, and help us shape the future of anime!


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at crunchyroll? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect