Senior Data Scientist
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
The Senior Data Scientist will lead the maturation of Securly's content classification system - building the ML infrastructure that determines, at scale, whether web content is appropriate for K-12 students, and establishing the rigorous evaluation framework that product and leadership teams depend on. This is applied ML with direct student safety impact - not research. You will lead a significant uplift of Securly's classification models: refactoring binary models to proper multiclass classification, building labeled evaluation datasets, and producing standardized model cards with per-category precision, recall, F1, and confusion matrix analysis. At L5, you are the technical leader of the data science function for content safety. You will define the evaluation methodology the team follows, set the standard for what a model card must contain before a model ships, mentor the team on applied ML rigor, and serve as the interface between data science and engineering on production integration constraints. Level: L5 Experience: 8-15 Years Location: Pune, India Work Type: Hybrid (2 days onsite) Reports To: Engineering Manager, Data Platform What It Means to Be L5 at Securly L5 at Securly is a Staff Engineer. You are the technical owner, not just an implementer. Drive technical direction for your initiative end-to-end: from architecture to production, with minimal oversight from your engineering manager. Identify and resolve ambiguity in requirements, system boundaries, and design tradeoffs without waiting for a fully-formed spec. Mentor L3/L4 engineers on the team: code reviews, design feedback, pairing, and raising the bar for what production-quality work looks like. Partner with your L6 technical lead and the Distinguished Engineer on architectural decisions, surfacing tradeoffs clearly rather than deferring them upward. Contribute to cross-team engineering standards: you are expected to influence practices beyond your immediate squad. Translate technical context into clear written artifacts that non-engineers (PM, Support, Leadership) can act on. Participate in on-call rotation and own the full incident lifecycle for your system: detection, diagnosis, resolution, and retrospective.
Responsibilities
- Define the evaluation methodology for content classification at Securly: establish what a model card must contain and hold every model release to that standard before it ships.
- Lead the multiclass refactor of Securly's content classification models: redesign binary models to handle multi-label, multi-class content categories (Adult Content, Violence, Self-Harm, Social Media, and others).
- Build and maintain labeled evaluation datasets with robust annotation workflows; address class imbalance and label noise systematically; document dataset curation decisions in a versioned data card.
- Connect offline evaluation to production monitoring - surface classification drift and error patterns before they become customer-facing issues.
- Investigate and resolve misclassification errors: false positives (over-blocking) and false negatives (under-blocking); produce written root cause analyses.
- Build and maintain training data pipelines: ingestion, cleaning, labeling, and versioning at scale.
- Mentor the existing AI team on evaluation methodology, model development practices, and data science communication rigor.
- Communicate precision/recall tradeoffs to product managers and engineers; produce executive-level summaries of classification quality for leadership.
- Collaborate with engineering to integrate model outputs into the production filtering stack with appropriate latency and reliability constraints.
- Research and prototype improvements: feature representations, model architectures, active learning for label efficiency, domain adaptation for emerging content categories.
- Skills & Requirements
Requirements
- Machine learning - multi-label/multi-class classification, model evaluation methodology, handling class imbalance, feature engineering for text and URL data. 5+ years in applied ML roles.
- Python (ML stack) - production-quality code: scikit-learn, PyTorch or TensorFlow, pandas, numpy. Notebooks for exploration; production-grade pipelines for delivery.
- Text / NLP feature engineering - URL tokenization, domain analysis, HTML content features, TF-IDF or embedding-based representations for web content classification.
- ML evaluation rigor - precision/recall tradeoffs, confusion matrix analysis, offline vs. online evaluation, A/B testing, reproducible model cards. At L5, you define the evaluation standard.
- Data engineering for ML - training data pipelines, data versioning, handling noisy and partially labeled datasets, annotation workflow design.
- Technical communication and stakeholder influence - ability to present quantitative model quality findings to both engineering and non-technical leadership.
- Strongly Preferred
- Large-scale classification in production - shipping models with latency and thro
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at securly13? Share your experience