Data Scientist Interview Questions
35 questions available. Practice with AI-powered feedback.
Other roles
Google ML Foundations Interview: Loss Functions Guide
Prepare for Google ML interviews: learn MSE vs cross-entropy, derive gradients, and handle numerical stability and class imbalance. Practice follow-ups and choose the right loss.
Implement k-Fold Cross-Validation From Scratch — Uber
Implement k-fold, stratified, and time-series CV from scratch for ML evaluation. Includes split contracts, reproducibility, and aggregate metric. Read on to prepare.
LinkedIn ML: Large-Scale Streaming Mean & Variance
Compute population mean and variance in one pass over massive float streams. Includes mergeable, numerically stable summaries for distributed ML systems — try it.
LinkedIn ML System Design: Real-Time Nearby Recommendations
Build a low-latency, scalable ML system to recommend nearby places in real time. Get architecture, dataflow, personalization tips, and interview follow-ups.
Lyft ML Engineer Feature Engineering Interview Guide
Study Lyft ML Engineer feature engineering: feature creation, selection, encoding, scaling, leakage avoidance, and trade-offs. Read examples and practice solutions.
Microsoft ML Foundations: Statistical Analysis & A/B Tests
Microsoft ML interview: statistical analysis, A/B tests, hypothesis tests & confidence intervals. Learn test setup, sample-size, common pitfalls and follow-ups.
Microsoft ML System Design: Local Sports Team Recommender
Scalable recommender for local sports teams: data ingestion, candidate generation, ranking, real-time updates, and metrics. Prep for ML design interviews.
Netflix ML Coding: Compute TF-IDF for Corpus Implementation
Compute TF-IDF for a corpus in Python: implement TF, IDF and per-token TF-IDF scores. See interview flow, skills tested, and practice follow-ups to prepare.
NVIDIA ML Engineer Interview — Model Selection Guide
Prepare for NVIDIA ML interviews: master model selection, bias-variance trade-off, cross-validation, ensembles, and evaluation metrics. Try practice prompts.
OpenAI ML Coding: Noisy Human-Labeled Text Classifier
Analyze noisy human annotations and train embedding-based classifiers for identity_attack labels. Filter reliable annotators, retrain models, and propose robustness steps. Start preparing.
Oracle ML Interview: RAG Systems & Retrieval Models
Prepare for Oracle ML interviews on RAG systems — learn retrieval+generation integration, eval metrics, and experiment design. Read practical tips and follow-ups.
PayPal ML System Design: Real-Time Fraud Detection Engine
Prepare for PayPal ML interviews: design a low-latency, scalable real-time fraud detection pipeline. Learn components, latency tactics, scoring, and follow-ups.
Get More Real Data Scientist Questions
Practice data scientist interview questions with AI-powered hints, analysis, and feedback.
Start Free Practice