Data Scientist Interview Questions

35 questions available. Practice with AI-powered feedback.

Other roles

ml foundation
Google
DeepMind
Meta

Google ML Foundations Interview: Loss Functions Guide

Prepare for Google ML interviews: learn MSE vs cross-entropy, derive gradients, and handle numerical stability and class imbalance. Practice follow-ups and choose the right loss.

Software Engineer, ML EngineerEntry Level
ml coding
Uber
Lyft
Airbnb

Implement k-Fold Cross-Validation From Scratch — Uber

Implement k-fold, stratified, and time-series CV from scratch for ML evaluation. Includes split contracts, reproducibility, and aggregate metric. Read on to prepare.

Machine Learning Engineer, Data ScientistMid Level
ml coding
LinkedIn
Google
Amazon

LinkedIn ML: Large-Scale Streaming Mean & Variance

Compute population mean and variance in one pass over massive float streams. Includes mergeable, numerically stable summaries for distributed ML systems — try it.

Machine Learning Engineer, Data ScientistMid Level
ml system design
LinkedIn
Google
Yelp

LinkedIn ML System Design: Real-Time Nearby Recommendations

Build a low-latency, scalable ML system to recommend nearby places in real time. Get architecture, dataflow, personalization tips, and interview follow-ups.

Machine Learning Engineer, Data ScientistMid Level
ml foundation
Lyft
Uber
Airbnb

Lyft ML Engineer Feature Engineering Interview Guide

Study Lyft ML Engineer feature engineering: feature creation, selection, encoding, scaling, leakage avoidance, and trade-offs. Read examples and practice solutions.

Machine Learning Engineer, Data ScientistEntry Level
ml foundation
Microsoft
Google
Amazon

Microsoft ML Foundations: Statistical Analysis & A/B Tests

Microsoft ML interview: statistical analysis, A/B tests, hypothesis tests & confidence intervals. Learn test setup, sample-size, common pitfalls and follow-ups.

Data Engineer, ML EngineerEntry Level
ml system design
Microsoft
Google
Meta

Microsoft ML System Design: Local Sports Team Recommender

Scalable recommender for local sports teams: data ingestion, candidate generation, ranking, real-time updates, and metrics. Prep for ML design interviews.

Software Engineer, ML EngineerMid Level
ml coding
Netflix
Amazon
Spotify

Netflix ML Coding: Compute TF-IDF for Corpus Implementation

Compute TF-IDF for a corpus in Python: implement TF, IDF and per-token TF-IDF scores. See interview flow, skills tested, and practice follow-ups to prepare.

Machine Learning Engineer, Data ScientistEntry Level
ml foundation
NVIDIA
Google
Amazon

NVIDIA ML Engineer Interview — Model Selection Guide

Prepare for NVIDIA ML interviews: master model selection, bias-variance trade-off, cross-validation, ensembles, and evaluation metrics. Try practice prompts.

Machine Learning Engineer, Data ScientistEntry Level
ml coding
OpenAI
Anthropic
Google

OpenAI ML Coding: Noisy Human-Labeled Text Classifier

Analyze noisy human annotations and train embedding-based classifiers for identity_attack labels. Filter reliable annotators, retrain models, and propose robustness steps. Start preparing.

Machine Learning Engineer, ML EngineerMid Level
ml foundation
Oracle
Google
Microsoft

Oracle ML Interview: RAG Systems & Retrieval Models

Prepare for Oracle ML interviews on RAG systems — learn retrieval+generation integration, eval metrics, and experiment design. Read practical tips and follow-ups.

Software Engineer, ML EngineerMid Level
ml system design
PayPal
Stripe
Square

PayPal ML System Design: Real-Time Fraud Detection Engine

Prepare for PayPal ML interviews: design a low-latency, scalable real-time fraud detection pipeline. Learn components, latency tactics, scoring, and follow-ups.

Machine Learning Engineer, Data ScientistMid Level

Get More Real Data Scientist Questions

Practice data scientist interview questions with AI-powered hints, analysis, and feedback.

Start Free Practice
Data Scientist Interview Questions | Voker