Real-time Sentiment Tracking System Design - Netflix

Question Description

You are asked to design a real-time social media sentiment tracking system focused on Netflix brand perception. The system must ingest high-volume streams from platforms (Twitter, Reddit, Facebook), normalize and filter posts about Netflix, apply text classification / sentiment models, aggregate scores over time, and surface trends and alerts to marketing and content teams.

High-level flow you should cover:

Data ingestion: connectors and streaming (API polling, webhooks, firehose) feeding a durable message queue.
Processing: streaming NLP (tokenization, language detection, entity recognition) then a sentiment classifier (binary/multi-class or continuous score) with post-level metadata (platform, timestamp, language).
Aggregation & storage: rollups by minute/hour/day stored in a time-series or OLAP store for ad-hoc queries and historical analysis.
Serving & alerting: dashboards, APIs, and anomaly detectors that trigger alerts when sentiment shifts beyond thresholds.

Skill signals interviewers expect:

Design of scalable streaming pipelines (Kafka, Pub/Sub, Spark/Flink/KStream) and low-latency processing patterns.
Practical NLP knowledge: text classification, handling slang/sarcasm, model evaluation, and continuous model refresh.
Time-series aggregation and forecasting approaches to predict trends and detect anomalies.
Reliability, cost trade-offs, data retention, and privacy considerations (rate limits, API costs, GDPR concerns).

As you present your design, discuss failure modes, monitoring, and how you'd validate model accuracy and alerting thresholds in production.

Netflix ML System Design: Real-time Sentiment Tracking

Question Description

Common Follow-up Questions

Related Questions

Explore More Questions

Practice This Question with AI