ml system design
Coupang
Amazon
eBay

Coupang ML System Design: Scalable E-commerce Search

Topics:
Search Ranking
Semantic Search
Vector Search
Roles:
Machine Learning Engineer
Search Engineer
ML Systems Engineer
Experience:
Mid Level
Senior
Staff

Question Description

You are asked to design a scalable search system for a large e-commerce platform (Coupang-scale). The system must take user text queries, handle millions of products, and return relevant results within strict latency and availability targets. You should cover query processing, candidate retrieval (keyword and semantic/vector), ranking, filtering/facets, autocomplete, and optional personalization.

Core content: explain how queries are parsed, tokenized, and normalized; how spelling correction, synonyms, and intent signals are applied; and how two retrieval paths (inverted-index keyword search and vector/semantic search using ANN) generate candidate sets. Describe a two-stage ranking pipeline where a fast lightweight model or heuristic filters candidates, and a richer ML ranking model (learning-to-rank / gradient boosting / neural ranker) reorders results using relevance, popularity, inventory, and user signals.

Flow / stages you should discuss:

  • Query ingestion & front-end: rate limiting, geo-routing, and autocomplete service
  • Candidate retrieval: inverted indices, sharded ANN/vector indexes, hybrid fusion
  • Feature store & ranking: online features, coarse-to-fine re-ranker, A/B testing hooks
  • Filtering & faceting: attribute indexes and fast post-filtering
  • Serving, caching & observability: edge caches, cold-start handling, monitoring and alarms

Skill signals: demonstrate knowledge of information retrieval (inverted index), semantic/vector search (embeddings + ANN), ML ranking (LTR), capacity planning for thousands QPS, latency budgeting (95th percentile <200ms), high availability patterns, sharding/replication, and trade-offs between recall, precision, and freshness.

Common Follow-up Questions

  • How would you fuse results from inverted-index keyword search and vector/semantic search to maximize recall without harming latency?
  • Describe the indexing pipeline and how you'd support near-real-time updates (price/stock) while keeping query latency under 200ms.
  • How would you design and evaluate an online A/B test for a new ML ranking model in the e-commerce search stack?
  • What capacity planning and sharding strategies would you use to handle thousands of QPS globally while maintaining 99.9% availability?

Related Questions

1Design an autocomplete and query suggestion service for high-traffic e-commerce
2How to build a low-latency ANN vector search index for millions of product embeddings
3Design a personalized ranking system that combines collaborative signals with query intent
4How to implement faceted search and filtering at scale for large product catalogs

Explore More Questions

Practice This Question with AI

Get real-time hints, detailed requirements, and insightful analysis of the question.

Scalable E-commerce Search Design - Coupang ML Engineer | Voker