HARD

Google Search Index

Build distributed index for 100B+ web pages supporting billions of queries/day.

Estimated Time: 120 minutes
#search#indexing#scale

Solution Overview

Distributed crawling, MapReduce indexing, inverted index, query rewriting, ranking.

Used By Companies

GoogleAmazon
Solution Overview

Distributed crawling, MapReduce indexing, inverted index, query rewriting, ranking.

Approach

Web crawling, distributed indexing, inverted index, ML ranking

Companies
  • Google
  • Amazon
Components
  • Web crawler
  • Indexer
  • Inverted index
  • Query processor
  • Ranker