HARD
Google Search Index
Build distributed index for 100B+ web pages supporting billions of queries/day.
Estimated Time: 120 minutes
#search#indexing#scale
Solution Overview
Distributed crawling, MapReduce indexing, inverted index, query rewriting, ranking.
Used By Companies
GoogleAmazon
Solution Overview
Distributed crawling, MapReduce indexing, inverted index, query rewriting, ranking.
Approach
Web crawling, distributed indexing, inverted index, ML ranking
Companies
- •Amazon
Components
- •Web crawler
- •Indexer
- •Inverted index
- •Query processor
- •Ranker