Crawl Dictionary

Crawl Rate

The number of requests per second a crawler makes to a website while crawling it.
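As a rough sketch, crawl rate can be estimated from the timestamps of a bot's requests in the server logs. The function name `requests_per_second` and the sample timestamps below are hypothetical, and averaging over the whole observed span (idle seconds included) is only one possible convention.

```python
from collections import Counter
from datetime import datetime


def requests_per_second(timestamps):
    """Count bot requests in each one-second bucket."""
    return Counter(ts.replace(microsecond=0) for ts in timestamps)


# Hypothetical request times of one bot, e.g. parsed from access logs.
hits = [
    datetime(2018, 1, 1, 12, 0, 0, 100000),
    datetime(2018, 1, 1, 12, 0, 0, 700000),
    datetime(2018, 1, 1, 12, 0, 1, 200000),
    datetime(2018, 1, 1, 12, 0, 3, 0),
]
per_second = requests_per_second(hits)

# Highest number of requests seen in any single second.
peak_rate = max(per_second.values())

# Average over the observed span, counting idle seconds as well.
span_seconds = (max(per_second) - min(per_second)).total_seconds() + 1
average_rate = len(hits) / span_seconds
print(peak_rate, average_rate)
```

The peak value is usually the more interesting figure when checking whether a bot puts too much load on a host.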

Crawl Budget

An allocation of crawl requests to a host.

Crawl Frequency

The program that determines which sites to crawl, how often, and how many pages to fetch from each site.

Crawl Rank

The frequency with which a page is crawled by a search engine bot compared to the ranking position of that page on that search engine.

Crawl Space

The totality of possible URLs for a website.

Crawl Ratio

The number of pages crawled by a search engine bot compared to the total number of pages available to crawl on a website. A ratio of 100% means the search engine knows all the pages on that website.
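A minimal sketch of this calculation, assuming the crawled URLs come from server logs and the available URLs come from a full site crawl or a sitemap. The function name `crawl_ratio` and the sample URLs are hypothetical.

```python
def crawl_ratio(crawled_urls, available_urls):
    """Return the share (0-100) of available URLs that the bot crawled."""
    available = set(available_urls)
    if not available:
        return 0.0
    # Only count crawled URLs that belong to the known site structure.
    crawled = set(crawled_urls) & available
    return 100.0 * len(crawled) / len(available)


# Hypothetical sample data: "/old-page" is crawled but no longer available.
available = ["/", "/a", "/b", "/c"]
crawled = ["/", "/a", "/a", "/old-page"]
ratio = crawl_ratio(crawled, available)
print(f"{ratio:.1f}%")
```

Repeated hits on the same URL and hits on URLs outside the site structure are ignored here; whether that is appropriate depends on how "available pages" is defined for the site.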

Effective Crawl Ratio

The number of pages of a given type crawled by a search engine bot within the crawl window for that type of page, on that website and for that search engine, compared to the total number of pages of that type available to crawl on the website.

Crawl Window

The timeframe during which a search engine continues to send visitors to a URL after crawling it. This period varies depending on the type of page. Knowing it makes it possible to estimate the Effective Crawl Ratio.
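To illustrate how a crawl window feeds into the Effective Crawl Ratio, here is a small sketch. It assumes the latest crawl date of each URL is known (for example from logs) and that a window has already been chosen for the page type; the 14-day window, the function name `effective_crawl_ratio` and the URLs are made up for the example.

```python
from datetime import datetime, timedelta


def effective_crawl_ratio(last_crawl, page_urls, window, now):
    """Share (0-100) of page_urls crawled within `window` before `now`.

    last_crawl: dict mapping URL -> datetime of the bot's latest crawl.
    page_urls:  all available URLs of one page type.
    window:     assumed crawl window (timedelta) for that page type.
    """
    urls = set(page_urls)
    if not urls:
        return 0.0
    cutoff = now - window
    fresh = [u for u in urls if u in last_crawl and last_crawl[u] >= cutoff]
    return 100.0 * len(fresh) / len(urls)


now = datetime(2018, 1, 31)
last_crawl = {
    "/product/1": datetime(2018, 1, 30),
    "/product/2": datetime(2018, 1, 2),  # crawled, but outside the window
    "/product/3": datetime(2018, 1, 25),
}
products = ["/product/1", "/product/2", "/product/3", "/product/4"]
effective = effective_crawl_ratio(last_crawl, products, timedelta(days=14), now)
print(effective)
```

In this example three of the four product pages have been crawled at some point, but only two fall inside the window, so the effective ratio is lower than the plain Crawl Ratio.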

Crawl Depth

Depth is the length of the shortest path (the minimal number of clicks) from the homepage to a particular page. Crawl depth is how deep a crawler is programmed to explore a website.
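Both notions can be sketched with a breadth-first search over the internal link graph: the first time a page is reached gives its depth, and an optional limit stands in for the crawl depth a crawler is configured with. The link graph, the function name `page_depths` and the `max_depth` parameter are assumptions made for this example.

```python
from collections import deque


def page_depths(links, homepage, max_depth=None):
    """Breadth-first search returning {url: minimal clicks from homepage}.

    links:     dict mapping URL -> list of URLs it links to.
    max_depth: optional crawl depth limit; links on pages at this depth
               are not followed.
    """
    depths = {homepage: 0}
    queue = deque([homepage])
    while queue:
        url = queue.popleft()
        if max_depth is not None and depths[url] >= max_depth:
            continue  # do not follow links beyond the configured crawl depth
        for target in links.get(url, []):
            if target not in depths:  # first visit in BFS is the shortest path
                depths[target] = depths[url] + 1
                queue.append(target)
    return depths


# Hypothetical internal link graph.
links = {
    "/": ["/category", "/about"],
    "/category": ["/product", "/"],
    "/product": ["/product/reviews"],
}
depths = page_depths(links, "/")
limited = page_depths(links, "/", max_depth=2)
print(depths)
print(limited)
```

With `max_depth=2` the page `/product/reviews`, which sits three clicks from the homepage, is never discovered.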

Crawl Waste

Pages available to crawl on a website that have no unique content, no SEO purpose, and no added value for either users or search engines.

Crawl Simulation

Crawling not the live website but the stored result of a crawl of that website that has already been performed.

Crawl Efficiency

The number of useful pages crawled by a search engine bot compared to all pages crawled by the same bot in a defined period.

Crawl Retention

The timeframe between a crawl by a search engine bot and the recording of a visit from that search engine.

Crawl Optimization

Intelligent use of the Crawl Budget (allocation) on a website.

Crawl Performance

The average time a crawler spends downloading a page, in milliseconds.
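As a simple illustration, this is the mean of the download times recorded for a bot's requests. The sample values and the function name `crawl_performance` are hypothetical, and where the timings come from (server logs, a search engine's reporting tool) is left open.

```python
from statistics import mean


def crawl_performance(download_times_ms):
    """Average page download time in milliseconds (0.0 if no data)."""
    return mean(download_times_ms) if download_times_ms else 0.0


# Hypothetical download times of four bot requests, in milliseconds.
times_ms = [120.0, 80.0, 100.0, 300.0]
average_ms = crawl_performance(times_ms)
print(average_ms)
```

A single slow page can pull the mean up noticeably, so looking at the median alongside it may be worthwhile.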

Useful Crawl

Pages of a website crawled by a search engine bot that bring at least one visit from that search engine in a defined period.

Useless Crawl 

Pages of a website crawled by a search engine bot that bring no visits from that search engine in a defined period.
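The split between useful and useless crawl, and the Crawl Efficiency defined earlier, can be sketched by joining crawl data with visit data for the same search engine and period. The function names and sample figures are hypothetical, and the visit counts are assumed to come from analytics or logs.

```python
def split_crawl(crawled_urls, visits_by_url):
    """Split crawled URLs into (useful, useless) sets.

    visits_by_url: dict mapping URL -> number of visits from the same
    search engine in the same period.
    """
    crawled = set(crawled_urls)
    useful = {u for u in crawled if visits_by_url.get(u, 0) >= 1}
    return useful, crawled - useful


def crawl_efficiency(crawled_urls, visits_by_url):
    """Share (0-100) of crawled pages that are useful."""
    useful, useless = split_crawl(crawled_urls, visits_by_url)
    total = len(useful) + len(useless)
    return 100.0 * len(useful) / total if total else 0.0


# Hypothetical data: "/not-crawled" gets visits but was not crawled
# in the period, so it plays no part in the calculation.
crawled = ["/", "/a", "/b", "/c", "/a"]
visits = {"/": 120, "/a": 3, "/b": 0, "/not-crawled": 9}
useful, useless = split_crawl(crawled, visits)
efficiency = crawl_efficiency(crawled, visits)
print(sorted(useful), sorted(useless), efficiency)
```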

Simulate Empty Crawl

URLs that are known and kept by a crawler but not requested from the host, typically URLs blocked by the website's robots.txt. A crawler is sometimes deliberately configured to perform this kind of crawl in order to analyze the links of a website.
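A small sketch of the idea, using Python's standard `urllib.robotparser` to find the known URLs that a crawler would keep in its link data but never request. The robots.txt content, the bot name `ExampleBot` and the URLs are invented for the example.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, parsed from a string instead of
# being fetched from a host.
robots_txt = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())

# URLs discovered through links on the site.
known_urls = [
    "https://example.com/",
    "https://example.com/private/report",
    "https://example.com/blog/post",
]

# Known URLs the crawler keeps for link analysis but does not request.
not_requested = [u for u in known_urls if not parser.can_fetch("ExampleBot", u)]
print(not_requested)
```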

Partial Crawl

Crawling specific, selected parts of a website.

Unique Crawl 

The unique URLs a crawler crawls on a single website in a defined period.


