cheapbooks.cc
robots.txt

Robots Exclusion Standard data for cheapbooks.cc

Resource Scan

Scan Details

Site Domain cheapbooks.cc
Base Domain cheapbooks.cc
Scan Status Ok
Last Scan2025-12-05T15:50:35+00:00
Next Scan 2026-01-04T15:50:35+00:00

Last Scan

Scanned2025-12-05T15:50:35+00:00
URL http://cheapbooks.cc/robots.txt
Domain IPs 104.248.5.188
Response IP 104.248.5.188
Found Yes
Hash 41107493c5d4784d7f0226ef28ba799cddab80adbd6506bd6d5ab6d62337dccb
SimHash ca45dcf22611

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefsbot

Rule Path
Disallow /

amazonadbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

clickagy

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

linguee

Rule Path
Disallow /

photon

Rule Path
Disallow /

rytebot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

adbeat_bot

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

Other Records

Field Value
sitemap /sitemaps/sitemap-index.xml