archive.cheapbooks.com
robots.txt

Robots Exclusion Standard data for archive.cheapbooks.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	archive.cheapbooks.com
Base Domain	cheapbooks.com
Scan Status	Ok
Last Scan	2024-09-29T01:34:46+00:00
Next Scan	2024-10-29T01:34:46+00:00

Last Scan

Scanned	2024-09-29T01:34:46+00:00
URL	http://archive.cheapbooks.com/robots.txt
Domain IPs	104.248.5.188
Response IP	104.248.5.188
Found	Yes
Hash	141647076d4745f98e949d9b48ed7a0a8a9c939c039ee5c10cf94484e670468d
SimHash	cb4557f22611

Groups

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

amazonadbot

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

clickagy

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

grapeshotcrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

linguee

Rule	Path
Disallow	/

Rule

Path

Disallow

photon

Rule	Path
Disallow	/

Rule

Path

Disallow

rytebot

Rule	Path
Disallow	/

Rule

Path

Disallow

seokicks

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

turnitinbot

Rule	Path
Disallow	/

Rule

Path

Disallow

wotbox

Rule	Path
Disallow	/

Rule

Path

Disallow

adbeat_bot

Rule	Path
Disallow	/

Rule

Path

Disallow

linkdexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

proximic

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	/sitemaps/sitemap-index.xml
sitemap	/sitemaps/urls.txt

Field

Value

sitemap

/sitemaps/sitemap-index.xml

sitemap

/sitemaps/urls.txt

archive.cheapbooks.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

ahrefsbot

amazonadbot

blexbot

clickagy

dotbot

grapeshotcrawler

linguee

photon

rytebot

seokicks

semrushbot

turnitinbot

wotbox

adbeat_bot

linkdexbot

proximic

Other Records

archive.cheapbooks.com
robots.txt