bookriotcom.c.presscdn.com
robots.txt

Robots Exclusion Standard data for bookriotcom.c.presscdn.com

Resource Scan

Scan Details

Site Domain	bookriotcom.c.presscdn.com
Base Domain	presscdn.com
Scan Status	Ok
Last Scan	2024-05-19T14:07:51+00:00
Next Scan	2024-06-18T14:07:51+00:00

Last Scan

Scanned	2024-05-19T14:07:51+00:00
URL	http://bookriotcom.c.presscdn.com/robots.txt
Domain IPs	13.33.30.109, 13.33.30.28, 13.33.30.69, 13.33.30.77, 2600:9000:229f:3e00:12:3cea:bbc0:93a1, 2600:9000:229f:5600:12:3cea:bbc0:93a1, 2600:9000:229f:8a00:12:3cea:bbc0:93a1, 2600:9000:229f:ae00:12:3cea:bbc0:93a1, 2600:9000:229f:ba00:12:3cea:bbc0:93a1, 2600:9000:229f:be00:12:3cea:bbc0:93a1, 2600:9000:229f:c00:12:3cea:bbc0:93a1, 2600:9000:229f:f200:12:3cea:bbc0:93a1
Response IP	13.33.30.109
Found	Yes
Hash	371bdf435fdf6b8d98a56d1d73502520b1fcf5d3949ccb916f6ceff639ffa9bf
SimHash	620cdc42a213

Groups

gptbot

Rule	Path
Disallow	/

*

Rule	Path
Disallow	/search/
Disallow	/wp-admin/
Disallow	/search/_target*
Disallow	/wp-content/plugins/
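
The two groups above can be checked against sample URLs. The following is a minimal sketch, assuming the groups are reassembled into a robots.txt body in the order the scan lists them; the original file's exact layout (including the one invalid line reported under Warnings) is not known. The agent name `ExampleBot` and the sample paths are hypothetical. It uses Python's standard `urllib.robotparser`, which does plain prefix matching and does not interpret `*` inside a path as a wildcard, so other crawlers may treat `/search/_target*` differently.

```python
from urllib.robotparser import RobotFileParser

# Reassembled from the "gptbot" and "*" groups in the scan (assumed layout).
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: *
Disallow: /search/
Disallow: /wp-admin/
Disallow: /search/_target*
Disallow: /wp-content/plugins/
"""

BASE = "http://bookriotcom.c.presscdn.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse a robots.txt body held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# (user agent, path) pairs; the paths are illustrative only.
checks = [
    ("GPTBot", "/"),
    ("GPTBot", "/wp-content/uploads/cover.jpg"),
    ("ExampleBot", "/"),
    ("ExampleBot", "/search/?s=fantasy"),
    ("ExampleBot", "/wp-admin/"),
    ("ExampleBot", "/wp-content/plugins/script.js"),
    ("ExampleBot", "/wp-content/uploads/cover.jpg"),
]

for agent, path in checks:
    allowed = parser.can_fetch(agent, BASE + path)
    print(f"{agent:<11} {path:<32} {'allowed' if allowed else 'blocked'}")
```

Under these assumptions, `GPTBot` is blocked everywhere, while any other agent is blocked only under the four listed path prefixes.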

Other Records

Field	Value
crawl-delay	10

Other Records

Field	Value
sitemap	http://bookriot.com/sitemap_index.xml
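
The crawl-delay and sitemap records can be read back with the same standard-library parser. This is a sketch under a stated assumption: the scan reports `crawl-delay` under Other Records rather than inside a group, so its position in the original file is unknown. Purely for illustration, the example places it inside a `*` group, because `urllib.robotparser` ignores a crawl-delay line that appears outside any group. The agent name `ExampleBot` is hypothetical, and `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Illustrative body only: the placement of Crawl-delay is an assumption.
OTHER_RECORDS = """\
User-agent: *
Crawl-delay: 10

Sitemap: http://bookriot.com/sitemap_index.xml
"""

records = RobotFileParser()
records.parse(OTHER_RECORDS.splitlines())

# Delay, in seconds, that a compliant crawler would wait between requests.
delay = records.crawl_delay("ExampleBot")

# Sitemap URLs apply to the whole file, independent of any group.
sitemaps = records.site_maps()

print("crawl-delay:", delay)
print("sitemaps:", sitemaps)
```

Note that the sitemap points at `bookriot.com`, not at the scanned CDN host `bookriotcom.c.presscdn.com`.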

Warnings

1 invalid line.
