bookriotcom.c.presscdn.com
robots.txt

Robots Exclusion Standard data for bookriotcom.c.presscdn.com

Resource Scan

Scan Details

Site Domain bookriotcom.c.presscdn.com
Base Domain presscdn.com
Scan Status Ok
Last Scan2024-05-19T14:07:51+00:00
Next Scan 2024-06-18T14:07:51+00:00

Last Scan

Scanned2024-05-19T14:07:51+00:00
URL http://bookriotcom.c.presscdn.com/robots.txt
Domain IPs 13.33.30.109, 13.33.30.28, 13.33.30.69, 13.33.30.77, 2600:9000:229f:3e00:12:3cea:bbc0:93a1, 2600:9000:229f:5600:12:3cea:bbc0:93a1, 2600:9000:229f:8a00:12:3cea:bbc0:93a1, 2600:9000:229f:ae00:12:3cea:bbc0:93a1, 2600:9000:229f:ba00:12:3cea:bbc0:93a1, 2600:9000:229f:be00:12:3cea:bbc0:93a1, 2600:9000:229f:c00:12:3cea:bbc0:93a1, 2600:9000:229f:f200:12:3cea:bbc0:93a1
Response IP 13.33.30.109
Found Yes
Hash 371bdf435fdf6b8d98a56d1d73502520b1fcf5d3949ccb916f6ceff639ffa9bf
SimHash 620cdc42a213

Groups

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /search/
Disallow /wp-admin/
Disallow /search/_target*
Disallow /wp-content/plugins/

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap http://bookriot.com/sitemap_index.xml

Warnings

  • 1 invalid line.