cloudbooklet.com
robots.txt

Robots Exclusion Standard data for cloudbooklet.com

Resource Scan

Scan Details

Site Domain cloudbooklet.com
Base Domain cloudbooklet.com
Scan Status Ok
Last Scan 2025-09-24T00:14:03+00:00
Next Scan 2025-10-01T00:14:03+00:00

Last Scan

Scanned 2025-09-24T00:14:03+00:00
URL https://cloudbooklet.com/robots.txt
Redirect https://www.cloudbooklet.com/robots.txt
Redirect Domain www.cloudbooklet.com
Redirect Base cloudbooklet.com
Domain IPs 104.21.24.91, 172.67.218.27, 2606:4700:3032::6815:185b, 2606:4700:3034::ac43:da1b
Redirect IPs 104.21.24.91, 172.67.218.27, 2606:4700:3032::6815:185b, 2606:4700:3034::ac43:da1b
Response IP 172.67.218.27
Found Yes
Hash 24769f2de15ccfb843bf0aad1511b9ec47a7ca7419f8bce248dfaefe5af5f6ba
SimHash 7018d9c2c681

Groups

*

Rule Path
Disallow /wp/wp-admin/
Allow /wp/wp-admin/admin-ajax.php
Disallow /trackback/
Disallow /xmlrpc.php
Disallow /recommended/
Disallow /.well-known/*
Disallow /index.php/*
Disallow /search/
Disallow *Xhr*
Disallow */xhr*
Disallow */ajax/*
Disallow /?s=
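
The rules of the "*" group mix a broad Disallow with a narrower Allow and several wildcard patterns, so the order in which they are listed does not by itself decide the outcome. The sketch below shows one way to evaluate them, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) and assuming "*" matches any run of characters. The names RULES, pattern_to_regex and is_allowed are hypothetical helpers for illustration, not part of the scanned site or of any crawler, and real crawlers may treat patterns that do not start with "/" differently.

```python
import re

# Rules of the "*" group, copied from the report above.
RULES = [
    ("disallow", "/wp/wp-admin/"),
    ("allow", "/wp/wp-admin/admin-ajax.php"),
    ("disallow", "/trackback/"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/recommended/"),
    ("disallow", "/.well-known/*"),
    ("disallow", "/index.php/*"),
    ("disallow", "/search/"),
    ("disallow", "*Xhr*"),
    ("disallow", "*/xhr*"),
    ("disallow", "*/ajax/*"),
    ("disallow", "/?s="),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow) of the best match so far
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer patterns win; on equal length, allow (True) beats disallow.
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed.
    return True if best is None else best[1]


for path in ["/wp/wp-admin/admin-ajax.php", "/wp/wp-admin/options.php",
             "/?s=linux", "/blog/some-post/"]:
    print(path, "allowed" if is_allowed(path) else "disallowed")
```

Under these assumptions the Allow for admin-ajax.php overrides the shorter Disallow for /wp/wp-admin/, while the rest of that directory stays blocked. Matching here is case-sensitive, so "*Xhr*" and "*/xhr*" cover different paths.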

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

amazonbot
anthropic-ai
applebot
applebot-extended
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
google-extended
gptbot
httrack
imagesiftbot
magpie-crawler
nutch
oai-searchbot
offline explorer
omgili
omgilibot
peer39_crawler/1.0
perplexitybot
scrapy
youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cloudbooklet.com/sitemap_index.xml
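
The msnbot crawl-delay, the blocked crawler group and the sitemap record reported above can be checked with Python's standard urllib.robotparser module. The sketch below is only an approximation: ROBOTS_TXT is a hand-written reconstruction from this report (the original file's casing, ordering and invalid line are not shown here), it lists only three of the blocked agents, and summarize is a hypothetical helper. The wildcard rules of the "*" group are left out because urllib.robotparser does not interpret "*" inside paths. The site_maps() call requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report; an assumption, not the file as served.
ROBOTS_TXT = """\
User-agent: msnbot
Crawl-delay: 1

User-agent: gptbot
User-agent: claudebot
User-agent: ccbot
Disallow: /

Sitemap: https://www.cloudbooklet.com/sitemap_index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def summarize(agent, url="https://www.cloudbooklet.com/"):
    """Return (may_fetch, crawl_delay) for a user agent and URL."""
    return parser.can_fetch(agent, url), parser.crawl_delay(agent)


for agent in ["msnbot", "GPTBot", "ClaudeBot", "SomeOtherBot"]:
    may_fetch, delay = summarize(agent)
    print(f"{agent}: fetch={may_fetch} crawl-delay={delay}")

print("sitemaps:", parser.site_maps())
```

User-agent matching in urllib.robotparser is case-insensitive, so "GPTBot" is matched by the "gptbot" record.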

Warnings

  • 1 invalid line.