holoratio.blog
robots.txt

Robots Exclusion Standard data for holoratio.blog

Resource Scan

Scan Details

Site Domain holoratio.blog
Base Domain holoratio.blog
Scan Status Ok
Last Scan2024-11-18T22:16:53+00:00
Next Scan 2024-11-25T22:16:53+00:00

Last Scan

Scanned2024-11-18T22:16:53+00:00
URL https://holoratio.blog/robots.txt
Domain IPs 192.0.78.143, 192.0.78.221
Response IP 192.0.78.143
Found Yes
Hash 1e29b131cfec026cb01f997eaec74093bd5d5c021308ced8d6ec3dc584b6e0fb
SimHash 73000800c0c1

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

ai2bot
ai2bot-dolma
amazonbot
applebot-extended
anthropic-ai
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
gptbot
google-extended
imagesiftbot
kangaroo bot
meta-externalagent
meta-externalfetcher
oai-searchbot
omgili
omgilibot
petalbot
perplexitybot
scrapy
sentibot
sentibot
timpibot
turnitinbot
youbot
webzio
webzio-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://holoratio.blog/sitemap.xml
sitemap https://holoratio.blog/news-sitemap.xml

Comments

  • Block AI Crawlers
  • End Block AI Crawlers