bolha.one
robots.txt

Robots Exclusion Standard data for bolha.one

Resource Scan

Scan Details

Site Domain bolha.one
Base Domain bolha.one
Scan Status Ok
Last Scan 2024-10-04T03:00:32+00:00
Next Scan 2024-10-05T03:00:32+00:00

Last Scan

Scanned 2024-10-04T03:00:32+00:00
URL https://bolha.one/robots.txt
Domain IPs 206.42.48.100
Response IP 206.42.48.100
Found Yes
Hash 98de22078e1ff291933682c749fb21f307094fcbf052a545a35a7599d964e221
SimHash 70364b01c1c4

Groups

ai2bot
ai2bot-dolma
amazonbot
applebot
applebot-extended
bytespider
ccbot
chatgpt-user
claude-web
claudebot
diffbot
facebookbot
friendlycrawler
gptbot
google-extended
googleother
googleother-image
googleother-video
icc-crawler
imagesiftbot
meta-externalagent
meta-externalfetcher
oai-searchbot
perplexitybot
petalbot
scrapy
timpibot
velenpublicwebcrawler
webzio-extended
youbot
anthropic-ai
cohere-ai
facebookexternalhit
iaskspider/2.0
img2dataset
omgili
omgilibot

Rule Path
Disallow /

*

Rule Path
Disallow /media_proxy/
Disallow /interact/
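
Example

The two rule groups above can be checked with a short Python sketch using the standard library's urllib.robotparser. This is an illustration under stated assumptions, not the site's actual file: the robots.txt text is rebuilt from the scan data, only a subset of the listed agents is included, and the names BLOCKED_AGENTS, build_robots_txt, make_parser, and "examplebot" are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a small subset of the agents listed under "Groups" above.
BLOCKED_AGENTS = ["gptbot", "ccbot", "claudebot", "bytespider"]


def build_robots_txt(blocked_agents):
    """Rebuild a robots.txt equivalent to the two scanned groups.

    Consecutive User-agent lines share the rules that follow them,
    so every listed agent gets "Disallow: /".
    """
    lines = [f"User-agent: {agent}" for agent in blocked_agents]
    lines.append("Disallow: /")
    lines.append("")  # a blank line ends the first group
    lines += [
        "User-agent: *",
        "Disallow: /media_proxy/",
        "Disallow: /interact/",
    ]
    return "\n".join(lines) + "\n"


def make_parser(robots_text):
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser


parser = make_parser(build_robots_txt(BLOCKED_AGENTS))

# A listed agent is blocked from the whole site.
print(parser.can_fetch("gptbot", "https://bolha.one/"))
# Any other agent ("examplebot" is hypothetical) falls under the "*" group.
print(parser.can_fetch("examplebot", "https://bolha.one/"))
print(parser.can_fetch("examplebot", "https://bolha.one/media_proxy/file"))
```

To test against the live file instead of the reconstructed text, RobotFileParser also offers set_url() followed by read(), which fetches the robots.txt over the network.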