daja.cafe
robots.txt

Robots Exclusion Standard data for daja.cafe

Resource Scan

Scan Details

Site Domain daja.cafe
Base Domain daja.cafe
Scan Status Ok
Last Scan 2024-10-18T23:41:05+00:00
Next Scan 2024-11-17T23:41:05+00:00

Last Scan

Scanned 2024-10-18T23:41:05+00:00
URL https://daja.cafe/robots.txt
Domain IPs 104.21.50.93, 172.67.159.235, 2606:4700:3036::6815:325d, 2606:4700:3037::ac43:9feb
Response IP 104.21.50.93
Found Yes
Hash 88e4ad78a2f43d4e15eb1d418270b14ca5e0ea751bcf2013c16b1d58f04f7df9
SimHash 401a4a428ab3

Groups

*
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
cohere-ai
diffbot
facebookbot
gptbot
imagesiftbot
meta-externalagent
meta-externalfetcher
omgilibot
perplexitybot
timpibot
etaospider
petalbot
aspiegelbot
ahrefsbot
semrushbot
dotbot
mauibot
mj12bot

Rule Path
Disallow /

*

Rule Path
Disallow /whats-new/
Disallow /account/
Disallow /attachments/
Disallow /goto/
Disallow /posts/
Disallow /login/
Disallow /search/
Disallow /admin.php
Allow /
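
The two groups above can be checked with Python's standard-library urllib.robotparser. This is a minimal sketch, not the site's actual file: the robots.txt text below is reconstructed from the listed rules, the first group is abbreviated to three of the named agents and leaves out the "*" entry that the scan also shows there, and the agent name "ExampleBrowser" is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report above (an assumption, not the real file).
# The first group is abbreviated to three of the listed agents.
ROBOTS_TXT = """\
User-agent: gptbot
User-agent: ccbot
User-agent: claudebot
Disallow: /

User-agent: *
Disallow: /whats-new/
Disallow: /account/
Disallow: /attachments/
Disallow: /goto/
Disallow: /posts/
Disallow: /login/
Disallow: /search/
Disallow: /admin.php
Allow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# A named crawler from the first group is blocked from the whole site.
print(parser.can_fetch("GPTBot", "https://daja.cafe/"))

# Any other agent falls through to the "*" group: listed paths are
# disallowed, everything else is allowed by the trailing "Allow: /".
print(parser.can_fetch("ExampleBrowser", "https://daja.cafe/posts/1"))
print(parser.can_fetch("ExampleBrowser", "https://daja.cafe/threads/1"))
```

Python's parser applies the first matching rule within a group, so the order above (Disallow lines first, Allow / last) matters here. A crawler that uses longest-match precedence should reach the same verdicts for these paths, since each Disallow path is longer than "/".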

Other Records

Field Value
sitemap https://www.daja.cafe/sitemap.xml