daja.cafe
robots.txt
Robots Exclusion Standard data for daja.cafe
Resource Scan
Scan Details
Site Domain | daja.cafe |
Base Domain | daja.cafe |
Scan Status | Ok |
Last Scan | 2024-10-18T23:41:05+00:00 |
Next Scan | 2024-11-17T23:41:05+00:00 |
Last Scan
Scanned | 2024-10-18T23:41:05+00:00 |
URL | https://daja.cafe/robots.txt |
Domain IPs | 104.21.50.93, 172.67.159.235, 2606:4700:3036::6815:325d, 2606:4700:3037::ac43:9feb |
Response IP | 104.21.50.93 |
Found | Yes |
Hash | 88e4ad78a2f43d4e15eb1d418270b14ca5e0ea751bcf2013c16b1d58f04f7df9 |
SimHash | 401a4a428ab3 |
Groups
*
anthropic-ai
applebot-extended
bytespider
ccbot
chatgpt-user
claudebot
cohere-ai
diffbot
facebookbot
gptbot
imagesiftbot
meta-externalagent
meta-externalfetcher
omgilibot
perplexitybot
timpibot
etaospider
petalbot
aspiegelbot
ahrefsbot
semrushbot
dotbot
mauibot
mj12bot
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Disallow | /whats-new/ |
Disallow | /account/ |
Disallow | /attachments/ |
Disallow | /goto/ |
Disallow | /posts/ |
Disallow | /login/ |
Disallow | /search/ |
Disallow | /admin.php |
Allow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.daja.cafe/sitemap.xml |