eniehack.net
robots.txt

Robots Exclusion Standard data for eniehack.net

Resource Scan

Scan Details

Site Domain eniehack.net
Base Domain eniehack.net
Scan Status Ok
Last Scan2025-08-11T14:28:34+00:00
Next Scan 2025-09-10T14:28:34+00:00

Last Scan

Scanned2025-08-11T14:28:34+00:00
URL https://eniehack.net/robots.txt
Redirect https://www.eniehack.net/robots.txt
Redirect Domain www.eniehack.net
Redirect Base eniehack.net
Domain IPs 104.21.20.198, 172.67.194.30, 2606:4700:3031::6815:14c6, 2606:4700:3037::ac43:c21e
Redirect IPs 104.21.20.198, 172.67.194.30, 2606:4700:3031::6815:14c6, 2606:4700:3037::ac43:c21e
Response IP 172.67.194.30
Found Yes
Hash fa02bf474f6dcb903c94800b9cba2ebe246dfacb7e18340476d105def78e8999
SimHash 6db4914bc1d2

Groups

*

Rule Path
Disallow /~eniehack/diary/

claudebot
gptbot
google-extended
ccbot
applebot-extended

Rule Path
Disallow /~eniehack/assets/img/

gptbot
google-extended
hatena antenna
steeler
icc-crawler
ccbot
claudebot
researchscan
obot
netcraftsurveyagent
sbintuitionsbot
cotoyogi
archive.org_bot
ia_archiver
ia_archiver-web.archive.org
heritrix
discordbot
slackbot
slackbotlinkexpanding
mastodon
applebot-extended

Rule Path
Allow /~eniehack/diary/

*

Rule Path
Allow /~eniehack/diary/*.rdf
Allow /~eniehack/diary/*.atom.xml