snuze.shaunc.com
robots.txt

Robots Exclusion Standard data for snuze.shaunc.com

Resource Scan

Scan Details

Site Domain snuze.shaunc.com
Base Domain shaunc.com
Scan Status Ok
Last Scan2024-06-01T06:27:30+00:00
Next Scan 2024-06-15T06:27:30+00:00

Last Scan

Scanned2024-06-01T06:27:30+00:00
URL https://snuze.shaunc.com/robots.txt
Domain IPs 172.93.52.73
Response IP 172.93.52.73
Found Yes
Hash 7a38d4307916e8ddf5fa82b01513dc22ac5709506afa1045233f85c59e675484
SimHash f05c6556a1c3

Groups

ia_archiver

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

mixnodecache

Rule Path
Disallow /

checkmarknetwork/1.0 (+http://www.checkmarknetwork.com/spider.html)

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

*

Rule Path
Disallow /css/
Disallow /doxygen/current/
Disallow /js/
Disallow /img/