aucd.org
robots.txt

Robots Exclusion Standard data for aucd.org

Resource Scan

Scan Details

Site Domain aucd.org
Base Domain aucd.org
Scan Status Ok
Last Scan 2024-04-27T23:12:49+00:00
Next Scan 2024-05-27T23:12:49+00:00

Last Scan

Scanned 2024-04-27T23:12:49+00:00
URL https://www.aucd.org/robots.txt
Domain IPs 104.20.16.32, 104.20.17.32, 172.67.2.179, 2606:4700:10::6814:1020, 2606:4700:10::6814:1120, 2606:4700:10::ac43:2b3
Response IP 104.20.16.32
Found Yes
Hash bad9615db91db54049ccc8d53ed9e7e75096a8460c6e2d300afb1fec74c95f92
SimHash 0000d89883fb

Groups

becomebot

Rule Path
Disallow /

psbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

indy library

Rule Path
Disallow /

http://www.almaden.ibm.com/cs/crawler

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

*

Rule Path
Disallow /nirs/db

powermapper

Rule Path
Allow /
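
The groups above can be checked programmatically. The following is a minimal sketch, assuming the scanned file is equivalent to the robots.txt text reconstructed below from the listed groups. The original file's exact layout, group order, comments, and any other directives are not shown in this report, so ROBOTS_TXT, build_parser, BASE, and the ExampleBot user agent are illustrative assumptions. It uses Python's standard urllib.robotparser module, whose matching is a simple prefix test and may differ in detail from how a given crawler interprets the same rules.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the groups in this report;
# the real robots.txt may differ in order, spacing, and extra directives.
ROBOTS_TXT = """\
User-agent: becomebot
Disallow: /

User-agent: psbot
Disallow: /

User-agent: ia_archiver
Disallow: /

User-agent: indy library
Disallow: /

User-agent: http://www.almaden.ibm.com/cs/crawler
Disallow: /

User-agent: twiceler
Disallow: /

User-agent: *
Disallow: /nirs/db

User-agent: powermapper
Allow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
BASE = "https://www.aucd.org"  # host taken from the scanned URL

# A crawler named in its own group is blocked from the whole site.
print(parser.can_fetch("psbot", BASE + "/"))
# Any other crawler (hypothetical name) falls back to the "*" group.
print(parser.can_fetch("ExampleBot", BASE + "/nirs/db/report"))
print(parser.can_fetch("ExampleBot", BASE + "/about"))
# powermapper has its own group, so the "*" rule does not apply to it.
print(parser.can_fetch("powermapper", BASE + "/nirs/db/report"))
```

Under these assumptions, the named crawlers with Disallow / are refused everywhere, unnamed crawlers are refused only under /nirs/db, and powermapper is allowed everywhere because a crawler that matches a specific group does not also inherit the rules of the * group.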