nercc.org
robots.txt

Robots Exclusion Standard data for nercc.org

Resource Scan

Scan Details

Site Domain nercc.org
Base Domain nercc.org
Scan Status Ok
Last Scan2025-09-07T14:28:47+00:00
Next Scan 2025-10-07T14:28:47+00:00

Last Scan

Scanned2025-09-07T14:28:47+00:00
URL https://nercc.org/robots.txt
Redirect https://www.nasrcc.org/robots.txt
Redirect Domain www.nasrcc.org
Redirect Base nasrcc.org
Domain IPs 35.197.52.145
Redirect IPs 35.197.52.145
Response IP 35.197.52.145
Found Yes
Hash eb48ade891c04674bc6cebeecd293f37b6413bd83874caa16d435311add7c124
SimHash 6c6441536755

Groups

googlebot

Rule Path
Disallow /out/
Allow /*

*

Rule Path
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /cgi-bin/
Disallow /trackback/
Disallow /comments/
Disallow /out/
Disallow /readme.html
Disallow /license.txt
Disallow /xmlrpc.php