nrao.edu
robots.txt
Robots Exclusion Standard data for nrao.edu
Resource Scan
Scan Details
Site Domain | nrao.edu |
Base Domain | nrao.edu |
Scan Status | Ok |
Last Scan | 2024-10-28T11:49:34+00:00 |
Next Scan | 2024-11-27T11:49:34+00:00 |
Last Scan
Scanned | 2024-10-28T11:49:34+00:00 |
URL | https://nrao.edu/robots.txt |
Redirect | https://www.nrao.edu/robots.txt |
Redirect Domain | www.nrao.edu |
Redirect Base | nrao.edu |
Domain IPs | 192.33.115.129 |
Redirect IPs | 192.33.115.129 |
Response IP | 192.33.115.129 |
Found | Yes |
Hash | e00870f41ec4fb6533729eeb967d3a8768a1ba0f2ffb41f4783ca7de5e32e0f4 |
SimHash | ed131be666b3 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /cgi-internal/ |
Disallow | /internal |
Disallow | /pubcompdocs/ |
Disallow | /gold/ |
Disallow | /php/ |
Disallow | /admin/obs/flash_reports/ |
Disallow | /admin/obs/flash_reports.shtml |
Disallow | /admin/dsaa/ |
Comments