nrao.edu
robots.txt

Robots Exclusion Standard data for nrao.edu

Resource Scan

Scan Details

Site Domain nrao.edu
Base Domain nrao.edu
Scan Status Ok
Last Scan2024-10-28T11:49:34+00:00
Next Scan 2024-11-27T11:49:34+00:00

Last Scan

Scanned2024-10-28T11:49:34+00:00
URL https://nrao.edu/robots.txt
Redirect https://www.nrao.edu/robots.txt
Redirect Domain www.nrao.edu
Redirect Base nrao.edu
Domain IPs 192.33.115.129
Redirect IPs 192.33.115.129
Response IP 192.33.115.129
Found Yes
Hash e00870f41ec4fb6533729eeb967d3a8768a1ba0f2ffb41f4783ca7de5e32e0f4
SimHash ed131be666b3

Groups

gsa-crawler

Rule Path
Disallow /cgi-bin/
Disallow /cgi-internal/
Disallow /php/cal/

*

Rule Path
Disallow /cgi-bin/
Disallow /cgi-internal/
Disallow /internal
Disallow /pubcompdocs/
Disallow /gold/
Disallow /php/
Disallow /admin/obs/flash_reports/
Disallow /admin/obs/flash_reports.shtml
Disallow /admin/dsaa/

Comments

  • robots.txt for www.nrao.edu