nrc.gov
robots.txt

Robots Exclusion Standard data for nrc.gov

Resource Scan

Scan Details

Site Domain nrc.gov
Base Domain nrc.gov
Scan Status Ok
Last Scan2024-06-07T06:32:17+00:00
Next Scan 2024-07-07T06:32:17+00:00

Last Scan

Scanned2024-06-07T06:32:17+00:00
URL https://www.nrc.gov/robots.txt
Domain IPs 23.39.11.53, 2600:1413:b000:480::e2c, 2600:1413:b000:483::e2c
Response IP 23.39.11.53
Found Yes
Hash dc463238605ded8ad36761a813ffab3b4d393a5decc11e9ba90c92f1d5b71501
SimHash 2c14dc608dd2

Groups

*

No rules defined. All paths allowed.

akamai-sitesnapshot

No rules defined. All paths allowed.

amazonbot

Product Comment
amazonbot Amazon's user agent
Rule Path Comment
Disallow /docs/ disallow this directory

Other Records

Field Value
sitemap https://www.nrc.gov/sitemapindex.xml
sitemap https://www.nrc.gov/sitemapindex.xml

Warnings

  • 4 invalid lines.