nh.gov
robots.txt

Robots Exclusion Standard data for nh.gov

Resource Scan

Scan Details

Site Domain nh.gov
Base Domain nh.gov
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-21T20:45:56+00:00
Next Scan 2024-11-20T20:45:56+00:00

Last Successful Scan

Scanned2024-07-01T20:44:48+00:00
URL https://www.nh.gov/robots.txt
Domain IPs 23.15.240.34, 23.15.240.58, 2600:141a:8000::1748:58cb, 2600:141a:8000::1748:58d9
Response IP 23.15.240.34
Found Yes
Hash 8706956848d97d296db8cfb4948b033749014a8fb7158792b7328ebfe40e1cda
SimHash 23309959e7d3

Groups

*

Rule Path
Disallow /graphics/
Disallow /Templates/
Disallow /error/
Disallow /Library/
Disallow /search/
Disallow /data/
Disallow /_mm/
Disallow /_notes/
Disallow /_baks/
Disallow /MMWIP/

Comments

  • addresses all robots by using wild card *
  • list folders robots are not allowed to index
  • list specific files robots are not allowed to index
  • format Disallow: /file.htm
  • End of robots.txt file