/.well-known/

Log In Sign Up

nixonlibrary.gov
robots.txt

Robots Exclusion Standard data for nixonlibrary.gov

Archived Snapshots

Resource Scan

Scan Details

Site Domain	nixonlibrary.gov
Base Domain	nixonlibrary.gov
Scan Status	Ok
Last Scan	2024-06-30T04:53:40+00:00
Next Scan	2024-07-30T04:53:40+00:00

Last Scan

Scanned	2024-06-30T04:53:40+00:00
URL	https://nixonlibrary.gov/robots.txt
Redirect	https://www.nixonlibrary.gov/robots.txt
Redirect Domain	www.nixonlibrary.gov
Redirect Base	nixonlibrary.gov
Domain IPs	2600:1f18:43e8:f301:9046:c05f:75e7:c481, 2600:1f18:43e8:f302:b470:d266:4d03:3ed8, 52.206.136.3, 52.44.89.206
Redirect IPs	18.155.68.102, 18.155.68.103, 18.155.68.4, 18.155.68.96, 2600:9000:23d2:1c00:1f:92b9:4c0:93a1, 2600:9000:23d2:2000:1f:92b9:4c0:93a1, 2600:9000:23d2:2400:1f:92b9:4c0:93a1, 2600:9000:23d2:2600:1f:92b9:4c0:93a1, 2600:9000:23d2:8200:1f:92b9:4c0:93a1, 2600:9000:23d2:a800:1f:92b9:4c0:93a1, 2600:9000:23d2:d200:1f:92b9:4c0:93a1, 2600:9000:23d2:e000:1f:92b9:4c0:93a1
Response IP	18.155.68.103
Found	Yes
Hash	d93464f46cfc3ee42692ebe47c2feff6243b7a477ff8ec5c8a4e4e1dcd1eb704
SimHash	b8129d0bc564

Groups

*

No rules defined. All paths allowed.

Back to top

Other Records

Field

Value

sitemap

https://www.nixonlibrary.gov/sitemap.xml

sitemap

https://www.nixonlibrary.gov/sites/default/files/sitemap.xml

Back to top

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html

Back to top