www.ars.usda.gov
robots.txt

Robots Exclusion Standard data for www.ars.usda.gov

Resource Scan

Scan Details

Site Domain www.ars.usda.gov
Base Domain usda.gov
Scan Status Ok
Last Scan 2024-09-17T17:49:28+00:00
Next Scan 2024-10-17T17:49:28+00:00

Last Scan

Scanned 2024-09-17T17:49:28+00:00
URL https://www.ars.usda.gov/robots.txt
Domain IPs 52.245.234.40
Response IP 52.245.234.40
Found Yes
Hash f08c2c6cbd6d1c59b983551f2c72e8e9a0213b4da6075e3e2c74b06202eb7eb0
SimHash 3b0678476d90

Groups

*

Rule Path
Disallow /bin/
Disallow /config/
Disallow /css/
Disallow /Jquery/
Disallow /views/
Disallow /scripts/
Disallow /umbraco/
Disallow /umbraco_client/
Disallow /App_Plugins/
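
The rules in the * group can be checked against concrete paths with a short sketch. It rebuilds a robots.txt body from the listed Disallow paths and feeds it to Python's standard urllib.robotparser. The agent name ExampleBot, the helper names, and the sample paths are hypothetical and used only for illustration; the rebuilt file is an approximation of the scanned one, not a byte-for-byte copy.

```python
from urllib.robotparser import RobotFileParser

# Disallow paths copied from the "*" group in the scan above.
DISALLOWED = [
    "/bin/", "/config/", "/css/", "/Jquery/", "/views/",
    "/scripts/", "/umbraco/", "/umbraco_client/", "/App_Plugins/",
]

def build_robots_lines(paths):
    """Rebuild an approximate robots.txt body (hypothetical helper)."""
    return ["User-agent: *"] + [f"Disallow: {p}" for p in paths]

parser = RobotFileParser()
parser.parse(build_robots_lines(DISALLOWED))

def is_allowed(path, agent="ExampleBot"):
    """Return True if `agent` may fetch `path` on www.ars.usda.gov."""
    return parser.can_fetch(agent, "https://www.ars.usda.gov" + path)

if __name__ == "__main__":
    # Sample paths are made up for the demonstration.
    for sample in ["/umbraco/login", "/css/site.css", "/research/"]:
        print(sample, is_allowed(sample))
```

Matching here is a plain prefix comparison on the URL path, so any path outside the nine listed prefixes is treated as allowed for every user agent.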

Other Records

Field Value
sitemap https://www.ars.usda.gov/umbraco/usda/sitemap/index
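
One detail worth noting: the sitemap URL sits under /umbraco/, which the * group disallows. The sketch below, again using urllib.robotparser, reads the sitemap record back and tests it against that rule. The agent name ExampleBot is hypothetical, and only the one relevant Disallow line is reproduced. How a given crawler treats a sitemap URL that falls under a Disallow prefix is crawler-specific, so this shows only what a strict prefix match reports.

```python
from urllib.robotparser import RobotFileParser

SITEMAP_URL = "https://www.ars.usda.gov/umbraco/usda/sitemap/index"

# Minimal excerpt: the one Disallow rule that overlaps the sitemap path,
# plus the sitemap record from "Other Records".
lines = [
    "User-agent: *",
    "Disallow: /umbraco/",
    f"Sitemap: {SITEMAP_URL}",
]

parser = RobotFileParser()
parser.parse(lines)

# site_maps() returns the listed sitemap URLs, or None if there are none
# (available in Python 3.8 and later).
sitemaps = parser.site_maps()

# ExampleBot is a made-up agent name for the demonstration.
sitemap_fetchable = parser.can_fetch("ExampleBot", SITEMAP_URL)

if __name__ == "__main__":
    print("sitemaps:", sitemaps)
    print("fetchable under strict prefix matching:", sitemap_fetchable)
```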