statefarm.com
robots.txt

Robots Exclusion Standard data for statefarm.com

Resource Scan

Scan Details

Site Domain statefarm.com
Base Domain statefarm.com
Scan Status Ok
Last Scan2024-11-06T08:26:32+00:00
Next Scan 2024-11-20T08:26:32+00:00

Last Scan

Scanned2024-11-06T08:26:32+00:00
URL https://statefarm.com/robots.txt
Redirect https://www.statefarm.com/robots.txt
Redirect Domain www.statefarm.com
Redirect Base statefarm.com
Domain IPs 152.195.54.7
Redirect IPs 117.18.238.236
Response IP 117.18.238.236
Found Yes
Hash f4453314ec88eca2bca601f540db9f83b730304f4c6372491beb6720f52b7b8b
SimHash 5d31f11424b7

Groups

petalbot

Rule Path
Disallow /

*

Rule Path
Allow /.well-known/
Disallow /content/dam/sf-library/en-us/secure/legacy/xlsx/*.xls
Disallow /content/dam/sf-library/en-us/secure/legacy/pdf/*.pdf
Disallow /content/dam/sf-library/en-us/secure/legacy/team-west/*.pdf
Disallow /content/dam/sf-library/es-us/secure/legacy/xlsx/*.xls
Disallow /content/dam/sf-library/es-us/secure/legacy/pdf/*.pdf
Disallow /content/dam/sf-library/es-us/secure/legacy/team-west/*.pdf
Disallow /errors/
Disallow /_css/
Disallow /_images/
Disallow /_js/
Disallow /cdn/
Disallow /jscript/
Disallow /pzn_json_inc/
Disallow /status/
Disallow /role/
Disallow /general/
Disallow /samples/
Disallow /pdf/us/merchant-welcome-kit.pdf
Disallow /discountdoublecheck/

Other Records

Field Value
sitemap https://www.statefarm.com/sitemap.xml

Comments

  • Disallow - documents that shouldn't be indexed. Including both English and Spanish, just in case pages link to docs in other languages.
  • Disallow - old ones
  • Sitemaps