fdacs.gov
robots.txt

Robots Exclusion Standard data for fdacs.gov

Resource Scan

Scan Details

Site Domain fdacs.gov
Base Domain fdacs.gov
Scan Status Ok
Last Scan 2024-09-16T03:21:28+00:00
Next Scan 2024-10-16T03:21:28+00:00

Last Scan

Scanned 2024-09-16T03:21:28+00:00
URL https://www.fdacs.gov/robots.txt
Domain IPs 76.76.21.164, 76.76.21.22
Response IP 76.76.21.123
Found Yes
Hash 6a459fc009a59ac817f014ac10bfe3a8c21f71ca163b97b653c25c95d7c10083
SimHash 681151c1c0f1

Groups

*

Rule Path
Disallow /

googlebot
bingbot
bingpreview
msnbot
slurp
duckduckbot
applebot
ia_archiver
facebookexternalhit
twitterbot
linkedinbot

Rule Path
Allow /
Disallow /admin/
Disallow /site_admin/
Disallow /search
Disallow /search/
Disallow /content/search
Disallow /content/advancedsearch
Disallow /content/tipafriend
Disallow /layout/set/print
Disallow /rss
Disallow /media/
Disallow /ezinfo/
Disallow /user/
Disallow /test-area/

Comments

  • Disallow all
  • But allow only important bots
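
The two groups above can be read as a small decision procedure: a crawler whose name appears in the named group uses that group's rules, and every other crawler falls back to the `*` group. The Python sketch below illustrates one way to evaluate the listed rules. It assumes longest-prefix matching with Allow winning ties, as described in RFC 9309, and plain prefix comparison with no `*` or `$` wildcard handling (none of the listed paths use them). The names `NAMED_AGENTS`, `NAMED_RULES`, `DEFAULT_RULES`, `rules_for`, and `is_allowed` are hypothetical and not part of the scan data. Individual crawlers may interpret the file differently.

```python
# Rules transcribed from the scan above. The data layout is an assumption
# made for illustration, not the format of the original robots.txt file.
DEFAULT_RULES = [("disallow", "/")]

NAMED_AGENTS = {
    "googlebot", "bingbot", "bingpreview", "msnbot", "slurp", "duckduckbot",
    "applebot", "ia_archiver", "facebookexternalhit", "twitterbot", "linkedinbot",
}

NAMED_RULES = [
    ("allow", "/"),
    ("disallow", "/admin/"),
    ("disallow", "/site_admin/"),
    ("disallow", "/search"),
    ("disallow", "/search/"),
    ("disallow", "/content/search"),
    ("disallow", "/content/advancedsearch"),
    ("disallow", "/content/tipafriend"),
    ("disallow", "/layout/set/print"),
    ("disallow", "/rss"),
    ("disallow", "/media/"),
    ("disallow", "/ezinfo/"),
    ("disallow", "/user/"),
    ("disallow", "/test-area/"),
]


def rules_for(agent: str):
    """Pick the named group for a listed crawler, else the '*' group."""
    return NAMED_RULES if agent.lower() in NAMED_AGENTS else DEFAULT_RULES


def is_allowed(agent: str, path: str) -> bool:
    """Longest matching prefix wins; Allow wins a tie; no match means allowed."""
    best_len = -1
    allowed = True
    for kind, prefix in rules_for(agent):
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("googlebot", "/"),
        ("googlebot", "/admin/users"),
        ("bingbot", "/search"),
        ("somebot", "/"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, a listed crawler may fetch most of the site but not the administrative, search, and media paths, while an unlisted crawler is refused everywhere.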