fdacs.gov
robots.txt

Robots Exclusion Standard data for fdacs.gov

Resource Scan

Scan Details

Site Domain fdacs.gov
Base Domain fdacs.gov
Scan Status Ok
Last Scan2024-11-15T03:22:25+00:00
Next Scan 2024-12-15T03:22:25+00:00

Last Scan

Scanned2024-11-15T03:22:25+00:00
URL https://www.fdacs.gov/robots.txt
Domain IPs 76.76.21.22, 76.76.21.98
Response IP 76.76.21.241
Found Yes
Hash f46c514f10ca01c5d39c41ae5832a6b0ffb0d9e75064b471fea7adcdd30ba58a
SimHash 681151c1c0f1

Groups

*

Rule Path
Disallow /

googlebot
bingbot
bingpreview
msnbot
slurp
duckduckbot
applebot
ia_archiver
facebookexternalhit
twitterbot
linkedinbot

Rule Path
Allow /
Disallow /admin/
Disallow /site_admin/
Disallow /search
Disallow /search/
Disallow /content/search
Disallow /content/advancedsearch
Disallow /content/tipafriend
Disallow /layout/set/print
Disallow /rss
Disallow /media/
Disallow /ezinfo/
Disallow /user/
Disallow /test-area/

Comments

  • Disallow all
  • But allow only important bots