gfrisicaro.it
robots.txt

Robots Exclusion Standard data for gfrisicaro.it

Resource Scan

Scan Details

Site Domain gfrisicaro.it
Base Domain gfrisicaro.it
Scan Status Ok
Last Scan2024-11-10T18:20:29+00:00
Next Scan 2024-11-17T18:20:29+00:00

Last Scan

Scanned2024-11-10T18:20:29+00:00
URL https://gfrisicaro.it/robots.txt
Redirect https://www.gfrisicaro.it/robots.txt
Redirect Domain www.gfrisicaro.it
Redirect Base gfrisicaro.it
Domain IPs 89.46.109.33
Redirect IPs 89.46.109.33
Response IP 89.46.109.33
Found Yes
Hash d64c2276af2e06284273ab460ed1795c52e758826424f055c8ee137847cd002f
SimHash a00e1d58c3e0

Groups

*

Rule Path
Disallow /tmp/
Disallow /TouchMe/
Disallow /appservices/
Disallow /ENI/
Disallow /DataStampaBB/
Disallow /OTA/
Disallow /DS/
Disallow /corimBackup/
Disallow /console/
Disallow /berry-app-ws/

Comments

  • If the Joomla site is installed within a folder such as at
  • e.g. www.example.com/joomla/ the robots.txt file MUST be
  • moved to the site root at e.g. www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to the disallowed
  • path, e.g. the Disallow rule for the /administrator/ folder
  • MUST be changed to read Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://www.sxw.org.uk/computing/robots/check.html