croceviola.org
robots.txt

Robots Exclusion Standard data for croceviola.org

Resource Scan

Scan Details

Site Domain croceviola.org
Base Domain croceviola.org
Scan Status Ok
Last Scan 2025-04-01T02:33:05+00:00
Next Scan 2025-05-01T02:33:05+00:00

Last Scan

Scanned 2025-04-01T02:33:05+00:00
URL https://croceviola.org/robots.txt
Domain IPs 86.107.32.194
Response IP 86.107.32.194
Found Yes
Hash 7789dc95bbafcfed6ff308cb8c3eecd1a868cc70f16c1f54b8bf4dbc6341b059
SimHash e31f1d5acbe4

Groups

*

Rule Path
Disallow /wp-
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/
Allow /wp-content/uploads
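
The rules in this group overlap: /wp- and /wp-content/ both cover /wp-content/uploads, which is also explicitly allowed. The following is a minimal sketch of how a crawler could resolve that overlap, assuming the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The function name is_allowed and the RULES list are illustrative only; the sketch ignores wildcards and is not necessarily how this scanner or any particular crawler evaluates the file.

```python
# Rules copied from the "*" group listed above.
RULES = [
    ("disallow", "/wp-"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/wp-content/"),
    ("allow", "/wp-content/uploads"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: plain prefix matching only (no '*' or '$' handling).
    """
    # Collect (prefix length, is_allow) for every rule that prefixes the path.
    matches = [
        (len(rule_path), kind == "allow")
        for kind, rule_path in rules
        if path.startswith(rule_path)
    ]
    if not matches:
        return True  # no rule matches, so crawling is allowed by default
    # The longest prefix wins; on equal length, allow (True) beats disallow.
    return max(matches)[1]


print(is_allowed("/wp-content/uploads/logo.png"))  # True
print(is_allowed("/wp-content/themes/style.css"))  # False
print(is_allowed("/wp-login.php"))                 # False
print(is_allowed("/about/"))                       # True
```

A parser that instead applies rules in file order would block the uploads path, because /wp- is listed before the Allow rule.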

Other Records

Field Value
sitemap http://www.croceviola.org/page-sitemap.xml

Comments

  • If the Joomla site is installed within a folder, e.g. www.example.com/joomla/, the robots.txt file MUST be moved to the site root, e.g. www.example.com/robots.txt, AND the joomla folder name MUST be prefixed to the disallowed path; e.g. the Disallow rule for the /administrator/ folder MUST be changed to read Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see: http://www.robotstxt.org/orig.html
  • For syntax checking, see: http://www.sxw.org.uk/computing/robots/check.html
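
The prefixing step described in the first comment can be sketched as follows. This is only an illustration under stated assumptions: prefix_rules is a hypothetical helper, and the folder name "joomla" and the /administrator/ path are taken from the comment's own example, not from this site's rules.

```python
def prefix_rules(paths, folder):
    """Prefix each rule path with the install folder (hypothetical helper)."""
    # Normalise the folder so "joomla", "/joomla" and "joomla/" behave alike.
    base = "/" + folder.strip("/")
    return [base + path for path in paths]


print(prefix_rules(["/administrator/"], "joomla"))  # ['/joomla/administrator/']
```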