webxcon.com
robots.txt

Robots Exclusion Standard data for webxcon.com

Resource Scan

Scan Details

Site Domain webxcon.com
Base Domain webxcon.com
Scan Status Ok
Last Scan2024-05-15T08:40:06+00:00
Next Scan 2024-06-14T08:40:06+00:00

Last Scan

Scanned2024-05-15T08:40:06+00:00
URL https://webxcon.com/robots.txt
Redirect https://www.webxcon.com/robots.txt
Redirect Domain www.webxcon.com
Redirect Base webxcon.com
Domain IPs 104.26.0.72, 104.26.1.72, 172.67.70.118, 2606:4700:20::681a:148, 2606:4700:20::681a:48, 2606:4700:20::ac43:4676
Redirect IPs 104.26.0.72, 104.26.1.72, 172.67.70.118, 2606:4700:20::681a:148, 2606:4700:20::681a:48, 2606:4700:20::ac43:4676
Response IP 172.67.70.118
Found Yes
Hash fba1dcf39b7d82392d8d664467e35a69987dfdc4583a55210aa2149ac66ee9c4
SimHash e3179d5ac3f5

Groups

*

Rule Path
Allow /*.js*
Allow /*.css*
Allow /*.png*
Allow /*.jpg*
Allow /*.gif*
Disallow /administrator/
Disallow /cache/
Disallow /cli/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /libraries/
Disallow /logs/
Disallow /tmp/
Disallow /component/
Disallow /itemlist/
Disallow /user/
Disallow /item/
Disallow /mailto/
Disallow /print/
Disallow /frontpage/
Disallow /content/
Disallow /rsform/
Disallow /form/
Disallow /*?start*
Allow /*.js$
Allow /*.css$
Allow /libraries/gantry/css/*.css$
Allow /libraries/gantry/js/*.js
Allow /cache/plg_jch_optimize/*.js
Allow /cache/plg_jch_optimize/*.css

Comments

  • If the Joomla site is installed within a folder such as at
  • e.g. www.example.com/joomla/ the robots.txt file MUST be
  • moved to the site root at e.g. www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to the disallowed
  • path, e.g. the Disallow rule for the /administrator/ folder
  • MUST be changed to read Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://www.sxw.org.uk/computing/robots/check.html