community.joomla.org
robots.txt

Robots Exclusion Standard data for community.joomla.org

Resource Scan

Scan Details

Site Domain community.joomla.org
Base Domain joomla.org
Scan Status Ok
Last Scan2025-08-02T17:31:27+00:00
Next Scan 2025-08-16T17:31:27+00:00

Last Scan

Scanned2025-08-02T17:31:27+00:00
URL https://community.joomla.org/robots.txt
Domain IPs 18.218.35.232, 2600:1f16:706:400:3b4a:40aa:9071:dde2
Response IP 18.218.35.232
Found Yes
Hash aa2421e99d8e2b18d29b6b86544699409ce63aa0622626d1f101a3c92470767c
SimHash 201c0461c584

Groups

*

Rule Path
Allow /*.js***************
Allow /*.css**************
Allow /*.png**************
Allow /*.jpg**************
Allow /*.jpeg**************
Allow /*.gif**************
Allow /*.eot**************
Allow /*.woff**************
Allow /*.ttf**************
Allow /*.svg**************
Allow /*.otf**************
Allow /*.pdf**************
Allow /*.PNG**************
Allow /*.JPG**************
Allow /*.JPEG**************
Disallow /*?start=
Disallow /*?limitstart=
Disallow /*?site=
Disallow /*?ostm_view=
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cgi-bin/
Disallow /cli/
Disallow /includes/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /tmp/
Disallow /videos/

Other Records

Field Value
sitemap https://community.joomla.org/sitemap

Comments

  • Please don't remove folders from disallow.
  • The allows at the top allow any of the mimetypes listed to be crawled within any folder
  • using long-tail wildcards, these ignore the disallows for the folders below.
  • This gives full render for the search engines whilst preventing full crawls of system
  • folders. The images folder is allowed to allow twitter/facebook sharing of images.
  • THIS ALLOWS FULL RENDER AT ENGINES
  • THESE FOLDERS SHOULD NEVER BE CRAWLED
  • JSitemap entries