pldspace.com
robots.txt

Robots Exclusion Standard data for pldspace.com

Resource Scan

Scan Details

Site Domain pldspace.com
Base Domain pldspace.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't establish SSL connection.
Last Scan 2025-07-30T01:36:18+00:00
Next Scan 2025-08-13T01:36:18+00:00

Last Successful Scan

Scanned 2025-06-20T21:48:42+00:00
URL https://pldspace.com/robots.txt
Domain IPs 2001:8d8:100f:f000::2ad, 217.160.0.240
Response IP 217.160.0.240
Found Yes
Hash 97c87f1e050ea0ead076373282f50734e90f71734b825931f9a4739ca0d5ace8
SimHash a21d1d1803f4

Groups

*

Rule Path
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/
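
As a rough illustration of how a crawler might apply the group above, the following sketch rebuilds the rules in memory and checks two URLs with Python's standard `urllib.robotparser`. The user agent name `ExampleBot` and the two test URLs are assumptions made for the example; they do not come from the scan.

```python
from urllib.robotparser import RobotFileParser

# Paths copied from the "*" group in the last successful scan.
DISALLOWED = [
    "/administrator/", "/bin/", "/cache/", "/cli/", "/components/",
    "/includes/", "/installation/", "/language/", "/layouts/",
    "/libraries/", "/logs/", "/modules/", "/plugins/", "/tmp/",
]

def build_parser(paths):
    """Build a parser from an in-memory robots.txt instead of fetching one."""
    lines = ["User-agent: *"] + [f"Disallow: {p}" for p in paths]
    parser = RobotFileParser()
    parser.parse(lines)
    return parser

parser = build_parser(DISALLOWED)

# "ExampleBot" is a hypothetical crawler name; it falls under the "*" group.
blocked = parser.can_fetch("ExampleBot", "https://pldspace.com/administrator/index.php")
allowed = parser.can_fetch("ExampleBot", "https://pldspace.com/index.php")
print(blocked, allowed)
```

Parsing from a list of lines avoids a network request, which matters here because the most recent fetch of the live file failed at the SSL stage.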

Comments

  • If the Joomla site is installed within a folder, e.g. www.example.com/joomla/, then the robots.txt file MUST be moved to the site root, e.g. www.example.com/robots.txt, AND the joomla folder name MUST be prefixed to all of the paths.
  • E.g. the Disallow rule for the /administrator/ folder MUST be changed to read: Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see: http://www.robotstxt.org/orig.html
  • For syntax checking, see: http://tool.motoricerca.info/robots-checker.phtml
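
The prefixing step described in the comments can be sketched as a small helper. This is only an illustration: the function name `prefix_rules` is hypothetical, and the folder name `joomla` is the example used in the comments, not a detail of this site.

```python
def prefix_rules(paths, folder):
    """Return Disallow lines with the install folder prefixed to each path."""
    # Normalize so "joomla", "/joomla" and "/joomla/" all behave the same.
    folder = "/" + folder.strip("/")
    return [f"Disallow: {folder}{path}" for path in paths]

# Two of the scanned paths, rewritten for a site installed under /joomla/.
rules = prefix_rules(["/administrator/", "/tmp/"], "joomla")
print("\n".join(rules))
```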