compromatbase.info
robots.txt

Robots Exclusion Standard data for compromatbase.info

Resource Scan

Scan Details

Site Domain compromatbase.info
Base Domain compromatbase.info
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-03-28T01:58:25+00:00
Next Scan 2025-06-26T01:58:25+00:00

Last Successful Scan

Scanned2023-11-11T22:29:53+00:00
URL https://compromatbase.info/robots.txt
Domain IPs 104.21.66.115, 172.67.159.150, 2606:4700:3030::ac43:9f96, 2606:4700:3034::6815:4273
Response IP 172.67.159.150
Found Yes
Hash b8f016493cb8f80812e680ae6007a63e7f3e128ce56211f4cb80b3dadbfd941a
SimHash e21d1d1983f4

Groups

*

Rule Path
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/

Other Records

Field Value
sitemap /sitemap/sitemap-index-gz.xml

Comments

  • If the Joomla site is installed within a folder
  • eg www.example.com/joomla/ then the robots.txt file
  • MUST be moved to the site root
  • eg www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to all of the
  • paths.
  • eg the Disallow rule for the /administrator/ folder MUST
  • be changed to read
  • Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://tool.motoricerca.info/robots-checker.phtml