mpzp24.pl
robots.txt

Robots Exclusion Standard data for mpzp24.pl

Resource Scan

Scan Details

Site Domain mpzp24.pl
Base Domain mpzp24.pl
Scan Status Ok
Last Scan2024-09-26T04:22:45+00:00
Next Scan 2024-10-03T04:22:45+00:00

Last Scan

Scanned2024-09-26T04:22:45+00:00
URL https://mpzp24.pl/robots.txt
Domain IPs 104.21.49.220, 172.67.152.235, 2606:4700:3031::ac43:98eb, 2606:4700:3035::6815:31dc
Response IP 104.21.49.220
Found Yes
Hash 7e2f9c6ccb0897723d0bedda7ee710023289c935f08add76aeb8c899386dd000
SimHash a31f955803e1

Groups

*

Rule Path Comment
Disallow /administrator/ -
Disallow /cache/ -
Disallow /cli/ -
Disallow /components/ -
Disallow /images/ -
Disallow /includes/ -
Disallow /installation/ -
Disallow /language/ -
Disallow /libraries/ -
Disallow /logs/ -
Disallow /media/ -
Disallow /modules/ -
Disallow /plugins/ -
Disallow /templates/ -
Disallow /tmp/ -
Disallow *.pdf Block pdf files. Non-standard but works for major search engines.

Other Records

Field Value
sitemap https://mpzp24.pl/api.php/sitemap

Comments

  • If the Joomla site is installed within a folder such as at
  • e.g. www.example.com/joomla/ the robots.txt file MUST be
  • moved to the site root at e.g. www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to the disallowed
  • path, e.g. the Disallow rule for the /administrator/ folder
  • MUST be changed to read Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://www.sxw.org.uk/computing/robots/check.html