litomerice.cz
robots.txt

Robots Exclusion Standard data for litomerice.cz

Resource Scan

Scan Details

Site Domain litomerice.cz
Base Domain litomerice.cz
Scan Status Ok
Last Scan2025-02-27T16:12:32+00:00
Next Scan 2025-03-29T16:12:32+00:00

Last Scan

Scanned2025-02-27T16:12:32+00:00
URL https://litomerice.cz/robots.txt
Redirect https://www.litomerice.cz/robots.txt
Redirect Domain www.litomerice.cz
Redirect Base litomerice.cz
Domain IPs 45.138.107.39, 45.138.107.40
Redirect IPs 45.138.107.39, 45.138.107.40
Response IP 45.138.107.39
Found Yes
Hash e64193e17f4aa7936c2f89d2aea0928caa8e667e34ab894b88023fc7d463f271
SimHash 220ce579c184

Groups

*

Rule Path
Allow /*.js***************
Allow /*.css**************
Allow /*.png**************
Allow /*.jpg**************
Allow /*.jpeg**************
Allow /*.gif**************
Allow /*.eot**************
Allow /*.woff**************
Allow /*.ttf**************
Allow /*.svg**************
Allow /*.otf**************
Allow /*.pdf**************
Allow /*.PNG**************
Allow /*.JPG**************
Allow /*.JPEG**************
Allow /*.mp3**************
Allow /*.pdf**************
Disallow /administrator/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /media/
Disallow /modules/
Disallow /plugins/
Disallow /templates/
Disallow /tmp/

Other Records

Field Value
sitemap https://www.litomerice.cz/sitemap.xml

Comments

  • Please don't remove folders from disallow.
  • The allows at the top allow any of the mimetypes listed to be crawled within any folder
  • using long-tail wildcards, these ignore the disallows for the folders below.
  • This gives full render for the search engines whilst preventing full crawls of system
  • folders
  • THIS ALLOWS FULL RENDER AT ENGINES
  • THESE FOLDERS SHOULD NEVER BE CRAWLED