haroldslist.com
robots.txt

Robots Exclusion Standard data for haroldslist.com

Resource Scan

Scan Details

Site Domain haroldslist.com
Base Domain haroldslist.com
Scan Status Ok
Last Scan2026-01-29T11:25:47+00:00
Next Scan 2026-02-05T11:25:47+00:00

Last Scan

Scanned2026-01-29T11:25:47+00:00
URL https://haroldslist.com/robots.txt
Redirect https://www.haroldslist.com/robots.txt
Redirect Domain www.haroldslist.com
Redirect Base haroldslist.com
Domain IPs 50.63.7.193
Redirect IPs 50.63.7.193
Response IP 50.63.7.193
Found Yes
Hash 315a00beb7ed55cdad872db4c6365b739cf9c6e809f0f7630eb1ab5b6140c4a0
SimHash 64543ff668b5

Groups

*

Rule Path
Disallow
Disallow /application/
Disallow /system/
Disallow /vendor/
Disallow /index.php

Comments

  • Allow all web crawlers access to all content
  • Disallow access to certain directories
  • Optionally, disallow access to specific files
  • Sitemap location
  • Sitemap: https://www.haroldslist.com/sitemap.xml