hist-world.com
robots.txt

Robots Exclusion Standard data for hist-world.com

Resource Scan

Scan Details

Site Domain hist-world.com
Base Domain hist-world.com
Scan Status Ok
Last Scan 6/3/2025, 8:49:23 PM
Next Scan 6/10/2025, 8:49:23 PM

Last Scan

Scanned 6/3/2025, 8:49:23 PM
URL https://hist-world.com/robots.txt
Domain IPs 104.21.58.116, 172.67.159.104, 2606:4700:3034::6815:3a74, 2606:4700:3036::ac43:9f68
Response IP 172.67.159.104
Found Yes
Hash e8a9448f7508c066059e1ea1935baeda58ac96afde88181cf7d6ba5daa843a47
SimHash af2cece04939

Groups

*

Rule Path
Disallow /administrator/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /libraries/
Disallow /logs/
Disallow /media/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/
Disallow /component/
Disallow /content/
Disallow /category/
Disallow /users/
Disallow /mailto/
Disallow /register-login/
Disallow /index.php/
Disallow /login?view*
Disallow /*?limitstart=*
Disallow /*?start=*
Disallow /tag/
Disallow /mobile/
Disallow /m/
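
Several of the paths above use the `*` wildcard (for example `/*?start=*`). As a rough sketch of how a crawler might evaluate such rules, the Python below converts a rule path into a regular expression, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and only a subset of the listed rules is included for brevity.

```python
import re

# A subset of the Disallow paths listed in the scan (assumption: the
# remaining rules would be handled the same way).
DISALLOW_RULES = [
    "/administrator/",
    "/tag/",
    "/login?view*",
    "/*?limitstart=*",
    "/*?start=*",
]


def rule_to_regex(rule_path: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumes '*' matches any sequence of characters and a trailing '$'
    anchors the match at the end of the path. Everything else is literal.
    """
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape each literal piece, then join the pieces with '.*'.
    pattern = ".*".join(re.escape(piece) for piece in rule_path.split("*"))
    if anchored:
        pattern += "$"
    return re.compile(pattern)


def is_disallowed(path: str) -> bool:
    """Return True if any rule matches from the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW_RULES)


if __name__ == "__main__":
    for candidate in ["/administrator/index.php", "/articles?start=10", "/about"]:
        print(candidate, is_disallowed(candidate))
```

Note that under this reading `/*?start=*` only matches when `start` is the first query parameter, so a path such as `/articles?page=2&start=10` would not be blocked by it.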

Other Records

Field Value
crawl-delay 19
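
To illustrate how a client could honor the `crawl-delay` value, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt lines are reconstructed from this report rather than copied from the live file, so their exact wording and order are an assumption. Only plain prefix rules are used, because the standard-library parser does not interpret `*` wildcards inside paths.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report (assumption: the real file may
# differ in ordering, comments, and additional fields).
ROBOTS_LINES = [
    "User-agent: *",
    "Crawl-delay: 19",
    "Disallow: /administrator/",
    "Disallow: /tmp/",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Seconds a polite crawler would wait between requests for this group.
delay = parser.crawl_delay("*")

# Prefix-based checks against the reconstructed rules.
tmp_allowed = parser.can_fetch("*", "https://hist-world.com/tmp/file.txt")
about_allowed = parser.can_fetch("*", "https://hist-world.com/about")

if __name__ == "__main__":
    print(delay, tmp_allowed, about_allowed)
```

A crawler would typically sleep for `delay` seconds between consecutive requests to the same host. Not every crawler honors `crawl-delay`, since it is not part of the core Robots Exclusion Protocol.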

Warnings

  • `host` is not a known field.