theglade.com
robots.txt

Robots Exclusion Standard data for theglade.com

Resource Scan

Scan Details

Site Domain theglade.com
Base Domain theglade.com
Scan Status Ok
Last Scan2025-10-24T01:36:06+00:00
Next Scan 2025-11-23T01:36:06+00:00

Last Scan

Scanned2025-10-24T01:36:06+00:00
URL https://theglade.com/robots.txt
Domain IPs 2001:8d8:100f:f000::294, 217.160.0.205
Response IP 217.160.0.205
Found Yes
Hash 418b8bd8c7746aa4346fc8e2eee37240daca2956d2936a73040be02130f985bd
SimHash eb654d7ed578

Groups

*

Rule Path
Disallow /scripts/
Disallow /chat/
Disallow /links/
Disallow /guest/
Disallow /irene/
Disallow /jadzia/
Disallow /shirKhan/
Disallow /umis/
Disallow /khan/download/
Disallow /khan/tools/
Disallow /kdt/
Disallow /bio/
Disallow /infodesk/
Disallow /conlib/
Disallow /contenido/
Disallow /docs/
Disallow /pear/
Disallow /phpmyadmin/
Disallow /cms/images/
Disallow /cms/upload/bilder/gallerie/
Disallow /cms/upload/bilder/figuren/
Disallow /cms/upload/bilder/archiv/
Disallow /salome/
Allow /cms/upload/bilder/comics/