hildesheim.de
robots.txt

Robots Exclusion Standard data for hildesheim.de

Resource Scan

Scan Details

Site Domain hildesheim.de
Base Domain hildesheim.de
Scan Status Ok
Last Scan2025-05-14T10:05:28+00:00
Next Scan 2025-06-13T10:05:28+00:00

Last Scan

Scanned2025-05-14T10:05:28+00:00
URL https://hildesheim.de/robots.txt
Redirect https://www.hildesheim.de/robots.txt
Redirect Domain www.hildesheim.de
Redirect Base hildesheim.de
Domain IPs 194.25.98.243
Redirect IPs 194.25.98.243
Response IP 194.25.98.243
Found Yes
Hash 4a7f824911a8ae8d8d60272d01542f005e7e0ad9ac40cd2cd4e5dd6207d0da3c
SimHash c215d5241757

Groups

emailcollector

Rule Path
Disallow /

gagarobot

Rule Path
Disallow /

vscooter

Rule Path
Disallow /

roverbot*

Rule Path
Disallow /

mirago

Rule Path
Disallow /

psbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

webcapture*

Rule Path
Disallow /

websauger*

Rule Path
Disallow /

teleport*

Rule Path
Disallow /

webwhacker*

Rule Path
Disallow /

webzip*

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

net attache*

Rule Path
Disallow /

webreaper*

Rule Path
Disallow /

sitesnagger*

Rule Path
Disallow /

httrack*

Rule Path
Disallow /