thi.de
robots.txt
Robots Exclusion Standard data for thi.de
Resource Scan
Scan Details
Site Domain | thi.de |
Base Domain | thi.de |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2025-10-06T23:12:20+00:00 |
Next Scan | 2025-11-05T23:12:20+00:00 |
Last Successful Scan
Scanned | 2025-08-31T21:09:24+00:00 |
URL | https://thi.de/robots.txt |
Redirect | https://www.thi.de/robots.txt |
Redirect Domain | www.thi.de |
Redirect Base | thi.de |
Domain IPs | 194.94.240.181 |
Redirect IPs | 194.94.240.181 |
Response IP | 194.94.240.181 |
Found | Yes |
Hash | c65327765dd22ade07f827c21355759bbf9e2d0cefa70b23728ae331fdc370a4 |
SimHash | ed245a248d31 |
Groups
*
Rule | Path |
---|---|
Disallow | /suche/ |
Disallow | /suche |
Disallow | /en/search/ |
Disallow | /en/search |
Other Records
Field | Value |
---|---|
sitemap | https://www.thi.de/sitemap.xml |