inberlin.de
robots.txt
Robots Exclusion Standard data for inberlin.de
Resource Scan
Scan Details
Site Domain | inberlin.de |
Base Domain | inberlin.de |
Scan Status | Ok |
Last Scan | 2024-06-27T08:54:24+00:00 |
Next Scan | 2024-07-04T08:54:24+00:00 |
Last Scan
Scanned | 2024-06-27T08:54:24+00:00 |
URL | https://inberlin.de/robots.txt |
Domain IPs | 2001:8d8:100f:f000::2e4, 217.160.0.251 |
Response IP | 217.160.0.251 |
Found | Yes |
Hash | 9e35e63545d21fa3bcc141c55c95924ddc37c0ee644d03033805844b083e2279 |
SimHash | ec5c0940cf93 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /stdcgi/ |
Disallow | /logs/ |
Disallow | /special/ |
Other Records
Field | Value |
---|---|
sitemap | http://inberlin.de/sitemap.xml |