gmx.de
robots.txt
Robots Exclusion Standard data for gmx.de
Resource Scan
Scan Details
Site Domain | gmx.de |
Base Domain | gmx.de |
Scan Status | Ok |
Last Scan | 2024-09-19T15:28:43+00:00 |
Next Scan | 2024-09-26T15:28:43+00:00 |
Last Scan
Scanned | 2024-09-19T15:28:43+00:00 |
URL | https://gmx.de/robots.txt |
Redirect | https://www.gmx.net/robots.txt |
Redirect Domain | www.gmx.net |
Redirect Base | gmx.net |
Domain IPs | 82.165.229.152, 82.165.229.87 |
Redirect IPs | 82.165.229.85 |
Response IP | 82.165.229.46 |
Found | Yes |
Hash | ca86016f10097e688289e5284e689885d066f7ff1425a3c9c729ca6805e459b2 |
SimHash | 78118b606133 |
Groups
*
Rule | Path |
---|---|
Disallow | /test/ |
applebot
Rule | Path |
---|---|
Disallow | /magazine/ |
Allow | /magazine/in-eigener-sache/ |
Allow | /magazine/unicef/ |
Allow | /magazine/so-arbeitet-die-redaktion/ |
chatgpt-user
Rule | Path |
---|---|
Disallow | /magazine/ |
Allow | /magazine/in-eigener-sache/ |
Allow | /magazine/unicef/ |
Allow | /magazine/so-arbeitet-die-redaktion/ |