gmx.de
robots.txt

Robots Exclusion Standard data for gmx.de

Resource Scan

Scan Details

Site Domain gmx.de
Base Domain gmx.de
Scan Status Ok
Last Scan2024-09-19T15:28:43+00:00
Next Scan 2024-09-26T15:28:43+00:00

Last Scan

Scanned2024-09-19T15:28:43+00:00
URL https://gmx.de/robots.txt
Redirect https://www.gmx.net/robots.txt
Redirect Domain www.gmx.net
Redirect Base gmx.net
Domain IPs 82.165.229.152, 82.165.229.87
Redirect IPs 82.165.229.85
Response IP 82.165.229.46
Found Yes
Hash ca86016f10097e688289e5284e689885d066f7ff1425a3c9c729ca6805e459b2
SimHash 78118b606133

Groups

*

Rule Path
Disallow /test/

applebot

Rule Path
Disallow /magazine/
Allow /magazine/in-eigener-sache/
Allow /magazine/unicef/
Allow /magazine/so-arbeitet-die-redaktion/

chatgpt-user

Rule Path
Disallow /magazine/
Allow /magazine/in-eigener-sache/
Allow /magazine/unicef/
Allow /magazine/so-arbeitet-die-redaktion/

gptbot

Rule Path
Disallow /magazine/
Allow /magazine/in-eigener-sache/
Allow /magazine/unicef/
Allow /magazine/so-arbeitet-die-redaktion/

google-extended

Rule Path
Disallow /magazine/
Allow /magazine/in-eigener-sache/
Allow /magazine/unicef/
Allow /magazine/so-arbeitet-die-redaktion/