magazine.web.de
robots.txt

Robots Exclusion Standard data for magazine.web.de

Resource Scan

Scan Details

Site Domain magazine.web.de
Base Domain web.de
Scan Status Ok
Last Scan2024-11-14T23:51:18+00:00
Next Scan 2024-11-21T23:51:18+00:00

Last Scan

Scanned2024-11-14T23:51:18+00:00
URL https://magazine.web.de/robots.txt
Redirect https://web.de/robots.txt
Redirect Domain web.de
Redirect Base web.de
Domain IPs 82.165.229.87
Redirect IPs 82.165.229.138, 82.165.229.83
Response IP 82.165.229.138
Found Yes
Hash 256ec067b29557c598c36d7f44de58187c6055cf77630954c574ed4651bc6ce7
SimHash e8528b206133

Groups

*

Rule Path
Disallow /test/

googlebot-news

Rule Path
Disallow /
Disallow /magazine/*/thema/
Allow /magazine/
Allow /amp/
Allow /$

applebot

Rule Path
Disallow /magazine/
Allow /magazine/in-eigener-sache/
Allow /magazine/unicef/
Allow /magazine/so-arbeitet-die-redaktion/

chatgpt-user

Rule Path
Disallow /magazine/
Allow /magazine/in-eigener-sache/
Allow /magazine/unicef/
Allow /magazine/so-arbeitet-die-redaktion/

gptbot

Rule Path
Disallow /magazine/
Allow /magazine/in-eigener-sache/
Allow /magazine/unicef/
Allow /magazine/so-arbeitet-die-redaktion/

google-extended

Rule Path
Disallow /magazine/
Allow /magazine/in-eigener-sache/
Allow /magazine/unicef/
Allow /magazine/so-arbeitet-die-redaktion/

Comments

  • https://web.de/robots.txt