magazine.web.de
robots.txt
Robots Exclusion Standard data for magazine.web.de
Resource Scan
Scan Details
Site Domain | magazine.web.de |
Base Domain | web.de |
Scan Status | Ok |
Last Scan | 2024-11-14T23:51:18+00:00 |
Next Scan | 2024-11-21T23:51:18+00:00 |
Last Scan
Scanned | 2024-11-14T23:51:18+00:00 |
URL | https://magazine.web.de/robots.txt |
Redirect | https://web.de/robots.txt |
Redirect Domain | web.de |
Redirect Base | web.de |
Domain IPs | 82.165.229.87 |
Redirect IPs | 82.165.229.138, 82.165.229.83 |
Response IP | 82.165.229.138 |
Found | Yes |
Hash | 256ec067b29557c598c36d7f44de58187c6055cf77630954c574ed4651bc6ce7 |
SimHash | e8528b206133 |
Groups
*
Rule | Path |
---|---|
Disallow | /test/ |
googlebot-news
Rule | Path |
---|---|
Disallow | / |
Disallow | /magazine/*/thema/ |
Allow | /magazine/ |
Allow | /amp/ |
Allow | /$ |
applebot
Rule | Path |
---|---|
Disallow | /magazine/ |
Allow | /magazine/in-eigener-sache/ |
Allow | /magazine/unicef/ |
Allow | /magazine/so-arbeitet-die-redaktion/ |
chatgpt-user
Rule | Path |
---|---|
Disallow | /magazine/ |
Allow | /magazine/in-eigener-sache/ |
Allow | /magazine/unicef/ |
Allow | /magazine/so-arbeitet-die-redaktion/ |
Comments