m.news.de
robots.txt

Robots Exclusion Standard data for m.news.de

Resource Scan

Scan Details

Site Domain m.news.de
Base Domain news.de
Scan Status Ok
Last Scan 2024-05-12T16:08:47+00:00
Next Scan 2024-06-11T16:08:47+00:00

Last Scan

Scanned 2024-05-12T16:08:47+00:00
URL https://m.news.de/robots.txt
Redirect https://www.news.de/robots.txt
Redirect Domain www.news.de
Redirect Base news.de
Domain IPs 62.141.58.43
Redirect IPs 62.141.58.43
Response IP 62.141.58.43
Found Yes
Hash a1a838ef9e3ad2e16711c7ed6c05b294030cc458e4ff350f4e5015106f19c97d
SimHash c50e5640c813

Groups

*

Rule Path
Disallow /print/
Disallow /search/
Disallow /article/
Disallow /index/
Disallow /newsfrontend/video/
Disallow /newsfrontend/imagegallery/
Disallow /newsfrontend/flash/
Disallow /newsfrontend/article/
Disallow /newsfrontend/search/
Disallow /image/
Disallow /newsclub/
Disallow /landing/
Disallow /textgallery/
Disallow /resources/
Disallow /track.php
Disallow /module/cms/
Allow /resources/xml/
Allow /images/
Allow /resources/images/
Allow /resources/thumbs/
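
The * group mixes broad Disallow rules with narrower Allow rules: /resources/ is disallowed, while /resources/xml/, /resources/images/ and /resources/thumbs/ are allowed. The following is a minimal sketch of how a crawler might resolve such overlaps, assuming the longest-match precedence described in RFC 9309 and plain prefix matching (none of the paths above use wildcards). The names STAR_RULES and is_allowed are illustrative and do not come from the scan data.

```python
# Rules of the "*" group, copied from the listing above.
STAR_RULES = [
    ("disallow", "/print/"),
    ("disallow", "/search/"),
    ("disallow", "/article/"),
    ("disallow", "/index/"),
    ("disallow", "/newsfrontend/video/"),
    ("disallow", "/newsfrontend/imagegallery/"),
    ("disallow", "/newsfrontend/flash/"),
    ("disallow", "/newsfrontend/article/"),
    ("disallow", "/newsfrontend/search/"),
    ("disallow", "/image/"),
    ("disallow", "/newsclub/"),
    ("disallow", "/landing/"),
    ("disallow", "/textgallery/"),
    ("disallow", "/resources/"),
    ("disallow", "/track.php"),
    ("disallow", "/module/cms/"),
    ("allow", "/resources/xml/"),
    ("allow", "/images/"),
    ("allow", "/resources/images/"),
    ("allow", "/resources/thumbs/"),
]


def is_allowed(path, rules=STAR_RULES):
    """Return True if `path` may be fetched under longest-match precedence.

    Assumption: the rule with the longest matching prefix wins, and an
    Allow rule wins a tie. A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/resources/xml/metasitemap.xml", "/resources/css/site.css",
              "/images/logo.png", "/print/123"):
        print(p, is_allowed(p))
```

Under this assumption, /resources/xml/metasitemap.xml is fetchable because the Allow prefix /resources/xml/ is longer than the Disallow prefix /resources/. Parsers that apply rules in file order rather than by length may give a different answer for such paths.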

googlebot-news

Rule Path
Disallow /article/
Disallow /morgenattacke/
Disallow /special/
Disallow /fotostrecke/
Disallow /video/

yahoo!-adcrawler

Rule Path
Allow /search/

Other Records

Field Value
crawl-delay 0.5
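
The crawl-delay record above is a non-standard extension (it is not part of RFC 9309), and its unit is not stated in the scan data. The sketch below assumes the common interpretation of a minimum pause in seconds between successive requests. The Throttle class and its parameters are hypothetical names introduced for illustration.

```python
import time


class Throttle:
    """Enforce a minimum delay between calls to wait().

    Assumption: crawl-delay 0.5 means at least 0.5 seconds between requests.
    `clock` and `sleep` are injectable so the behaviour can be tested
    without real waiting.
    """

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous request, if any

    def wait(self):
        """Sleep if needed and return the number of seconds slept."""
        waited = 0.0
        if self._last is not None:
            remaining = self.delay - (self._clock() - self._last)
            if remaining > 0:
                self._sleep(remaining)
                waited = remaining
        self._last = self._clock()
        return waited


if __name__ == "__main__":
    throttle = Throttle(0.5)
    for path in ("/search/a", "/search/b", "/search/c"):
        slept = throttle.wait()
        # A real crawler would issue its HTTP request here.
        print(f"{path}: slept {slept:.2f}s before this request")
```

Calling wait() immediately before each request keeps the spacing independent of how long the request itself takes.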

mediapartners-google

Rule Path
Allow /search/

Other Records

Field Value
sitemap https://www.news.de/resources/xml/metasitemap.xml
sitemap https://www.news.de/googlesitemap/