m.news.de
robots.txt
Robots Exclusion Standard data for m.news.de
Resource Scan
Scan Details
Site Domain | m.news.de |
Base Domain | news.de |
Scan Status | Ok |
Last Scan | 2024-05-12T16:08:47+00:00 |
Next Scan | 2024-06-11T16:08:47+00:00 |
Last Scan
Scanned | 2024-05-12T16:08:47+00:00 |
URL | https://m.news.de/robots.txt |
Redirect | https://www.news.de/robots.txt |
Redirect Domain | www.news.de |
Redirect Base | news.de |
Domain IPs | 62.141.58.43 |
Redirect IPs | 62.141.58.43 |
Response IP | 62.141.58.43 |
Found | Yes |
Hash | a1a838ef9e3ad2e16711c7ed6c05b294030cc458e4ff350f4e5015106f19c97d |
SimHash | c50e5640c813 |
Groups
*
Rule | Path |
---|---|
Disallow | /print/ |
Disallow | /search/ |
Disallow | /article/ |
Disallow | /index/ |
Disallow | /newsfrontend/video/ |
Disallow | /newsfrontend/imagegallery/ |
Disallow | /newsfrontend/flash/ |
Disallow | /newsfrontend/article/ |
Disallow | /newsfrontend/search/ |
Disallow | /image/ |
Disallow | /newsclub/ |
Disallow | /landing/ |
Disallow | /textgallery/ |
Disallow | /resources/ |
Disallow | /track.php |
Disallow | /module/cms/ |
Allow | /resources/xml/ |
Allow | /images/ |
Allow | /resources/images/ |
Allow | /resources/thumbs/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.news.de/resources/xml/metasitemap.xml |
sitemap | https://www.news.de/googlesitemap/ |