dw.com
robots.txt
Robots Exclusion Standard data for dw.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | dw.com |
Base Domain | dw.com |
Scan Status | Ok |
Last Scan | 2024-04-26T19:34:11+00:00 |
Next Scan | 2024-05-03T19:34:11+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-04-26T19:34:11+00:00 |
URL | https://dw.com/robots.txt |
Redirect | https://www.dw.com/robots.txt |
Redirect Domain | www.dw.com |
Redirect Base | dw.com |
Domain IPs | 194.55.26.46, 194.55.30.46 |
Redirect IPs | 173.222.146.24, 2600:1413:b000:885::2d63, 2600:1413:b000:89d::2d63 |
Response IP | 104.69.160.56 |
Found | Yes |
Hash | f021cd839c106748a82eb2d3abd0c22ffab9cf693189ba8af07d781eaaaff1ba |
SimHash | f188c942c1b1 |
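
The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the scan does not name the algorithm or say which bytes were hashed. Assuming it is SHA-256 over the raw response body, a fetched copy of the file could be compared against the recorded value roughly as follows. The function name `body_matches_recorded_hash` is hypothetical and used only for illustration.

```python
import hashlib

# Hash value recorded by the scan on 2024-04-26.
RECORDED_HASH = "f021cd839c106748a82eb2d3abd0c22ffab9cf693189ba8af07d781eaaaff1ba"


def body_matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Return True if the SHA-256 digest of `body` equals `recorded`.

    Assumption: the scan hashes the raw response body with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest() == recorded.lower()
```

A mismatch would not necessarily mean the file changed; it could also mean the assumption about the algorithm or the hashed bytes is wrong.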
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /search/ |
Disallow | /overlay/ |
Disallow | /popups/mediaplayer/ |
Disallow | /popups/popup_gallery/ |
Disallow | /*/layoutvorlagen/ |
Disallow | /*/user/account$ |
Disallow | /*/user/activity$ |
Disallow | /*/user/profile$ |
Disallow | /*/user/password/change$ |
Disallow | /*/user/password/set$ |
Disallow | /*/user/feedback/status?type=* |
Disallow | /*/user/register/confirm$ |
Disallow | /*/user/email/change$ |
Disallow | /*?maca=* |
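
Several of the rules above use the `*` wildcard and the `$` end-of-path anchor. A minimal sketch of how a crawler might evaluate them is shown here, assuming the common interpretation in which a rule matches from the start of the path, `*` matches any run of characters, and a trailing `$` requires the path to end there. The helper names `rule_to_regex` and `is_disallowed` are hypothetical. The sketch ignores `Allow` rules (this group has none) and percent-encoding normalization.

```python
import re

# Disallow paths from the "*" group in the scan.
DISALLOW_RULES = [
    "/search/",
    "/overlay/",
    "/popups/mediaplayer/",
    "/popups/popup_gallery/",
    "/*/layoutvorlagen/",
    "/*/user/account$",
    "/*/user/activity$",
    "/*/user/profile$",
    "/*/user/password/change$",
    "/*/user/password/set$",
    "/*/user/feedback/status?type=*",
    "/*/user/register/confirm$",
    "/*/user/email/change$",
    "/*?maca=*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path rule into a compiled regular expression."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    # '\Z' pins the match to the end of the path when the rule ends in '$'.
    return re.compile(pattern + (r"\Z" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if any rule matches `path` from its first character."""
    return any(rule_to_regex(rule).match(path) for rule in rules)
```

For example, `is_disallowed("/en/user/account")` is true because of the `$`-anchored rule, while `is_disallowed("/en/user/account/settings")` is false, since the anchor stops the rule from acting as a prefix. Note that Python's standard `urllib.robotparser` does not interpret `*` or `$` inside paths, which is why the sketch builds its own patterns.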
Other Records
Field | Value |
---|---|
sitemap | https://www.dw.com/sitemap.xml |
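
The groups and other records listed in this report can be turned back into robots.txt syntax. The sketch below is one way to do that; `render_robots_txt` is a hypothetical helper, and its output is only an approximation, since the report does not preserve the original file's comments, ordering, or whitespace.

```python
def render_robots_txt(groups, sitemaps):
    """Render parsed records as robots.txt text.

    groups:   dict mapping a user-agent string to a list of (rule, path) pairs
    sitemaps: list of sitemap URLs
    """
    lines = []
    for user_agent, rules in groups.items():
        lines.append(f"User-agent: {user_agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # blank line closes the group
    lines.extend(f"Sitemap: {url}" for url in sitemaps)
    return "\n".join(lines) + "\n"


# Example using the first two rules and the sitemap from the scan.
example = render_robots_txt(
    {"*": [("Disallow", "/search/"), ("Disallow", "/overlay/")]},
    ["https://www.dw.com/sitemap.xml"],
)
```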