dw.com
robots.txt

Robots Exclusion Standard data for dw.com

Resource Scan

Scan Details

Site Domain dw.com
Base Domain dw.com
Scan Status Ok
Last Scan2024-04-26T19:34:11+00:00
Next Scan 2024-05-03T19:34:11+00:00

Last Scan

Scanned2024-04-26T19:34:11+00:00
URL https://dw.com/robots.txt
Redirect https://www.dw.com/robots.txt
Redirect Domain www.dw.com
Redirect Base dw.com
Domain IPs 194.55.26.46, 194.55.30.46
Redirect IPs 173.222.146.24, 2600:1413:b000:885::2d63, 2600:1413:b000:89d::2d63
Response IP 104.69.160.56
Found Yes
Hash f021cd839c106748a82eb2d3abd0c22ffab9cf693189ba8af07d781eaaaff1ba
SimHash f188c942c1b1

Groups

*

Rule Path
Disallow /search/
Disallow /overlay/
Disallow /popups/mediaplayer/
Disallow /popups/popup_gallery/
Disallow /*/layoutvorlagen/
Disallow /*/user/account$
Disallow /*/user/activity$
Disallow /*/user/profile$
Disallow /*/user/password/change$
Disallow /*/user/password/set$
Disallow /*/user/feedback/status?type=*
Disallow /*/user/register/confirm$
Disallow /*/user/email/change$
Disallow /*?maca=*

twitterbot

Rule Path
Allow /*?maca=*

Other Records

Field Value
sitemap https://www.dw.com/sitemap.xml