akademie.dw.com
robots.txt

Robots Exclusion Standard data for akademie.dw.com

Resource Scan

Scan Details

Site Domain akademie.dw.com
Base Domain dw.com
Scan Status Ok
Last Scan2024-10-24T11:48:50+00:00
Next Scan 2024-11-07T11:48:50+00:00

Last Scan

Scanned2024-10-24T11:48:50+00:00
URL https://akademie.dw.com/robots.txt
Domain IPs 173.222.146.24, 2600:1413:b000:78d::2d63, 2600:1413:b000:790::2d63
Response IP 104.69.45.5
Found Yes
Hash f021cd839c106748a82eb2d3abd0c22ffab9cf693189ba8af07d781eaaaff1ba
SimHash f188c942c1b1

Groups

*

Rule Path
Disallow /search/
Disallow /overlay/
Disallow /popups/mediaplayer/
Disallow /popups/popup_gallery/
Disallow /*/layoutvorlagen/
Disallow /*/user/account$
Disallow /*/user/activity$
Disallow /*/user/profile$
Disallow /*/user/password/change$
Disallow /*/user/password/set$
Disallow /*/user/feedback/status?type=*
Disallow /*/user/register/confirm$
Disallow /*/user/email/change$
Disallow /*?maca=*

twitterbot

Rule Path
Allow /*?maca=*

Other Records

Field Value
sitemap https://www.dw.com/sitemap.xml