akademie.dw.com
robots.txt
Robots Exclusion Standard data for akademie.dw.com
Resource Scan
Scan Details
Site Domain | akademie.dw.com |
Base Domain | dw.com |
Scan Status | Ok |
Last Scan | 2024-10-24T11:48:50+00:00 |
Next Scan | 2024-11-07T11:48:50+00:00 |
Last Scan
Scanned | 2024-10-24T11:48:50+00:00 |
URL | https://akademie.dw.com/robots.txt |
Domain IPs | 173.222.146.24, 2600:1413:b000:78d::2d63, 2600:1413:b000:790::2d63 |
Response IP | 104.69.45.5 |
Found | Yes |
Hash | f021cd839c106748a82eb2d3abd0c22ffab9cf693189ba8af07d781eaaaff1ba |
SimHash | f188c942c1b1 |
Groups
*
Rule | Path |
---|---|
Disallow | /search/ |
Disallow | /overlay/ |
Disallow | /popups/mediaplayer/ |
Disallow | /popups/popup_gallery/ |
Disallow | /*/layoutvorlagen/ |
Disallow | /*/user/account$ |
Disallow | /*/user/activity$ |
Disallow | /*/user/profile$ |
Disallow | /*/user/password/change$ |
Disallow | /*/user/password/set$ |
Disallow | /*/user/feedback/status?type=* |
Disallow | /*/user/register/confirm$ |
Disallow | /*/user/email/change$ |
Disallow | /*?maca=* |
Other Records
Field | Value |
---|---|
sitemap | https://www.dw.com/sitemap.xml |