dn.se
robots.txt

Robots Exclusion Standard data for dn.se

Resource Scan

Scan Details

Site Domain dn.se
Base Domain dn.se
Scan Status Ok
Last Scan 2024-11-16T17:17:58+00:00
Next Scan 2024-11-23T17:17:58+00:00

Last Scan

Scanned 2024-11-16T17:17:58+00:00
URL https://dn.se/robots.txt
Redirect https://www.dn.se/robots.txt
Redirect Domain www.dn.se
Redirect Base dn.se
Domain IPs 34.117.105.189
Redirect IPs 151.101.37.91, 2a04:4e42:8d::347
Response IP 151.101.37.91
Found Yes
Hash efd15164e2b292aa146dec0ed767a80c899c7ac344488c0b48f47b56cf6cd4b7
SimHash 7ea7104cec3a
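
The report does not say which algorithm produced the Hash value. Its length (64 hexadecimal digits) is consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function name and the sample bytes are hypothetical and are not the actual dn.se file; the SimHash value is not reproduced here because its parameters are unknown.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a robots.txt body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, for illustration only (not the real dn.se file).
sample_body = b"User-agent: gptbot\nDisallow: /\n"
digest = sha256_hex(sample_body)

# A SHA-256 hex digest always has 64 characters, like the Hash field above.
print(len(digest), digest)
```

Comparing the digest from one scan with the digest from the next is a cheap way to detect that the file changed at all, which is presumably what the Hash field is for.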

Groups

*

Rule Path
Disallow /metrics/
Disallow /healthcheck/
Disallow /_alive/
Disallow /_alive

*

Rule Path
Disallow /sok/?*
Disallow /sok?*
Disallow /others-are-reading-article/
Disallow /partial-article/
Disallow /expire/
Disallow /expire
Disallow /refresh/
Disallow /refresh
Disallow /ajax/
Disallow /blahonga/
Disallow /login
Disallow /logout
Disallow /register
Disallow /register-account
Disallow /personalized/
Disallow /extern-sok/
Disallow /extern-sokning/
Disallow *rm%3Dprint
Disallow /Pages/ArticleForward.aspx*
Disallow /api/
Disallow /cached-api/
Disallow /loggain/
Disallow /loggaut/
Disallow /applinkstest
Disallow /ladda-ner-appen
Disallow /DNet/
Disallow /redirect
Disallow /r/
Disallow /res/expressen/
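
Several rules in this group use the `*` wildcard (for example `/sok/?*` and `*rm%3Dprint`). As a rough sketch of how such patterns are commonly interpreted, the code below treats `*` as "any sequence of characters", a trailing `$` as an end anchor, and everything else as a literal prefix match. The helper names and test paths are hypothetical, and percent-encoding normalization is deliberately left out, so this is an approximation and not a complete matcher.

```python
import re

# A few rules copied from the group above.
DISALLOW_PATTERNS = [
    "/sok/?*",
    "/sok?*",
    "*rm%3Dprint",
    "/Pages/ArticleForward.aspx*",
    "/api/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW_PATTERNS) -> bool:
    """True if any pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


# Hypothetical request paths, for illustration only.
for candidate in ["/sok/?q=val", "/sok/", "/artikel?rm%3Dprint", "/api/v1/x"]:
    print(candidate, is_disallowed(candidate))
```

Note that `/sok/` without a query string is not matched by `/sok/?*`, because the `?` in the rule is a literal character and not a wildcard.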

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /
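
To illustrate how the groups above affect different crawlers, here is a minimal sketch using Python's standard `urllib.robotparser`. The excerpt is reconstructed from the listed groups and is not the full file: a handful of rules from the two `*` groups are merged into one group, and wildcard rules are left out because this parser does plain prefix matching. The agent name `ExampleBot` and the path `/some-article/` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction of the rules listed above (not the complete file).
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /metrics/
Disallow: /login
Disallow: /api/

User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Check whether the given user agent may fetch a path on www.dn.se."""
    return parser.can_fetch(agent, "https://www.dn.se" + path)


# The named AI crawlers are blocked everywhere; other agents fall back to "*".
for agent, path in [
    ("gptbot", "/"),
    ("ccbot", "/some-article/"),
    ("ExampleBot", "/some-article/"),
    ("ExampleBot", "/api/data"),
]:
    print(agent, path, allowed(agent, path))
```

The practical reading is that ccbot, chatgpt-user, gptbot, google-extended, omgilibot, omgili, and facebookbot are each disallowed from the whole site, while all other crawlers are only kept out of the operational, search, login, and API paths listed under `*`.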

Other Records

Field Value
sitemap https://www.dn.se/sitemap.xml