dn.se
robots.txt

Robots Exclusion Standard data for dn.se

Resource Scan

Scan Details

Site Domain dn.se
Base Domain dn.se
Scan Status Ok
Last Scan 2024-03-23T21:05:41+00:00
Next Scan 2024-03-30T21:05:41+00:00

Last Scan

Scanned 2024-03-23T21:05:41+00:00
URL https://dn.se/robots.txt
Redirect https://www.dn.se/robots.txt
Redirect Domain www.dn.se
Redirect Base dn.se
Domain IPs 34.117.105.189
Redirect IPs 23.52.113.111
Response IP 23.54.57.172
Found Yes
Hash 1dcd6ef28424ea783c59215bd3ef657e714d2f074acb3ed33114b6115073bb16
SimHash 7ea7100cec3a
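
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so both are assumptions here. Under those assumptions, a fetched copy of the file could be compared against the recorded value roughly as follows; the helper name `matches_digest` is hypothetical.

```python
import hashlib

# Hash value recorded in the scan report above.
REPORTED_HASH = "1dcd6ef28424ea783c59215bd3ef657e714d2f074acb3ed33114b6115073bb16"


def matches_digest(body: bytes, expected_hex: str) -> bool:
    """Return True if the SHA-256 digest of `body` equals `expected_hex`.

    Assumption: the report's Hash field is SHA-256 over the raw response
    body. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest() == expected_hex.lower()
```

To use it, pass the raw bytes of the response from https://www.dn.se/robots.txt together with `REPORTED_HASH`. A mismatch may only mean the file has changed since the scan, or that the scanner hashes something other than the raw body.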

Groups

*

Rule Path
Disallow /metrics/
Disallow /healthcheck/
Disallow /_alive/
Disallow /_alive

*

Rule Path
Disallow /sok/?*
Disallow /sok?*
Disallow /others-are-reading-article/
Disallow /partial-article/
Disallow /expire/
Disallow /expire
Disallow /refresh/
Disallow /refresh
Disallow /ajax/
Disallow /blahonga/
Disallow /login
Disallow /logout
Disallow /register
Disallow /register-account
Disallow /orderverifieringtm/
Disallow /personalized/
Disallow /extern-sok/
Disallow /extern-sokning/
Disallow *rm%3Dprint
Disallow /Pages/ArticleForward.aspx*
Disallow /api/
Disallow /cached-api/
Disallow /loggain/
Disallow /loggaut/
Disallow /applinkstest
Disallow /ladda-ner-appen
Disallow /DNet/
Disallow /redirect
Disallow /r/
Disallow /res/expressen/
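
Several paths in this group use the `*` wildcard (for example `/sok/?*` and `*rm%3Dprint`). A minimal sketch of how such a pattern could be matched against a URL path is shown below. It assumes the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and everything else is a case-sensitive prefix match. The function name `rule_matches` is hypothetical, and real crawlers may differ in details such as percent-encoding normalization.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Check whether a robots.txt path pattern matches a URL path.

    '*' matches any sequence of characters; a trailing '$' anchors the
    end of the path; all other characters (including '?') are literal.
    Matching starts at the beginning of the path and is case-sensitive.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    return re.match(regex, path) is not None


print(rule_matches("/sok/?*", "/sok/?q=val"))          # pattern requires the literal '?'
print(rule_matches("*rm%3Dprint", "/a/b?rm%3Dprint"))  # leading wildcard
print(rule_matches("/login", "/nyheter/"))             # unrelated path
```

Note that `/sok/?*` and `/sok?*` only cover search URLs that carry a query string, because the `?` is a literal character and not a wildcard.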

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /
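
As a rough illustration of how the groups above are applied, the sketch below feeds a small subset of the listed rules, rewritten in robots.txt syntax, to Python's standard `urllib.robotparser`. The subset and the user-agent name `ExampleBrowser` are chosen for illustration only and are not part of the scan. The standard-library parser keeps only the first `*` group and treats paths as plain prefixes, so the wildcard rules and the second `*` group are left out here.

```python
from urllib.robotparser import RobotFileParser

# Illustrative subset of the rules listed above (not the full file).
ROBOTS_SUBSET = """\
User-agent: *
Disallow: /metrics/
Disallow: /healthcheck/

User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_SUBSET)

# Crawlers with their own group are blocked from the whole site.
print(parser.can_fetch("gptbot", "https://www.dn.se/"))
# Other agents fall back to the '*' group.
print(parser.can_fetch("ExampleBrowser", "https://www.dn.se/metrics/"))
print(parser.can_fetch("ExampleBrowser", "https://www.dn.se/sport/"))
```

A crawler that matches a named group uses only that group, so `gptbot` is governed by `Disallow /` and not by the `*` rules.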

Other Records

Field Value
sitemap https://www.dn.se/sitemap.xml