dagensmedia.se
robots.txt

Robots Exclusion Standard data for dagensmedia.se

Resource Scan

Scan Details

Site Domain dagensmedia.se
Base Domain dagensmedia.se
Scan Status Ok
Last Scan2024-06-14T09:33:54+00:00
Next Scan 2024-06-21T09:33:54+00:00

Last Scan

Scanned2024-06-14T09:33:54+00:00
URL https://dagensmedia.se/robots.txt
Redirect https://www.dagensmedia.se/robots.txt
Redirect Domain www.dagensmedia.se
Redirect Base dagensmedia.se
Domain IPs 34.149.169.35
Redirect IPs 151.101.37.91, 2a04:4e42:9::347
Response IP 151.101.37.91
Found Yes
Hash 88898595b43e7b8c5b3a78bb2f67866193ed5624d501c62d7c70434d41e39a2d
SimHash 3a549800c111

Groups

*

Rule Path
Disallow /sok*
Disallow /logga-in*
Disallow /glomt-losenord/
Disallow /access/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.dagensmedia.se/sitemap.xml
sitemap https://www.dagensmedia.se/sitemap/news.xml