www2.hs.fi
robots.txt

Robots Exclusion Standard data for www2.hs.fi

Resource Scan

Scan Details

Site Domain www2.hs.fi
Base Domain hs.fi
Scan Status Ok
Last Scan2024-05-07T09:36:43+00:00
Next Scan 2024-05-14T09:36:43+00:00

Last Scan

Scanned2024-05-07T09:36:43+00:00
URL https://www2.hs.fi/robots.txt
Redirect https://www.hs.fi/robots.txt
Redirect Domain www.hs.fi
Redirect Base hs.fi
Domain IPs 108.138.94.15, 108.138.94.29, 108.138.94.86, 108.138.94.87, 2600:9000:2201:9a00:b:5b2c:9f40:93a1, 2600:9000:2201:a400:b:5b2c:9f40:93a1, 2600:9000:2201:b000:b:5b2c:9f40:93a1, 2600:9000:2201:b600:b:5b2c:9f40:93a1, 2600:9000:2201:d000:b:5b2c:9f40:93a1, 2600:9000:2201:d800:b:5b2c:9f40:93a1, 2600:9000:2201:e200:b:5b2c:9f40:93a1, 2600:9000:2201:ee00:b:5b2c:9f40:93a1
Redirect IPs 2600:9000:249b:600:10:3b34:7000:93a1, 2600:9000:249b:7000:10:3b34:7000:93a1, 2600:9000:249b:7e00:10:3b34:7000:93a1, 2600:9000:249b:8400:10:3b34:7000:93a1, 2600:9000:249b:8c00:10:3b34:7000:93a1, 2600:9000:249b:a00:10:3b34:7000:93a1, 2600:9000:249b:aa00:10:3b34:7000:93a1, 2600:9000:249b:d400:10:3b34:7000:93a1, 3.163.189.109, 3.163.189.18, 3.163.189.36, 3.163.189.87
Response IP 18.165.171.30
Found Yes
Hash bccdf68b60c459a2c2851e3373428cfe2c63b518a8a6a1474cc744cee0c1efac
SimHash 4b2c8a60e137

Groups

*

Rule Path
Disallow /promo/
Disallow /sivulaskuri
Disallow /api/
Disallow /rest/
Allow /api/paid-article/

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.hs.fi/sitemap/html/hs/sitemapindex.xml
sitemap https://www.hs.fi/rss/custom/news-sitemap.xml