helsinginsanomat.fi
robots.txt

Robots Exclusion Standard data for helsinginsanomat.fi

Resource Scan

Scan Details

Site Domain helsinginsanomat.fi
Base Domain helsinginsanomat.fi
Scan Status Ok
Last Scan2024-05-12T18:08:48+00:00
Next Scan 2024-05-19T18:08:48+00:00

Last Scan

Scanned2024-05-12T18:08:48+00:00
URL https://helsinginsanomat.fi/robots.txt
Redirect https://www.hs.fi/robots.txt
Redirect Domain www.hs.fi
Redirect Base hs.fi
Domain IPs 13.35.121.110, 13.35.121.111, 13.35.121.116, 13.35.121.64, 2600:9000:20fe:1400:b:5b2c:9f40:93a1, 2600:9000:20fe:1600:b:5b2c:9f40:93a1, 2600:9000:20fe:7000:b:5b2c:9f40:93a1, 2600:9000:20fe:8000:b:5b2c:9f40:93a1, 2600:9000:20fe:d600:b:5b2c:9f40:93a1, 2600:9000:20fe:e00:b:5b2c:9f40:93a1, 2600:9000:20fe:e200:b:5b2c:9f40:93a1, 2600:9000:20fe:f200:b:5b2c:9f40:93a1
Redirect IPs 2600:9000:2163:2a00:10:3b34:7000:93a1, 2600:9000:2163:3600:10:3b34:7000:93a1, 2600:9000:2163:4200:10:3b34:7000:93a1, 2600:9000:2163:5400:10:3b34:7000:93a1, 2600:9000:2163:7400:10:3b34:7000:93a1, 2600:9000:2163:c600:10:3b34:7000:93a1, 2600:9000:2163:e400:10:3b34:7000:93a1, 2600:9000:2163:f600:10:3b34:7000:93a1, 99.84.66.116, 99.84.66.31, 99.84.66.77, 99.84.66.91
Response IP 18.165.171.30
Found Yes
Hash ef5844ebcdb7ff1f3ae85ac402a42020f662c1103906ad3c627cb1fcd950ea53
SimHash 4b1cc260c537

Groups

*

Rule Path
Disallow /promo/
Disallow /sivulaskuri
Disallow /api/
Disallow /rest/
Disallow /public-transit-screen/
Allow /api/paid-article/

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.hs.fi/sitemap/html/hs/sitemapindex.xml
sitemap https://www.hs.fi/rss/custom/news-sitemap.xml