is.fi
robots.txt

Robots Exclusion Standard data for is.fi

Resource Scan

Scan Details

Site Domain is.fi
Base Domain is.fi
Scan Status Ok
Last Scan 2024-05-03T03:28:13+00:00
Next Scan 2024-05-10T03:28:13+00:00

Last Scan

Scanned 2024-05-03T03:28:13+00:00
URL https://is.fi/robots.txt
Redirect https://www.is.fi/robots.txt
Redirect Domain www.is.fi
Redirect Base is.fi
Domain IPs 18.65.25.51, 18.65.25.57, 18.65.25.58, 18.65.25.81, 2600:9000:2377:1400:10:27c5:9480:93a1, 2600:9000:2377:8000:10:27c5:9480:93a1, 2600:9000:2377:800:10:27c5:9480:93a1, 2600:9000:2377:c00:10:27c5:9480:93a1, 2600:9000:2377:d800:10:27c5:9480:93a1, 2600:9000:2377:f200:10:27c5:9480:93a1, 2600:9000:2377:f400:10:27c5:9480:93a1, 2600:9000:2377:f600:10:27c5:9480:93a1
Redirect IPs 2600:9000:24db:1000:1b:b70c:39c0:93a1, 2600:9000:24db:5200:1b:b70c:39c0:93a1, 2600:9000:24db:8000:1b:b70c:39c0:93a1, 2600:9000:24db:8a00:1b:b70c:39c0:93a1, 2600:9000:24db:d400:1b:b70c:39c0:93a1, 2600:9000:24db:d600:1b:b70c:39c0:93a1, 2600:9000:24db:f200:1b:b70c:39c0:93a1, 2600:9000:24db:fc00:1b:b70c:39c0:93a1, 65.9.112.22, 65.9.112.48, 65.9.112.54, 65.9.112.95
Response IP 3.160.246.40
Found Yes
Hash deb5f487af2e662cd062722afa6d37322f3932cc365e6d83b29780d302e051ca
SimHash 490cc0648533

Groups

*

Rule Path
Disallow /promo/
Disallow /sivulaskuri
Disallow /api/
Disallow /rest/
Allow /api/paid-article/
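
The Allow rule above overlaps with the broader Disallow for /api/. The following is a minimal sketch of how a crawler that follows the longest-match convention (as described in RFC 9309) might resolve that overlap. It assumes plain prefix matching with no wildcard support, and the helper name `is_allowed` and the example paths are hypothetical. Crawlers that evaluate rules in a different way may reach a different result.

```python
# Rules of the "*" group, in the order the scan lists them.
RULES = [
    ("disallow", "/promo/"),
    ("disallow", "/sivulaskuri"),
    ("disallow", "/api/"),
    ("disallow", "/rest/"),
    ("allow", "/api/paid-article/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match precedence.

    Assumption: simple prefix matching only; '*' and '$' are not handled.
    """
    best_len = -1
    allowed = True  # a path matched by no rule is allowed
    for kind, prefix in rules:
        if path.startswith(prefix):
            n = len(prefix)
            # The longest matching prefix wins; on a tie, allow wins.
            if n > best_len or (n == best_len and kind == "allow"):
                best_len = n
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise each rule.
    for p in ["/api/paid-article/123", "/api/other", "/promo/x", "/uutiset/"]:
        print(p, is_allowed(p))
```

Under these assumptions a path below /api/paid-article/ is allowed because that prefix is longer than /api/, while other /api/ paths stay disallowed.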

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /
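
The groups above can be checked with Python's standard `urllib.robotparser`. This is only a sketch: the robots.txt text is reconstructed from the scan listing rather than copied from the live file, so the original ordering, casing, and comments are assumptions, and the agent name `ExampleBot` and the path `/uutiset/` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Agents that the scan lists with a blanket "Disallow /".
BLOCKED_AGENTS = [
    "chatgpt-user", "ccbot", "gptbot", "google-extended",
    "imagesiftbot", "facebookbot", "omgilibot", "amazonbot",
]


def build_robots_txt():
    """Rebuild an approximate robots.txt from the groups in the scan."""
    lines = [
        "User-agent: *",
        "Disallow: /promo/",
        "Disallow: /sivulaskuri",
        "Disallow: /api/",
        "Disallow: /rest/",
        "Allow: /api/paid-article/",
        "",
    ]
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    return "\n".join(lines)


def make_parser():
    """Parse the reconstructed text without any network access."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser


if __name__ == "__main__":
    rp = make_parser()
    print(rp.can_fetch("GPTBot", "https://www.is.fi/"))
    print(rp.can_fetch("ExampleBot", "https://www.is.fi/uutiset/"))
```

An agent with its own group is matched against that group only, so the listed bots are refused everywhere, while any agent without a group falls back to the "*" rules.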

Other Records

Field Value
sitemap https://www.is.fi/sitemap/html/is/sitemapindex.xml
sitemap https://www.is.fi/rss/custom/news-sitemap.xml
sitemap https://www.is.fi/supersaa/assets/sitemap-index.xml