indy100.com
robots.txt

Robots Exclusion Standard data for indy100.com

Resource Scan

Scan Details

Site Domain indy100.com
Base Domain indy100.com
Scan Status Ok
Last Scan 2024-11-09T18:44:09+00:00
Next Scan 2024-11-16T18:44:09+00:00

Last Scan

Scanned 2024-11-09T18:44:09+00:00
URL https://indy100.com/robots.txt
Redirect https://www.indy100.com/robots.txt
Redirect Domain www.indy100.com
Redirect Base indy100.com
Domain IPs 151.101.1.186, 151.101.129.186, 151.101.193.186, 151.101.65.186
Redirect IPs 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91, 2a04:4e42:200::347, 2a04:4e42:400::347, 2a04:4e42:600::347, 2a04:4e42::347
Response IP 199.232.45.91
Found Yes
Hash a40aeaf6f9ed3c9795d18975e992036977f80e925d3d3d3a06bb1a052b7546c5
SimHash 444485024f11

Groups

*

Rule Path
Disallow /71347885/
Disallow /core/dashboard
Disallow /core/*
Disallow /r/*
Disallow /mnt/*
Disallow /internal-api/*
Disallow /404.html
Allow /r/kappa/api/*
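
The Allow rule for /r/kappa/api/* overlaps with the broader Disallow rule for /r/*. Under the longest-match precedence described in RFC 9309, the more specific (longer) pattern wins, so paths under /r/kappa/api/ should remain crawlable while the rest of /r/ stays blocked. The following is a minimal sketch of that evaluation in Python, not the scanner's own logic: the names RULES, pattern_to_regex and is_allowed are hypothetical, and the rule list is copied by hand from the table above.

```python
import re

# Rules for the "*" group, transcribed from the table above.
RULES = [
    ("disallow", "/71347885/"),
    ("disallow", "/core/dashboard"),
    ("disallow", "/core/*"),
    ("disallow", "/r/*"),
    ("disallow", "/mnt/*"),
    ("disallow", "/internal-api/*"),
    ("disallow", "/404.html"),
    ("allow", "/r/kappa/api/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    The longest matching pattern wins; on a tie, Allow beats Disallow.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


print(is_allowed("/r/kappa/api/v1/reader/news_sitemap.xml"))
print(is_allowed("/r/some-other-path"))
```

Crawlers that apply rules in file order instead of by match length could reach a different verdict for paths under /r/kappa/api/, which is why the sketch compares pattern lengths explicitly.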

Other Records

Field Value
sitemap https://www.indy100.com/sitemap.xml
sitemap https://www.indy100.com/sitemap_video.xml
sitemap https://www.indy100.com/r/kappa/api/v1/reader/news_sitemap.xml
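
The sitemap records above can also be read programmatically. The sketch below uses Python's standard urllib.robotparser (its site_maps() method needs Python 3.8 or later). The ROBOTS_TXT string is a reconstruction from the records listed on this page, not the file as served, so its bytes will not reproduce the hash shown in the scan details.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the records above (an assumption, not the served file).
ROBOTS_TXT = """\
User-agent: *
Disallow: /71347885/
Disallow: /core/dashboard
Disallow: /core/*
Disallow: /r/*
Disallow: /mnt/*
Disallow: /internal-api/*
Disallow: /404.html
Allow: /r/kappa/api/*

Sitemap: https://www.indy100.com/sitemap.xml
Sitemap: https://www.indy100.com/sitemap_video.xml
Sitemap: https://www.indy100.com/r/kappa/api/v1/reader/news_sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns the Sitemap URLs in file order, or None if there are none.
sitemaps = parser.site_maps()
for url in sitemaps or []:
    print(url)
```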