24heures.ch
robots.txt

Robots Exclusion Standard data for 24heures.ch

Resource Scan

Scan Details

Site Domain 24heures.ch
Base Domain 24heures.ch
Scan Status Ok
Last Scan 2024-06-22T06:39:41+00:00
Next Scan 2024-06-29T06:39:41+00:00
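
The two timestamps above are exactly seven days apart, which is consistent with a weekly rescan, although the report does not state the schedule. A minimal sketch of that arithmetic, where the variable names are illustrative and not part of the report:

```python
from datetime import datetime

# Timestamps copied from the Scan Details records above (ISO 8601 with UTC offset).
last_scan = datetime.fromisoformat("2024-06-22T06:39:41+00:00")
next_scan = datetime.fromisoformat("2024-06-29T06:39:41+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan

if __name__ == "__main__":
    print(interval.days)  # prints 7
```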

Last Scan

Scanned 2024-06-22T06:39:41+00:00
URL https://24heures.ch/robots.txt
Redirect https://www.24heures.ch/robots.txt
Redirect Domain www.24heures.ch
Redirect Base 24heures.ch
Domain IPs 13.226.2.114, 13.226.2.43, 13.226.2.46, 13.226.2.82, 2600:9000:25ef:1800:e:5a66:ac0:93a1, 2600:9000:25ef:200:e:5a66:ac0:93a1, 2600:9000:25ef:2600:e:5a66:ac0:93a1, 2600:9000:25ef:2c00:e:5a66:ac0:93a1, 2600:9000:25ef:c00:e:5a66:ac0:93a1, 2600:9000:25ef:d400:e:5a66:ac0:93a1, 2600:9000:25ef:e200:e:5a66:ac0:93a1, 2600:9000:25ef:f600:e:5a66:ac0:93a1
Redirect IPs 13.226.2.114, 13.226.2.43, 13.226.2.46, 13.226.2.82, 2600:9000:21f8:3800:e:5a66:ac0:93a1, 2600:9000:21f8:4000:e:5a66:ac0:93a1, 2600:9000:21f8:7c00:e:5a66:ac0:93a1, 2600:9000:21f8:9400:e:5a66:ac0:93a1, 2600:9000:21f8:a600:e:5a66:ac0:93a1, 2600:9000:21f8:b200:e:5a66:ac0:93a1, 2600:9000:21f8:d400:e:5a66:ac0:93a1, 2600:9000:21f8:e400:e:5a66:ac0:93a1
Response IP 18.165.171.107
Found Yes
Hash a5cdb5ab0d98821c5e582e158e83b61fc9e8618a03770f8c8b0b2f558f7938ab
SimHash 50168b485b37

Groups

psbot
yandex
petalbot
mail.ru_bot
megaindex
baiduspider
yisouspider
bytespider
sogou web spider
sogou inst spider
proximic
admantx
seekport crawler
semrushbot
blexbot
mj12bot
dotbot
gptbot
ccbot
google-extended

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.24heures.ch/sitemaps/sitemapindex.xml
sitemap https://www.24heures.ch/sitemaps/news.xml
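
The groups and sitemap records above can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds an approximate robots.txt from the listed records. This reconstruction is an assumption: the exact text of the live file (including the one invalid line reported under Warnings) is not shown in this report. The names `BLOCKED_AGENTS`, `SITEMAPS`, `build_robots_txt`, and `make_parser` are illustrative.

```python
from urllib.robotparser import RobotFileParser

# User agents listed in the first group of this report (all share "Disallow: /").
BLOCKED_AGENTS = [
    "psbot", "yandex", "petalbot", "mail.ru_bot", "megaindex",
    "baiduspider", "yisouspider", "bytespider", "sogou web spider",
    "sogou inst spider", "proximic", "admantx", "seekport crawler",
    "semrushbot", "blexbot", "mj12bot", "dotbot", "gptbot", "ccbot",
    "google-extended",
]

# Sitemap records listed under "Other Records".
SITEMAPS = [
    "https://www.24heures.ch/sitemaps/sitemapindex.xml",
    "https://www.24heures.ch/sitemaps/news.xml",
]


def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt from the report (an assumption,
    not the site's actual file)."""
    lines = [f"User-agent: {agent}" for agent in BLOCKED_AGENTS]
    lines += ["Disallow: /", ""]
    lines += ["User-agent: *", "Allow: /", ""]
    lines += [f"Sitemap: {url}" for url in SITEMAPS]
    return "\n".join(lines) + "\n"


def make_parser() -> RobotFileParser:
    """Parse the reconstructed file without any network access."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser


if __name__ == "__main__":
    parser = make_parser()
    url = "https://www.24heures.ch/"
    for agent in ("GPTBot", "CCBot", "Googlebot"):
        print(agent, parser.can_fetch(agent, url))
    print(parser.site_maps())
```

Note that `urllib.robotparser` matches user agents case-insensitively by substring, so `GPTBot` falls into the disallowed group while an agent that is not listed, such as `Googlebot`, falls through to the `*` group. Other parsers may match differently.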

Comments

  • Disallow commercial bots to prevent ad fraud, see DISC-2117
  • Allow crawling for other bots

Warnings

  • 1 invalid line.