infolocale.actu.fr
robots.txt

Robots Exclusion Standard data for infolocale.actu.fr

Resource Scan

Scan Details

Site Domain infolocale.actu.fr
Base Domain actu.fr
Scan Status Ok
Last Scan 2024-11-07T13:49:13+00:00
Next Scan 2024-12-07T13:49:13+00:00

Last Scan

Scanned 2024-11-07T13:49:13+00:00
URL https://infolocale.actu.fr/robots.txt
Domain IPs 212.95.74.38
Response IP 212.95.74.38
Found Yes
Hash 2be28815292f251c59eb60d4e393eeac0642c42c6a664fb467190f08319377f9
SimHash 4095d1f1e751

Groups

mediapartners-google
googlebot
googlebot-image
googlebot-news
googlebot-video
adsbot-google
storebot-google
adsbot-google-mobile
apis-google
twitterbot
applebot
ouestfrancebot
taboolabot
proximic
upday
bingbot
exabot
slurp
ia_archiver
grapeshot

Rule Path
Disallow /follow-organism/*
Disallow /organismes/*/contact
Disallow /login
Disallow /logout

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

*

Rule Path
Allow /
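
The groups above can be checked programmatically. The following is a minimal sketch, not the site's actual file: it rebuilds a robots.txt from the scanned groups (the exact original text, casing, and comments are assumptions, since the scan only lists normalized group names and rules) and queries it with Python's standard urllib.robotparser. The names SHARED_AGENTS, BLOCKED_AGENTS, build_robots_txt, and allowed are hypothetical helpers introduced for illustration.

```python
from urllib.robotparser import RobotFileParser

# Agents that share the first rule group in the scan (names as listed above).
SHARED_AGENTS = [
    "mediapartners-google", "googlebot", "googlebot-image", "googlebot-news",
    "googlebot-video", "adsbot-google", "storebot-google",
    "adsbot-google-mobile", "apis-google", "twitterbot", "applebot",
    "ouestfrancebot", "taboolabot", "proximic", "upday", "bingbot",
    "exabot", "slurp", "ia_archiver", "grapeshot",
]

# Agents that each have their own "Disallow /" group in the scan.
BLOCKED_AGENTS = [
    "ccbot", "google-extended", "gptbot", "chatgpt-user",
    "anthropic-ai", "claude-web", "applebot-extended",
]

BASE = "https://infolocale.actu.fr"


def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt from the scanned groups (assumed layout)."""
    lines = [f"User-agent: {agent}" for agent in SHARED_AGENTS]
    lines += [
        "Disallow: /follow-organism/*",
        "Disallow: /organismes/*/contact",
        "Disallow: /login",
        "Disallow: /logout",
    ]
    for agent in BLOCKED_AGENTS:
        lines += ["", f"User-agent: {agent}", "Disallow: /"]
    lines += ["", "User-agent: *", "Allow: /"]
    return "\n".join(lines) + "\n"


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if the reconstructed rules let `agent` fetch `path`."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for agent, path in [("googlebot", "/"), ("googlebot", "/login"),
                        ("gptbot", "/"), ("examplecrawler", "/login")]:
        print(agent, path, allowed(agent, path))
```

Two caveats on the design choice: the standard-library parser matches rule paths as plain prefixes and does not expand the `*` wildcard, and it matches agent names by substring, so the wildcard rules and closely named agents (for example applebot versus applebot-extended) may be evaluated differently than by a crawler's own parser. The sketch therefore only queries literal paths and unambiguous agent names.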

Comments

  • Allowed search engines directives
  • Paths
  • Exclusions User agent: