actu.capital.fr
robots.txt

Robots Exclusion Standard data for actu.capital.fr

Resource Scan

Scan Details

Site Domain actu.capital.fr
Base Domain capital.fr
Scan Status Ok
Last Scan 2024-06-28T16:15:06+00:00
Next Scan 2024-07-05T16:15:06+00:00

Last Scan

Scanned 2024-06-28T16:15:06+00:00
URL https://actu.capital.fr/robots.txt
Response IP 104.76.130.46
Found Yes
Hash 0969ca476ecf3ed00b0535fbb7af8a5b456fa17d61ebfa12ae8bac19895eea40
SimHash 8840d000c771
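The Hash field above is 64 hexadecimal characters, which is the length of a SHA-256 digest. As a minimal sketch, assuming the value is the SHA-256 of the raw response body (the scan does not state the algorithm or what exactly was hashed), a later fetch could be compared against it as follows. The names `body_digest` and `is_unchanged` are hypothetical helpers introduced for illustration.

```python
import hashlib

# Value of the "Hash" field recorded by the scan above.
RECORDED_HASH = "0969ca476ecf3ed00b0535fbb7af8a5b456fa17d61ebfa12ae8bac19895eea40"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value."""
    return body_digest(body) == recorded.lower()
```

If the assumption about the algorithm is wrong, `is_unchanged` will simply return False for every body, so a mismatch alone does not prove that the file changed.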

Groups

twitterbot

Rule Path
Disallow (empty)

grapeshot

Rule Path
Disallow (empty)

*

Rule Path
Allow /ads.txt
Disallow /

facebookexternalhit

Rule Path
Disallow (empty)

mediapartners-google

Rule Path
Allow /
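The groups above can be read as follows: a Disallow rule with an empty path blocks nothing, so twitterbot, grapeshot and facebookexternalhit may fetch everything, mediapartners-google is allowed everything explicitly, and every other crawler is limited to /ads.txt. The sketch below checks that reading with Python's standard urllib.robotparser. The robots.txt text in it is a reconstruction from the listed groups, not the file as served, and `ExampleBot` and `/some-article` are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction (assumption): rebuilt from the groups listed in the scan;
# the exact text, comments and spacing of the served file are not known.
ROBOTS_TXT = """\
User-agent: twitterbot
Disallow:

User-agent: grapeshot
Disallow:

User-agent: *
Allow: /ads.txt
Disallow: /

User-agent: facebookexternalhit
Disallow:

User-agent: mediapartners-google
Allow: /
"""


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(agent: str, path: str) -> bool:
    """Check whether `agent` may fetch `path` on actu.capital.fr."""
    return build_parser().can_fetch(agent, "https://actu.capital.fr" + path)


if __name__ == "__main__":
    agents = ("twitterbot", "mediapartners-google", "ExampleBot")
    for agent in agents:
        for path in ("/ads.txt", "/some-article"):
            print(agent, path, is_allowed(agent, path))
```

urllib.robotparser applies the first matching rule in a group, whereas RFC 9309 prefers the longest matching path. For the `*` group here both approaches agree: /ads.txt is allowed and every other path is disallowed.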