www.quotidiano.ilsole24ore.com
robots.txt

Robots Exclusion Standard data for www.quotidiano.ilsole24ore.com

Resource Scan

Scan Details

Site Domain www.quotidiano.ilsole24ore.com
Base Domain ilsole24ore.com
Scan Status Ok
Last Scan 2024-06-23T17:47:10+00:00
Next Scan 2024-07-07T17:47:10+00:00
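
The two timestamps above imply a fixed rescan interval. As a quick check, the sketch below parses both values and computes the gap. The variable names are illustrative, and nothing here reflects how the scanner itself schedules its work.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details fields above.
last_scan = datetime.fromisoformat("2024-06-23T17:47:10+00:00")
next_scan = datetime.fromisoformat("2024-07-07T17:47:10+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)
```

For these values the gap works out to exactly two weeks.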

Last Scan

Scanned 2024-06-23T17:47:10+00:00
URL https://www.quotidiano.ilsole24ore.com/robots.txt
Domain IPs 35.219.242.145
Response IP 35.219.242.145
Found Yes
Hash 0b2d469ee6be8b50b15a38b4931a05f53e06bb56546a5c5491ac54998ca1068c
SimHash 880ac008c5b7
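
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so the sketch below rests on that assumption. The function name `body_fingerprint` is hypothetical. The SimHash field is not reproduced here because its construction is not described.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: the report's Hash field is SHA-256; the source does not say so.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical file body, not the real content of the scanned robots.txt.
sample = b"User-agent: *\nDisallow: /\n"
print(len(body_fingerprint(sample)))  # digest length in hex characters
```

Comparing this digest between scans would show whether the file changed byte for byte, while a similarity hash is usually meant to flag near-duplicates.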

Groups

facebookexternalhit

Rule Path
Allow /

twitterbot

Rule Path
Disallow /
Allow /art.php?*
Allow /_deploy/*/*.jpg
Allow /sfoglio/aviator.php?*articleId=*

linkedinbot

Rule Path
Disallow /
Allow /art.php?*
Allow /_deploy/*/*.jpg
Allow /sfoglio/aviator.php?*articleId=*

*

Rule Path
Disallow /
Allow /art.php?*
Allow /_deploy/*/*.jpg
Allow /sfoglio/aviator.php?*articleId=*
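
The twitterbot, linkedinbot, and * groups share one rule set: a blanket Disallow with three Allow exceptions. The sketch below shows how such a group could be evaluated under the longest-match convention described in RFC 9309, where the most specific matching pattern wins and Allow wins a tie. It is a minimal illustration, not the scanner's or any crawler's actual code. The helper names `pattern_to_regex` and `is_allowed` and the sample paths are hypothetical. Python's standard `urllib.robotparser` is not used because it does not expand `*` wildcards.

```python
import re

# Rules for the "*" group, copied from the listing above.
RULES = [
    ("disallow", "/"),
    ("allow", "/art.php?*"),
    ("allow", "/_deploy/*/*.jpg"),
    ("allow", "/sfoglio/aviator.php?*articleId=*"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Every other character, including '?', is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply the longest matching rule; Allow wins when lengths are equal."""
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]


# Hypothetical paths chosen to exercise each rule.
for path in ["/index.html", "/art.php?id=1", "/_deploy/2024/cover.jpg",
             "/sfoglio/aviator.php?edition=x&articleId=5", "/art.php"]:
    print(path, is_allowed(path))
```

Under these rules only article pages with a query string, deployed JPEG images, and viewer URLs carrying an articleId parameter are crawlable. Everything else, including `/art.php` without a query string, falls back to the blanket Disallow.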