sfogliami.it
robots.txt

Robots Exclusion Standard data for sfogliami.it

Resource Scan

Scan Details

Site Domain sfogliami.it
Base Domain sfogliami.it
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-10-05T21:18:13+00:00
Next Scan 2024-10-06T21:18:13+00:00

Last Successful Scan

Scanned2024-09-28T21:17:52+00:00
URL https://www.sfogliami.it/robots.txt
Domain IPs 89.46.109.43
Response IP 89.46.109.43
Found Yes
Hash 4907d92eb7b3bcf813f6d4281a82ab2c770c9eb7d3d2b6f39d6d4a05ebb7ce4a
SimHash 657ccca2ca9b

Groups

domaincrawler

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

*

Rule Path
Disallow /*.jpg
Disallow /*.JPG
Disallow /*.png
Disallow /*.PDF
Disallow /*.pdf
Disallow /*.mp3
Disallow /*.MOV
Disallow /*.mov
Disallow /*.AVI
Disallow /*.avi
Disallow /*.csv
Disallow /*.data

Other Records

Field Value
crawl-delay 2

Other Records

Field Value
sitemap http://www.sfogliami.it/sitemap.php