hitparadeitalia.it
robots.txt

Robots Exclusion Standard data for hitparadeitalia.it

Resource Scan

Scan Details

Site Domain hitparadeitalia.it
Base Domain hitparadeitalia.it
Scan Status Ok
Last Scan 2024-11-11T04:19:45+00:00
Next Scan 2024-11-18T04:19:45+00:00

Last Scan

Scanned 2024-11-11T04:19:45+00:00
URL https://hitparadeitalia.it/robots.txt
Domain IPs 195.72.197.4
Response IP 195.72.197.4
Found Yes
Hash aea3ecc15966a2574dae3914a4607fd8be49e2c157b3fbe2c7e5b4a9367a7ba5
SimHash 802612f28993

Groups

*

Rule Path
Allow /

httrack*

Rule Path
Allow /

httrack 3.0x

Rule Path
Disallow /

bloodhound*

Rule Path
Disallow /

deweb*

Rule Path
Disallow /

doc*

Rule Path
Disallow /

download ninja*

Rule Path
Disallow /

fetch*

Rule Path
Disallow /

israbot*

Rule Path
Disallow

jbh*

Rule Path
Disallow

larbin*

Rule Path
Disallow /

libwww*

Rule Path
Disallow /

linko*

Rule Path
Disallow /

microsoft.url.control*

Rule Path
Disallow /

msiecrawler*

Rule Path
Disallow /

npbot*

Rule Path
Disallow /

offline explorer*

Rule Path
Disallow /

orthogaffe*

Rule Path
Disallow

packrat*

Rule Path
Disallow /

physics*

Rule Path
Disallow /

psbot*

Rule Path
Disallow /

pycurl*

Rule Path
Disallow /

sitecheck*

Rule Path
Disallow /

sitesnagger*

Rule Path
Disallow /

spiderbot*

Rule Path
Disallow /

teleport*

Rule Path
Disallow /

tlspider*

Rule Path
Disallow /

ubicrawler*

Rule Path
Disallow /

webcollage*

Rule Path
Disallow /

webcopier*

Rule Path
Disallow /

webmirror*

Rule Path
Disallow /

webstripper*

Rule Path
Disallow /

webzip*

Rule Path
Disallow /

xenu*

Rule Path
Disallow /

zao*

Rule Path
Disallow /

zealbot*

Rule Path
Disallow /

zyborg*

Rule Path
Disallow /
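As a rough illustration of how the groups above might be evaluated, the following Python sketch picks one group for a given user agent and then applies its rules. It is not the scanner's own logic and not a complete Robots Exclusion Protocol implementation. Several things here are assumptions: the names `GROUPS`, `select_group` and `is_allowed` are hypothetical; a trailing `*` in a group name such as `larbin*` is treated as a prefix wildcard, which is a convention some crawlers follow but not something the standard guarantees; and only a handful of the groups listed above are copied in. The one behaviour taken directly from the standard is that a `Disallow` with an empty path blocks nothing.

```python
# Hypothetical subset of the groups listed in this report.
# Each group maps a user-agent token to a list of (rule, path) pairs.
GROUPS = {
    "*": [("allow", "/")],
    "httrack*": [("allow", "/")],
    "httrack 3.0x": [("disallow", "/")],
    "larbin*": [("disallow", "/")],
    "israbot*": [("disallow", "")],  # empty path: nothing is disallowed
}


def select_group(user_agent, groups):
    """Return the most specific group token matching user_agent.

    Assumption: a token ending in '*' matches any agent that starts with
    the text before the '*'; other tokens must match exactly. The bare
    '*' group is the fallback.
    """
    ua = user_agent.lower()
    best = None
    for token in groups:
        if token == "*":
            continue
        stem = token.rstrip("*")
        matches = ua.startswith(stem) if token.endswith("*") else ua == token
        if matches and (best is None or len(stem) > len(best.rstrip("*"))):
            best = token
    return best if best is not None else "*"


def is_allowed(user_agent, path, groups=GROUPS):
    """Apply the selected group's rules; the longest matching path wins,
    and Allow wins a tie. With no matching rule, the path is allowed."""
    rules = groups[select_group(user_agent, groups)]
    best_len, allowed = -1, True
    for kind, prefix in rules:
        if prefix == "" or not path.startswith(prefix):
            continue  # an empty path matches nothing
        if len(prefix) > best_len or (len(prefix) == best_len and kind == "allow"):
            best_len, allowed = len(prefix), kind == "allow"
    return allowed


print(is_allowed("larbin", "/"))        # group larbin*, Disallow /
print(is_allowed("israbot", "/page"))   # group israbot*, empty Disallow
print(is_allowed("httrack 3.0x", "/"))  # exact group beats httrack*
print(is_allowed("Googlebot", "/"))     # falls back to the * group
```

Under these assumptions, most of the named agents are blocked from the whole site, while `israbot*`, `jbh*` and `orthogaffe*` (empty `Disallow`) and every agent that only matches `*` remain free to crawl.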

Warnings

  • 4 invalid lines.