marcocobianchi.it
robots.txt

Robots Exclusion Standard data for marcocobianchi.it

Resource Scan

Scan Details

Site Domain marcocobianchi.it
Base Domain marcocobianchi.it
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-12T19:07:16+00:00
Next Scan 2024-12-11T19:07:16+00:00

Last Successful Scan

Scanned2022-10-27T05:18:23+00:00
URL https://marcocobianchi.it/robots.txt
Response IP 178.33.174.50
Found Yes
Hash e7097d2af8b6f034cd471e99765b7d403baff50c4b2d3e8e76a8cc2629d10626
SimHash 9361b37f8a43

Groups

*

Rule Path
Disallow /cgi-bin/

spbot
mj12bot
rogerbot
ahrefsbot
dotbot
exabot
semrushbot
gigabot
sitebot
jamesbot
rogerbot
exabot
mj12bot
dotbot
gigabot
ahrefsbot
blackwidow
bot\ [email="craftbot@yahoo.com"]mailto:craftbot@yahoo.com[/email]
chinaclaw
custo
disco
download\ demon
ecatch
eirgrabber
emailsiphon
emailwolf
express\ webpictures
extractorpro
eyenetie
flashget
getright
getweb!
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
jetcar
joc\ web\ spider
larbin
leechftp
mass\ downloader
midown\ tool
mister\ pix
navroad
nearsite
netants
netspider
net\ vampire
netzip
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
realdownload
reget
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
teleport\ pro
voideye
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wwwoffle
xaldon\ webspider
zeus

Rule Path
Disallow /