analisi-logica.it
robots.txt

Robots Exclusion Standard data for analisi-logica.it

Resource Scan

Scan Details

Site Domain analisi-logica.it
Base Domain analisi-logica.it
Scan Status Ok
Last Scan2024-09-27T10:00:27+00:00
Next Scan 2024-10-04T10:00:27+00:00

Last Scan

Scanned2024-09-27T10:00:27+00:00
URL https://analisi-logica.it/robots.txt
Domain IPs 77.81.225.78
Response IP 77.81.225.78
Found Yes
Hash 2c0abb331c7da9c1093588d70b3f79643787b2a26b28e43203511806599f5ec4
SimHash 6290f811b7dc

Groups

*

Rule Path
Disallow /_ads/*
Disallow /_avanzi/*
Disallow /_comuni/*
Disallow /_facebook/*
Disallow /_font/*
Disallow /_funzioni/*
Disallow /_google/*
Disallow /_javas/*
Disallow /_jsonld/*
Disallow /_mailer/*
Disallow /_pagina/*
Disallow /_twitter/*
Disallow /_utilita/*
Disallow /*.php
Disallow /*.sql
Disallow /*.csv

grapeshot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zeabot

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

httrack

Rule Path
Disallow /

xenu

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

kspider

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

nimblecrawler

Rule Path
Disallow /

usyd-nlp-spider

Rule Path
Disallow /

shim-crawler

Rule Path
Disallow /

myengines-bot

Rule Path
Disallow /

kfsw-bot

Rule Path
Disallow /

sbider

Rule Path
Disallow /

localcombot

Rule Path
Disallow /

iccrawler

Rule Path
Disallow /

knowitall

Rule Path
Disallow /

dcbspider

Rule Path
Disallow /

gaisbot

Rule Path
Disallow /

cfetch

Rule Path
Disallow /

findlinks

Rule Path
Disallow /

gonzo
gonzo
gonzop
gonzop

Rule Path
Disallow /

moni

Rule Path
Disallow /

georgios

Rule Path
Disallow /

cydralspider

Rule Path
Disallow /

objectssearch

Rule Path
Disallow /

hoowwwer

Rule Path
Disallow /

jemmathetourist

Rule Path
Disallow /

btbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

irlbot

Rule Path
Disallow /

becomebot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

amfibibot

Rule Path
Disallow /

gridbot

Rule Path
Disallow /

sna

Rule Path
Disallow /

tamu_cs_irl_crawler

Rule Path
Disallow /

npt

Rule Path
Disallow /

bruinbot

Rule Path
Disallow /

zipppbot

Rule Path
Disallow /

molbsy

Rule Path
Disallow /

phpdig

Rule Path
Disallow /

goforit.com

Rule Path
Disallow /

goforit

Rule Path
Disallow /

larbin

Rule Path
Disallow /

appie

Rule Path
Disallow /

libwww

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

sohu-search

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

webzip

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

linko

Rule Path
Disallow /

rpt-httpclient

Rule Path
Disallow /

dumbot

Rule Path
Disallow /

cowbot

Rule Path
Disallow /

superget

Rule Path
Disallow /

psbot

Rule Path
Disallow /

szukacz

Rule Path
Disallow /

antibot

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

jetbot

Rule Path
Disallow /

iconsurf

Rule Path
Disallow /

speedy

Rule Path
Disallow /

npbot

Rule Path
Disallow /

tutorgig

Rule Path
Disallow /

searchspider

Rule Path
Disallow /

lachesis

Rule Path
Disallow /

quepasacreep

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

haste

Rule Path
Disallow /

netresearchserver

Rule Path
Disallow /

nutch

Rule Path
Disallow /

nutchorg

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

steeler

Rule Path
Disallow /

ultraseek

Rule Path
Disallow /

spinne

Rule Path
Disallow /

spider_monkey

Rule Path
Disallow /

ixe crawler

Rule Path
Disallow /

coolbot

Rule Path
Disallow /

vse/.

Rule Path
Disallow /

Comments

  • crawl-delay: 10
  • Sitemap: https://analisi-logica.it/sitemap_archivium.php
  • MARKETING AND REALTIME BIDDING BOTS
  • ORACLE https://www.oracle.com/corporate/acquisitions/grapeshot/crawler.html
  • https://ahrefs.com/robot
  • FA RICHIESTE A /calendar/view.php?view=month&time=1338501600&lang=it CHE NON CAPISCO
  • https://www.semrush.com/bot.html
  • https://www.trendiction.com/en/publisher/bot
  • https://www.proximic.com/info/spider.php
  • Some bots are known to be trouble, particularly those designed to copy
  • entire sites. Please obey robots.txt.

Warnings

  • 2 invalid lines.