giorgiograesan.it
robots.txt

Robots Exclusion Standard data for giorgiograesan.it

Resource Scan

Scan Details

Site Domain giorgiograesan.it
Base Domain giorgiograesan.it
Scan Status Ok
Last Scan 2024-11-13T19:13:17+00:00
Next Scan 2024-12-13T19:13:17+00:00

Last Scan

Scanned 2024-11-13T19:13:17+00:00
URL https://giorgiograesan.it/robots.txt
Domain IPs 37.247.55.132
Response IP 37.247.55.132
Found Yes
Hash b12d9428322edd77070ff0445dac02be46989640b8bbf0142ad228a029ab9ca1
SimHash 485c4543669b
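
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a fingerprint of a fetched robots.txt body could be computed as follows to detect changes between scans. The function names `fingerprint` and `has_changed` are hypothetical, and any normalization the scanner applies before hashing is unknown, so the digest is not guaranteed to match the value shown.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of the raw robots.txt bytes.

    SHA-256 is an assumption based on the 64-character Hash field.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Report whether a newly fetched body differs from a stored digest."""
    return fingerprint(body) != previous_digest
```

A scanner could store the digest after each scan and call `has_changed` on the next one to decide whether the groups need to be re-parsed.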

Groups

a6-indexer

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

alphaseobot

Rule Path
Disallow /

alphaseobot-sa

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

blackboard safeassign

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

exabot-thumbnails

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

liebaofast

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mqqbrowser

Rule Path
Disallow /

nimbostratus-bot/v1.3.2

Rule Path
Disallow /

nutch

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

spiderbot

Rule Path
Disallow /

spiderbot/nutch-1.7

Rule Path
Disallow /

sputnikbot/2.3

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

ucbrowser

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

*

Rule Path
Disallow /showroom/
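
The groups above can be read as: each named crawler is barred from the whole site, while every other crawler is barred only from /showroom/. The sketch below checks that reading with Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a reconstruction of three of the listed groups plus the catch-all group, not the actual file (whose casing, ordering, and comments may differ), and `is_allowed` and the agent name `ExampleCrawler` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment based on the groups in this report (an assumption,
# not a copy of the live file).
ROBOTS_TXT = """\
User-agent: ahrefsbot
Disallow: /

User-agent: semrushbot
Disallow: /

User-agent: yandexbot
Disallow: /

User-agent: *
Disallow: /showroom/
"""


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if user_agent may fetch path under the fragment above."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, "https://giorgiograesan.it" + path)


if __name__ == "__main__":
    checks = [
        ("AhrefsBot", "/"),
        ("ExampleCrawler", "/"),
        ("ExampleCrawler", "/showroom/index.html"),
    ]
    for agent, path in checks:
        print(agent, path, is_allowed(agent, path))
```

`urllib.robotparser` matches agent names case-insensitively and treats each Disallow path as a prefix; individual crawlers may interpret the file differently.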