fogliettoillustrativo.net
robots.txt

Robots Exclusion Standard data for fogliettoillustrativo.net

Resource Scan

Scan Details

Site Domain fogliettoillustrativo.net
Base Domain fogliettoillustrativo.net
Scan Status Ok
Last Scan2024-09-25T19:41:34+00:00
Next Scan 2024-10-02T19:41:34+00:00

Last Scan

Scanned2024-09-25T19:41:34+00:00
URL https://fogliettoillustrativo.net/robots.txt
Redirect https://www.fogliettoillustrativo.net/robots.txt
Redirect Domain www.fogliettoillustrativo.net
Redirect Base fogliettoillustrativo.net
Domain IPs 104.21.67.108, 172.67.221.97, 2606:4700:3031::6815:436c, 2606:4700:3037::ac43:dd61
Redirect IPs 104.21.67.108, 172.67.221.97, 2606:4700:3031::6815:436c, 2606:4700:3037::ac43:dd61
Response IP 104.21.67.108
Found Yes
Hash 747d006a28677855d12453a31a2108fa48e43244f7fdc3ef65fe6d874666f618
SimHash c2548412ffb0

Groups

clickagy intelligence bot

Rule Path
Disallow /

clickagy intelligence bot v2

Rule Path
Disallow /

*

Rule Path
Disallow /admin/

seekport crawler

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

ltx71 - (http://ltx71.com/)

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

Comments

  • disable MauiBot (crawler.feedback+wc@gmail.com)
  • disable http://law.di.unimi.it/BUbiNG.html
  • http://www.majestic12.co.uk/projects/dsearch/mj12bot.php
  • Backlink Analysis
  • https://megaindex.com/crawler
  • https://www.qwant.com/