Robots Exclusion Standard data for braccialini.it

Resource Scan

Scan Details

Site Domain braccialini.it
Base Domain braccialini.it
Scan Status Ok
Last Scan 2024-09-06T11:10:08+00:00
Next Scan 2024-10-06T11:10:08+00:00

Last Scan

Scanned 2024-09-06T11:10:08+00:00
URL https://braccialini.it/robots.txt
Redirect https://www.braccialini.it/robots.txt
Redirect Domain www.braccialini.it
Redirect Base braccialini.it
Domain IPs 104.21.53.198, 172.67.218.116, 2606:4700:3031::ac43:da74, 2606:4700:3032::6815:35c6
Redirect IPs 104.21.53.198, 172.67.218.116, 2606:4700:3031::ac43:da74, 2606:4700:3032::6815:35c6
Response IP 172.67.218.116
Found Yes
Hash e293714762acf89f8cadbb9aabd79b2a6ee8bd54873e05dd488b55392781e1cb
SimHash 254a7375e7fc

Groups

*

Rule Path
Disallow */reserved/*
Disallow */checkout
Disallow /fr_fr/*
Disallow /eu_en/*
Disallow /ru_ru/*
Disallow /de_de/*
Disallow /us_en/*
Disallow /int_en/*
Disallow /es_es/*
Disallow /ea_en/*
Disallow /ba_en/*
Disallow /row_en/*
Disallow /it_it/*
Disallow /uk_en/*
Disallow /eu_de/*
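
The rules above rely on the `*` wildcard, which the basic prefix matching of the original Robots Exclusion Standard does not cover. As a rough illustration of how a wildcard-aware crawler might evaluate them, the sketch below turns each pattern into a regular expression. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, the rule list is a subset copied from the table, and the sample paths are invented for the example. It assumes the commonly used semantics where `*` matches any run of characters and a trailing `$` anchors the end of the path.

```python
import re

def rule_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path, disallow_rules):
    """Return True if any Disallow pattern matches from the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in disallow_rules)

# Subset of the '*' group listed above.
RULES = ["*/reserved/*", "*/checkout", "/fr_fr/*", "/it_it/*"]

# The sample paths are made up for illustration.
print(is_disallowed("/it_it/borse", RULES))           # True
print(is_disallowed("/shop/checkout", RULES))         # True
print(is_disallowed("/sitemaps/sitemap.xml", RULES))  # False
```

Because matching is by prefix, `*/checkout` also covers longer paths such as `/shop/checkout/step1`. This sketch ignores Allow rules and longest-match precedence, which a full implementation would need.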

facebookexternalhit/1.1

Rule Path
Disallow /csrf

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

adidxbot
bingbot

No rules defined. All paths allowed.
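
A crawler that finds a group naming it follows only that group and ignores the `*` group, which is why an empty group for adidxbot and bingbot leaves all paths open to them. The sketch below illustrates that selection step under simplifying assumptions: the `GROUPS` dictionary is a hand-copied subset of this report, `select_rules` is a hypothetical helper, and matching is a plain case-insensitive substring test on the User-Agent string rather than a full product-token comparison.

```python
# Hand-copied subset of the groups in this report (an assumption of this
# sketch, not a parse of the live file).
GROUPS = {
    "*": ["*/reserved/*", "*/checkout", "/it_it/*"],
    "mj12bot": ["/"],
    "adidxbot": [],
    "bingbot": [],
}

def select_rules(user_agent, groups=GROUPS):
    """Pick the Disallow list that applies to a given User-Agent string.

    A named group wins over '*'; the '*' group is only a fallback for
    crawlers that no named group matches.
    """
    ua = user_agent.lower()
    for token, rules in groups.items():
        if token != "*" and token in ua:
            return rules
    return groups.get("*", [])

# Example User-Agent strings, shown for illustration only.
print(select_rules("Mozilla/5.0 (compatible; bingbot/2.0)"))  # []
print(select_rules("MJ12bot/v1.4.8"))                         # ['/']
print(select_rules("ExampleCrawler/1.0") == GROUPS["*"])      # True
```

An empty list here means nothing is disallowed for that crawler, matching the "All paths allowed" note above.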

Other Records

Field Value
sitemap https://www.braccialini.it/sitemaps/sitemap.xml

Comments

  • 2024.02.22
  • BOTs Directives
  • Facebook crawler too many requests
  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot
  • Allow Bingbot(s) to crawl faster
  • Thanks Gucci developers for robots.txt tricks