braccialini.it
robots.txt

Robots Exclusion Standard data for braccialini.it

Resource Scan

Scan Details

Site Domain	braccialini.it
Base Domain	braccialini.it
Scan Status	Ok
Last Scan	2024-09-06T11:10:08+00:00
Next Scan	2024-10-06T11:10:08+00:00

Last Scan

Scanned	2024-09-06T11:10:08+00:00
URL	https://braccialini.it/robots.txt
Redirect	https://www.braccialini.it/robots.txt
Redirect Domain	www.braccialini.it
Redirect Base	braccialini.it
Domain IPs	104.21.53.198, 172.67.218.116, 2606:4700:3031::ac43:da74, 2606:4700:3032::6815:35c6
Redirect IPs	104.21.53.198, 172.67.218.116, 2606:4700:3031::ac43:da74, 2606:4700:3032::6815:35c6
Response IP	172.67.218.116
Found	Yes
Hash	e293714762acf89f8cadbb9aabd79b2a6ee8bd54873e05dd488b55392781e1cb
SimHash	254a7375e7fc
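
The 64-character hexadecimal Hash value has the length of a SHA-256 digest. The scan report does not name the algorithm, so the sketch below only assumes that it is SHA-256 computed over the raw response body. The function names are hypothetical and are not part of the scanning tool.

```python
import hashlib

# Hash value reported for the scan of 2024-09-06.
REPORTED_HASH = "e293714762acf89f8cadbb9aabd79b2a6ee8bd54873e05dd488b55392781e1cb"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Check a fetched robots.txt body against the reported hash.

    Assumption: the report hashes the unmodified response bytes with SHA-256.
    """
    return sha256_hex(body) == reported.lower()
```

If the assumption holds, a body fetched from the Redirect URL that still matches the reported hash indicates the file has not changed since the last scan.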

Groups

*

Rule	Path
Disallow	/reserved/
Disallow	*/checkout
Disallow	/fr_fr/*
Disallow	/eu_en/*
Disallow	/ru_ru/*
Disallow	/de_de/*
Disallow	/us_en/*
Disallow	/int_en/*
Disallow	/es_es/*
Disallow	/ea_en/*
Disallow	/ba_en/*
Disallow	/row_en/*
Disallow	/it_it/*
Disallow	/uk_en/*
Disallow	/eu_de/*
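
The paths above use the `*` wildcard. As a minimal sketch of how such patterns are commonly evaluated (following the matching rules described in RFC 9309, where `*` matches any run of characters, a trailing `$` anchors the end, and matching starts at the beginning of the path), the helper names below are hypothetical and only a few of the listed patterns are included.

```python
import re

# A subset of the Disallow patterns listed for the "*" group.
DISALLOW_PATTERNS = ["/reserved/", "*/checkout", "/fr_fr/*", "/it_it/*"]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal segments and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW_PATTERNS) -> bool:
    """Return True if any Disallow pattern matches from the start of path.

    The group has no Allow rules, so a single match is enough here.
    """
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

Under this sketch, `is_disallowed("/fr_fr/borse")` and `is_disallowed("/it_it/checkout")` are both true, while `is_disallowed("/borse")` is false.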

facebookexternalhit/1.1

Rule	Path
Disallow	/csrf

cazoodlebot

Rule	Path
Disallow	/

mj12bot

Rule	Path
Disallow	/

dotbot/1.0

Rule	Path
Disallow	/

gigabot

Rule	Path
Disallow	/

adidxbot
bingbot

No rules defined. All paths allowed.
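
A crawler that finds a group naming it uses only that group and ignores the `*` group. So, under RFC 9309 semantics, the empty adidxbot/bingbot group means the `*` Disallow rules do not apply to those crawlers. The sketch below illustrates that selection. It simplifies product-token matching to a case-insensitive comparison of the text before any `/`, and the function names and the abbreviated `*` rule list are illustrative assumptions.

```python
# Groups as listed in the scan; the "*" rule list is abbreviated here.
GROUPS = {
    "*": ["/reserved/", "*/checkout", "/fr_fr/*", "/it_it/*"],
    "facebookexternalhit/1.1": ["/csrf"],
    "cazoodlebot": ["/"],
    "mj12bot": ["/"],
    "dotbot/1.0": ["/"],
    "gigabot": ["/"],
    "adidxbot": [],
    "bingbot": [],
}


def product_token(name: str) -> str:
    """Drop any version suffix and lowercase (simplified token matching)."""
    return name.split("/", 1)[0].strip().lower()


def select_rules(crawler_name: str, groups=GROUPS) -> list:
    """Return the Disallow paths of the group that applies to a crawler."""
    wanted = product_token(crawler_name)
    for group_name, rules in groups.items():
        if group_name != "*" and product_token(group_name) == wanted:
            return rules  # a specific group replaces the "*" group entirely
    return groups.get("*", [])
```

For example, `select_rules("Bingbot")` returns an empty list, `select_rules("MJ12bot")` returns `["/"]`, and an unlisted crawler such as `"Googlebot"` falls back to the `*` rules.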

Other Records

Field	Value
sitemap	https://www.braccialini.it/sitemaps/sitemap.xml

Comments

2024.02.22
BOTs Directives
Facebook crawler too many requests
Block CazoodleBot as it does not present correct accept content headers
Block MJ12bot as it is just noise
Block dotbot as it cannot parse base urls properly
Block Gigabot
Allow Bingbot(s) to crawl faster
Thanks Gucci developers for robots.txt tricks
