aizubrandhall-ec.com
robots.txt
Robots Exclusion Standard data for aizubrandhall-ec.com
Resource Scan
Scan Details
Site Domain | aizubrandhall-ec.com |
Base Domain | aizubrandhall-ec.com |
Scan Status | Ok |
Last Scan | 2024-10-29T05:01:36+00:00 |
Next Scan | 2024-11-12T05:01:36+00:00 |
Last Scan
Scanned | 2024-10-29T05:01:36+00:00 |
URL | https://www.aizubrandhall-ec.com/robots.txt |
Domain IPs | 13.230.149.252, 3.113.186.52, 54.249.246.233 |
Response IP | 3.113.186.52 |
Found | Yes |
Hash | 54433003e0fca53a7d996fbdf367f26c55b21677d4c21e47e5a4b215ae0e8bd7 |
SimHash | c20cc810e6f3 |
Groups
thesis-research-bot
fidget-spinner-bot
my-tiny-bot
semrushbot
ahrefsbot
dotbot
mj12bot
amazonbot
go-http-client
geedoproductsearch
Rule | Path |
---|---|
Disallow | / |
bingbot
Rule | Path |
---|---|
Allow | / |
Disallow | /cart/ |
Disallow | /web_cart/ |
Disallow | /shops/ |
Disallow | /en/shops/ |
Disallow | /api/shops/ |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
*
Rule | Path |
---|---|
Allow | / |
Disallow | /cart/ |
Disallow | /web_cart/ |
Disallow | /shops/ |
Disallow | /en/shops/ |
Disallow | /api/shops/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.aizubrandhall-ec.com/sitemap.xml |