bata.pl
robots.txt

Robots Exclusion Standard data for bata.pl

Resource Scan

Scan Details

Site Domain bata.pl
Base Domain bata.pl
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-10-18T03:22:13+00:00
Next Scan 2025-01-16T03:22:13+00:00

Last Successful Scan

Scanned 2023-11-30T23:05:50+00:00
URL https://bata.pl/robots.txt
Redirect https://www.bata.com/robots.txt
Redirect Domain www.bata.com
Redirect Base bata.com
Domain IPs 104.16.49.40, 104.16.50.40
Redirect IPs 23.52.171.219, 23.59.168.178, 2600:1413:b000:1b::17d7:705, 2600:1413:b000:1b::17d7:717
Response IP 23.52.171.210
Found Yes
Hash f48fbbcfa1ea1b56e478b9637cc524c734f7e1028f6ceb6240a50474a7faa0a6
SimHash b95f70c287e7

Groups

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

fast

Rule Path
Disallow /

slurp

Rule Path
Disallow /

maxthon

Rule Path
Disallow /

cncdialer

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

*

Rule Path
Disallow /*/order/*
Disallow /*/ordini/*
Disallow /*/pedidos/*
Disallow /*/commandes/*
Disallow /*/objedn%C3%A1vky/*
Disallow /*/zam%C3%B3wienia/*
Disallow /*/pesanan/*
Disallow /*/myaccount/*
Disallow /*/mujucet/*
Disallow /*/mojucet/*
Disallow /*/mojekonto/*
Disallow /*/akun/*
Disallow /search?q=*
Disallow /*/search/*
Disallow /search?q=*
Disallow /*/search?q=*
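
The wildcard rules in the `*` group can be checked against a URL path with a small matcher. The following is a minimal sketch, assuming the common robots.txt convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The names `rule_to_regex`, `is_disallowed`, and the sample paths are hypothetical, and only a subset of the rules listed above is included. Because this group has no Allow rules, no longest-match precedence is implemented here.

```python
import re

# A subset of the Disallow patterns from the `*` group above.
DISALLOW = [
    "/*/order/*",
    "/*/zam%C3%B3wienia/*",
    "/*/myaccount/*",
    "/*/mojekonto/*",
    "/search?q=*",
    "/*/search/*",
    "/*/search?q=*",
]


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: `*` matches any sequence of characters and a trailing
    `$` anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches the start of `path`."""
    return any(rule_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    for sample in ["/pl/order/123", "/pl/search?q=buty", "/pl/buty/"]:
        print(sample, is_disallowed(sample))
```

Note that Python's standard `urllib.robotparser` does not interpret `*` inside paths, which is why the sketch translates the patterns by hand.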

Other Records

Field Value
sitemap https://www.bata.com/es/sitemap_index.xml
sitemap https://www.bata.com/sk/sitemap_index.xml
sitemap https://www.bata.com/pl/sitemap_index.xml
sitemap https://www.bata.com/in/sitemap_index.xml
sitemap https://www.bata.com/my/sitemap_index.xml
sitemap https://www.bata.com/th/sitemap_index.xml
sitemap https://www.bata.com/id/sitemap_index.xml
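
The sitemap records above can be grouped by the first path segment of each URL. This is a minimal sketch, assuming that segment (`es`, `sk`, `pl`, and so on) identifies a regional section of the site; the report itself does not say so. The function name `sitemaps_by_section` is hypothetical.

```python
from urllib.parse import urlparse

# Sitemap URLs exactly as listed in the records above.
SITEMAPS = [
    "https://www.bata.com/es/sitemap_index.xml",
    "https://www.bata.com/sk/sitemap_index.xml",
    "https://www.bata.com/pl/sitemap_index.xml",
    "https://www.bata.com/in/sitemap_index.xml",
    "https://www.bata.com/my/sitemap_index.xml",
    "https://www.bata.com/th/sitemap_index.xml",
    "https://www.bata.com/id/sitemap_index.xml",
]


def sitemaps_by_section(urls):
    """Map the first path segment of each sitemap URL to the URL.

    Assumption: the first segment names a regional section of the site.
    """
    result = {}
    for url in urls:
        segments = [s for s in urlparse(url).path.split("/") if s]
        if segments:
            result[segments[0]] = url
    return result


if __name__ == "__main__":
    # Look up the sitemap whose first path segment is "pl".
    print(sitemaps_by_section(SITEMAPS).get("pl"))
```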

Comments

  • Bots we do not need
  • TO-DO: Modify the following disallow to adapt our site and to avoid internal search
  • Order pages
  • Account pages
  • Search pages

Warnings

  • `host` is not a known field.