czesciameryka.pl
robots.txt

Robots Exclusion Standard data for czesciameryka.pl

Resource Scan

Scan Details

Site Domain czesciameryka.pl
Base Domain czesciameryka.pl
Scan Status Ok
Last Scan2024-11-09T15:59:34+00:00
Next Scan 2024-12-09T15:59:34+00:00

Last Scan

Scanned2024-11-09T15:59:34+00:00
URL https://czesciameryka.pl/robots.txt
Domain IPs 5.187.53.244
Response IP 5.187.53.244
Found Yes
Hash b99e014a6c1ba00ec79eb1cfad3be736d2d30c746228bd45c190ed081d71c871
SimHash 4b5b4442c693

Groups

*

Rule Path
Disallow /public/libraries
Disallow /print_product/*
Allow /

mj12bot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

a6-indexer

Rule Path
Disallow /

alphaseobot

Rule Path
Disallow /

alphaseobot-sa

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

blackboard safeassign

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

liebaofast

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

mauibot (crawler.feedback+wc@gmail.com)

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

mqqbrowser

Rule Path
Disallow /

nimbostratus-bot/v1.3.2

Rule Path
Disallow /

qwant-news

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sputnikbot/2.3

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

tinytestbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

ucbrowser

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://czesciameryka.pl/sitemap.xml