canada-today.com
robots.txt

Robots Exclusion Standard data for canada-today.com

Resource Scan

Scan Details

Site Domain canada-today.com
Base Domain canada-today.com
Scan Status Ok
Last Scan2024-06-13T16:37:16+00:00
Next Scan 2024-06-20T16:37:16+00:00

Last Scan

Scanned2024-06-13T16:37:16+00:00
URL https://canada-today.com/robots.txt
Domain IPs 185.104.28.21
Response IP 185.104.28.21
Found Yes
Hash 2a67c0068332f812eb2985743b22d5b60ec4707b4cc0f314c27b05e5098cccc1
SimHash 78966da0ddc0

Groups

*

Rule Path
Disallow /impressum/

Other Records

Field Value
crawl-delay 20

googlebot-image

Rule Path
Disallow /

yahoo! slurp

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

alexibot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

xenu

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

zumbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

linkfluence

Rule Path
Disallow /

veooz

Rule Path
Disallow /

veoozbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

mediawords

Rule Path
Disallow /

spbot

Rule Path
Disallow /

linkdexbot/2.0

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

deusu

Rule Path
Disallow /

feedlybot/1.0

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

jamesbot

Rule Path
Disallow /

r6_commentreader

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

toweyabot

Rule Path
Disallow /

python/3.5

Rule Path
Disallow /

yandex

Rule Path
Disallow /

idbot

Rule Path
Disallow /

idbot/1.1

Rule Path
Disallow /

addthis

Rule Path
Disallow /

vagabondo

Rule Path
Disallow /

findxbot

Rule Path
Disallow /

uipbot

Rule Path
Disallow /

umbot-ln/1.0

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

laserlikebot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

um-fc/1.0

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

buckyohare

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

yandexmobilebot

Rule Path
Disallow /

scoutjet

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

expo9

Rule Path
Disallow /

a6-indexer

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

barkrowler/0.9

Rule Path
Disallow /

lcc

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.zeitungheute.com/sitemap.xml
sitemap http://www.zeitungheute.com/articles.xml

Warnings

  • 2 invalid lines.