zut.tv
robots.txt

Robots Exclusion Standard data for zut.tv

Resource Scan

Scan Details

Site Domain	zut.tv
Base Domain	zut.tv
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-10-06T06:14:33+00:00
Next Scan	2025-01-04T06:14:33+00:00
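The Last Scan and Next Scan timestamps above are exactly 90 days apart. A small Python check of that arithmetic follows; the fixed 90-day rescan interval is inferred from the two values, not stated by the report, and the name `RESCAN_INTERVAL` is illustrative.

```python
from datetime import datetime, timedelta

# Assumption: the scanner reschedules on a fixed interval. The report does not
# say so; 90 days is simply the difference between the two timestamps shown.
RESCAN_INTERVAL = timedelta(days=90)

def next_scan(last_scan_iso):
    """Return the ISO 8601 timestamp one rescan interval after `last_scan_iso`."""
    last_scan = datetime.fromisoformat(last_scan_iso)
    return (last_scan + RESCAN_INTERVAL).isoformat()

computed = next_scan("2024-10-06T06:14:33+00:00")
```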

Last Successful Scan

Scanned	2023-05-23T04:57:50+00:00
URL	https://zut.tv/robots.txt
Redirect	https://www.zut.tv/robots.txt
Redirect Domain	www.zut.tv
Redirect Base	zut.tv
Domain IPs	104.21.41.82, 172.67.163.65, 2606:4700:3033::6815:2952, 2606:4700:3035::ac43:a341
Redirect IPs	104.21.41.82, 172.67.163.65, 2606:4700:3033::6815:2952, 2606:4700:3035::ac43:a341
Response IP	172.67.163.65
Found	Yes
Hash	761f15ce442e19dcd96e5602ff106ebd2f88f423052510029dcc9bc0d52a9bab
SimHash	38d41d09c578
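The Hash value above is 64 hexadecimal digits long, which is consistent with SHA-256, although the report does not name the algorithm. Under that assumption, a scanner could detect a changed robots.txt by comparing digests of the fetched body. The function names below are hypothetical.

```python
import hashlib

def fingerprint(body):
    """Return the SHA-256 hex digest of the raw response body (bytes).

    Assumption: SHA-256 is a guess based on the digest length in the report.
    """
    return hashlib.sha256(body).hexdigest()

def has_changed(body, previous_hash):
    """True if the body's digest differs from the previously stored one."""
    return fingerprint(body) != previous_hash.lower()

# Illustrative content only; this is not the real zut.tv file.
sample = b"User-agent: *\nDisallow: /admin/\n"
stored = fingerprint(sample)
```

Whether the real scanner hashes the raw bytes or a normalized form of the file is not stated, so a digest computed this way may not reproduce the value shown.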

Groups

googlebot-image

Rule	Path
Disallow	/

ahrefsbot

Rule	Path
Disallow	/

mj12bot

Rule	Path
Disallow	/

yandexbot

Rule	Path
Disallow	/

*

Rule	Path
Allow	/misc/*.css$
Allow	/misc/*.css?
Allow	/misc/*.js$
Allow	/misc/*.js?
Allow	/misc/*.gif
Allow	/misc/*.jpg
Allow	/misc/*.jpeg
Allow	/misc/*.png
Allow	/modules/*.css$
Allow	/modules/*.css?
Allow	/modules/*.js$
Allow	/modules/*.js?
Allow	/modules/*.gif
Allow	/modules/*.jpg
Allow	/modules/*.jpeg
Allow	/modules/*.png
Allow	/profiles/*.css$
Allow	/profiles/*.css?
Allow	/profiles/*.js$
Allow	/profiles/*.js?
Allow	/profiles/*.gif
Allow	/profiles/*.jpg
Allow	/profiles/*.jpeg
Allow	/profiles/*.png
Allow	/themes/*.css$
Allow	/themes/*.css?
Allow	/themes/*.js$
Allow	/themes/*.js?
Allow	/themes/*.gif
Allow	/themes/*.jpg
Allow	/themes/*.jpeg
Allow	/themes/*.png
Disallow	/includes/
Disallow	/misc/
Disallow	/modules/
Disallow	/profiles/
Disallow	/scripts/
Disallow	/themes/
Disallow	/CHANGELOG.txt
Disallow	/cron.php
Disallow	/INSTALL.mysql.txt
Disallow	/INSTALL.pgsql.txt
Disallow	/INSTALL.sqlite.txt
Disallow	/install.php
Disallow	/INSTALL.txt
Disallow	/LICENSE.txt
Disallow	/MAINTAINERS.txt
Disallow	/update.php
Disallow	/UPGRADE.txt
Disallow	/xmlrpc.php
Disallow	/admin/
Disallow	/comment/reply/
Disallow	/filter/tips/
Disallow	/node/add/
Disallow	/search/
Disallow	/user/register/
Disallow	/user/password/
Disallow	/user/login/
Disallow	/user/logout/
Disallow	/?q=admin%2F
Disallow	/?q=comment%2Freply%2F
Disallow	/?q=filter%2Ftips%2F
Disallow	/?q=node%2Fadd%2F
Disallow	/?q=search%2F
Disallow	/?q=user%2Fpassword%2F
Disallow	/?q=user%2Fregister%2F
Disallow	/?q=user%2Flogin%2F
Disallow	/?q=user%2Flogout%2F
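The table above mixes Allow and Disallow rules that overlap (for example, `/misc/*.css$` versus `/misc/`). A minimal Python sketch of how such rules can be evaluated, assuming the longest-match precedence described in RFC 9309, with `*` matching any run of characters and a trailing `$` anchoring the end of the path. Only a handful of the rules are copied here, the function names are illustrative, and real crawlers may normalize percent-encoding or break ties differently.

```python
import re

# A subset of the "*" group rules from the table above.
RULES = [
    ("allow", "/misc/*.css$"),
    ("allow", "/misc/*.css?"),
    ("allow", "/misc/*.png"),
    ("disallow", "/misc/"),
    ("disallow", "/admin/"),
    ("disallow", "/?q=admin%2F"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape each literal piece; '*' becomes "any characters".
    # Note that '?' is a literal question mark in robots.txt, not a wildcard.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            # The longer pattern wins; on equal length, prefer allow.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

With these rules, a stylesheet under `/misc/` stays crawlable because the Allow pattern is longer than the `/misc/` Disallow, while other files in that directory remain blocked.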

Other Records

Field	Value
crawl-delay	10
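The `crawl-delay` record asks crawlers to pause between requests. It is not part of RFC 9309, and some crawlers (Googlebot among them) ignore it, so the following is only a sketch of how a cooperative client might honor the value of 10, interpreted as seconds. The class name and the injectable `clock` and `sleep` parameters are illustrative.

```python
import time

class CrawlDelayGate:
    """Space successive requests at least `delay` seconds apart."""

    def __init__(self, delay=10.0, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time at which the previous request was released

    def wait(self):
        """Block until the delay has elapsed; return the seconds slept."""
        now = self._clock()
        pause = 0.0
        if self._last is not None:
            pause = max(0.0, self._last + self.delay - now)
            if pause:
                self._sleep(pause)
        self._last = now + pause
        return pause
```

A crawler would call `gate.wait()` immediately before each request to the host. Passing in a fake clock and sleep function makes the behavior checkable without actually waiting.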

sistrix

Rule	Path
Disallow	/

sistrix crawler

Rule	Path
Disallow	/

sistrix

Rule	Path
Disallow	/

seokicks-robot

Rule	Path
Disallow	/

jobs.de-robot

Rule	Path
Disallow	/

ahrefsbot

Rule	Path
Disallow	/

unisterbot

Rule	Path
Disallow	/

dotbot

Rule	Path
Disallow	/

searchmetricsbot

Rule	Path
Disallow	/

mj12bot

Rule	Path
Disallow	/

surveybot

Rule	Path
Disallow	/

seodiver

Rule	Path
Disallow	/

spbot

Rule	Path
Disallow	/

wotbox

Rule	Path
Disallow	/

dotbot

Rule	Path
Disallow	/

meanpathbot

Rule	Path
Disallow	/

backlinkcrawler

Rule	Path
Disallow	/

magpie-crawler

Rule	Path
Disallow	/

obot

Rule	Path
Disallow	/

fr-crawler

Rule	Path
Disallow	/

blexbot

Rule	Path
Disallow	/

megaindex.ru

Rule	Path
Disallow	/

megaindex.com

Rule	Path
Disallow	/

cloudservermarketspider

Rule	Path
Disallow	/

trendictionbot

Rule	Path
Disallow	/

exabot

Rule	Path
Disallow	/

careerbot

Rule	Path
Disallow	/

lipperhey-kaus-australis

Rule	Path
Disallow	/

seoscanners.net

Rule	Path
Disallow	/

metajobbot

Rule	Path
Disallow	/

spiderbot

Rule	Path
Disallow	/

linkstats

Rule	Path
Disallow	/

jobboersebot

Rule	Path
Disallow	/

iccrawler

Rule	Path
Disallow	/

plista

Rule	Path
Disallow	/

domain re-animator bot

Rule	Path
Disallow	/

lipperhey-kaus-australis

Rule	Path
Disallow	/

turnitinbot

Rule	Path
Disallow	/

coccoc

Rule	Path
Disallow	/

um-ic

Rule	Path
Disallow	/

mindupbot

Rule	Path
Disallow	/

sg-orbiter

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow

qwantify

Rule	Path
Disallow

kraken

Rule	Path
Disallow

plukkie

Rule	Path
Disallow

safednsbot

Rule	Path
Disallow

haosouspider

Rule	Path
Disallow

rogerbot

Rule	Path
Disallow

openhosebot

Rule	Path
Disallow

screaming frog seo spider

Rule	Path
Disallow

thumbsniper

Rule	Path
Disallow

r6_commentreader

Rule	Path
Disallow

implisensebot

Rule	Path
Disallow

cliqzbot

Rule	Path
Disallow

aihitbot

Rule	Path
Disallow

trendictionbot

Rule	Path
Disallow

wbsearchbot

Rule	Path
Disallow

bingbot

Rule	Path
Disallow

semrushbot

Rule	Path
Disallow

semrushbot-sa

Rule	Path
Disallow

alphaseobot

Rule	Path
Disallow

alphaseobot-sa

Rule	Path
Disallow

yahoo-mmcrawler

Rule	Path
Disallow

slurp

Rule	Path
Disallow

facebookexternalhit

Rule	Path
Disallow

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
CSS, JS, Images
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)
Disallow: Sistrix
Disallow: Sistrix
Disallow: Sistrix
Disallow: SEOkicks-Robot
Disallow: jobs.de-Robot
Backlink Analysis
Bot of the Leipzig-based Unister Holding GmbH
http://moz.com/products
http://www.searchmetrics.com
http://www.majestic12.co.uk/projects/dsearch/mj12bot.php
http://www.domaintools.com/webmasters/surveybot.php
http://www.seodiver.com/bot
http://openlinkprofiler.org/bot
http://www.wotbox.com/bot/
http://www.opensiteexplorer.org/dotbot
http://moz.com/researchtools/ose/dotbot
http://www.meanpath.com/meanpathbot.html
http://www.backlinktest.com/crawler.html
http://www.brandwatch.com/magpie-crawler/
http://filterdb.iss.net/crawler/
http://webmeup-crawler.com
https://megaindex.com/crawler
http://www.cloudservermarket.com
http://www.trendiction.de/de/publisher/bot
http://www.exalead.com
http://www.career-x.de/bot.html
https://www.lipperhey.com/en/about/
https://www.lipperhey.com/en/about/
https://turnitin.com/robot/crawlerinfo.html
http://help.coccoc.com/
ubermetrics-technologies.com
datenbutler.de
http://searchgears.de/uber-uns/crawling-faq.html
http://commoncrawl.org/faq/
https://www.qwant.com/
http://linkfluence.net/
http://www.botje.com/plukkie.htm
https://www.safedns.com/searchbot
http://www.haosou.com/help/help_3_2.html
http://www.haosou.com/help/help_3_2.html
http://www.moz.com/dp/rogerbot
http://www.openhose.org/bot.html
http://www.screamingfrog.co.uk/seo-spider/
http://thumbsniper.com
http://www.radian6.com/crawler
http://cliqz.com/company/cliqzbot
https://www.aihitdata.com/about
http://www.trendiction.com/en/publisher/bot
http://warebay.com/bot.html
http://www.bing.com/
To block SEMrushBot from crawling your site for web graph of links, add:
To remove SEMrushBot from crawling your site for different SEO and technical issues, add:
To block AlphaSeoBot from crawling your site for web graph of links, add:
To remove AlphaSeoBot from crawling your site for different SEO and technical issues, add:
yahoo
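
The header comment in this section notes that robots.txt is honored only at the root of the host (`http://example.com/robots.txt`, not `http://example.com/site/robots.txt`). A minimal Python sketch of deriving that location from any page URL; the function name `robots_url` is illustrative and not part of the report.

```python
from urllib.parse import urlsplit, urlunsplit

def robots_url(page_url):
    """Return the root-level robots.txt URL for the host serving `page_url`."""
    parts = urlsplit(page_url)
    # Keep scheme and host (including any port); drop path, query, and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))
```

Because the host is kept as-is, `zut.tv` and `www.zut.tv` yield different robots.txt URLs, which matches the redirect recorded under Last Successful Scan.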

Warnings

2 invalid lines.
