thinkzon.com
robots.txt

Robots Exclusion Standard data for thinkzon.com

Resource Scan

Scan Details

Site Domain thinkzon.com
Base Domain thinkzon.com
Scan Status Ok
Last Scan 2024-06-13T11:33:21+00:00
Next Scan 2024-06-20T11:33:21+00:00

Last Scan

Scanned 2024-06-13T11:33:21+00:00
URL https://thinkzon.com/robots.txt
Redirect https://www.thinkzon.com/robots.txt
Redirect Domain www.thinkzon.com
Redirect Base thinkzon.com
Domain IPs 222.122.6.226
Redirect IPs 222.122.6.226
Response IP 222.122.6.226
Found Yes
Hash 1539706532ea6db4713025c34c4a758bf7e43a32b6e7fe69cdd7db87c5af4f20
SimHash 5a0ff142f343
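
The Hash value above is 64 hexadecimal digits, which is the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the following is only a minimal sketch, assuming SHA-256 over the raw response body; the function name content_fingerprint and the sample body are hypothetical.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw bytes. The scan report does not
    confirm either the algorithm or the exact input.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Hypothetical sample body, not the real thinkzon.com file.
    sample = b"User-agent: *\nDisallow: /member/\n"
    digest = content_fingerprint(sample)
    print(digest, len(digest))
```

Comparing this digest between two scans would show whether the file changed at all; it would not show how much, which is presumably what the separate SimHash field is for.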

Groups

*

Rule Path
Disallow /stats/click_visit_db.php
Disallow /stats/check_link.php
Disallow /search/result_count_ajax.php
Disallow /search/index_v2022.php
Disallow /notice/ck.php
Disallow /mypage/
Disallow /share/blog/
Disallow /z_n/mail/
Disallow /member/

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

discoverybot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot/1.1

Rule Path
Disallow /

exabot

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

netestate ne crawler (+http://www.website-datenbank.de/)

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

buibui-bot

Rule Path
Disallow /

buibui-bot/1.0

Rule Path
Disallow /

stq_bot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

exabot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

vscooter

Rule Path
Disallow /

psbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

yandex

Rule Path
Disallow /

taptubot

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

infopath

Rule Path
Disallow /

infopath.2

Rule Path
Disallow /

swebot

Rule Path
Disallow /

ec2linkfinder

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

searchmetericsbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

ip-web-crawler.com

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

spbot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

a6-indexer

Rule Path
Disallow /

riddler

Rule Path
Disallow /

loadtimebot

Rule Path
Disallow /

obot

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

gtwek

Rule Path
Disallow /

truebot/1.0

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

icc-crawler/2.0

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow /

psbot

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /
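
The groups above can be checked against a crawler name with Python's standard urllib.robotparser. This is a minimal sketch rather than the scanner's own logic: ROBOTS_TXT is a partial reconstruction from the listed rules (the real file's casing, order, and comments may differ), and "examplebot" and is_allowed are hypothetical names used for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumption: a partial reconstruction of the scanned file, rebuilt from the
# groups listed in this report. Only a few rules are included.
ROBOTS_TXT = """\
User-agent: *
Disallow: /stats/click_visit_db.php
Disallow: /mypage/
Disallow: /member/

User-agent: ahrefsbot
Disallow: /
"""


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if user_agent may fetch path under ROBOTS_TXT."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    # The scan shows the file is served from www.thinkzon.com after a redirect.
    return parser.can_fetch(user_agent, "https://www.thinkzon.com" + path)


if __name__ == "__main__":
    checks = [
        ("examplebot", "/"),                 # hypothetical crawler, falls under *
        ("examplebot", "/mypage/settings"),  # blocked by the * group
        ("AhrefsBot", "/"),                  # blocked by its own group
    ]
    for agent, path in checks:
        print(agent, path, is_allowed(agent, path))
```

A crawler with its own group is governed by that group alone, so the named bots here are shut out of the whole site, while any other crawler is only kept away from the paths in the * group. Python's parser matches agent names case-insensitively, so the lowercase names shown in the report still match mixed-case user agents.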

Comments

  • robots.txt
  • -----------end
  • Disallow LIST
  • http://ahrefs.com/robot/
  • User-agent: archive.org_bot
  • Disallow: /
  • https://archive.org/details/archive.org_bot
  • User-agent: ia_archiver
  • Disallow: /
  • User-agent: ia_archiver-web.archive.org
  • Disallow: /
  • http://webmeup-crawler.com/
  • http://www.discoveryengine.com/discoverybot.html
  • http://www.opensiteexplorer.org/dotbot
  • http://www.exalead.com/search/webmasterguide
  • https://asv.informatik.uni-leipzig.de/
  • http://www.majestic12.co.uk/bot.php
  • Block netEstate NE Crawler (+http://www.website-datenbank.de/)
  • http://www.searchmetrics.com/en/searchmetrics-bot/
  • http://www.seokicks.de/robot.html
  • http://www.sogou.com/docs/help/webmasters.htm#07
  • http://help.yandex.com/search/robots/agent.xml
  • http://www.warebay.com/bot.html
  • http://www.searchteq.de/ (not that this bot honors the file; listed here for now, ahead of a firewall block)
  • http://www.semrush.com/bot.html
  • http://riddler.io/about
  • http://www.exalead.com/search/webmasterguide
  • http://www.exabot.com/go/robot
  • http://www.linkpad.ru
  • https://www.similartech.com/smtbot
  • http://kc.nict.go.jp/project1/crawl.html
  • https://www.linkpad.co.ru
  • https://www.majestic12.co.uk/projects/dsearch/mj12bot.php
  • http://law.di.unimi.it/BUbiNG.html#wc
  • https://www.feitsui.com/en/article/32
  • Image Bot
  • -----------------------------
  • robots.txt
  • http://www.robotstxt.org/robotstxt.html
  • Site registration: http://www.google.co.kr/intl/ko//add_url.html
  • Reference: http://www.google.co.kr/intl/ko/remove.html
  • http://store.nikon.co.uk/robots.txt