m.yesform.com
robots.txt

Robots Exclusion Standard data for m.yesform.com

Resource Scan

Scan Details

Site Domain m.yesform.com
Base Domain yesform.com
Scan Status Ok
Last Scan2024-09-21T06:52:08+00:00
Next Scan 2024-10-05T06:52:08+00:00

Last Scan

Scanned2024-09-21T06:52:08+00:00
URL https://m.yesform.com/robots.txt
Domain IPs 222.122.6.226
Response IP 222.122.6.226
Found Yes
Hash 8a3447212f053a67b299ca9b612b4d62dbac442c767e7174232db769731d3201
SimHash ca1efd42f3c3

Groups

*

Rule Path
Disallow /qrcode/
Disallow /active/
Disallow /excel/
Disallow /bizmail/
Disallow /insamal/
Disallow /stamp/
Disallow /pop/
Disallow /premium/
Disallow /plan/
Disallow /contract/
Disallow /resume/
Disallow /powerpoint/
Disallow /english/
Disallow /docs/
Disallow /newbiz/
Disallow /shop/
Disallow /counsel/
Disallow /ir/
Disallow /speech/
Disallow /freeimage/
Disallow /contents/
Disallow /auto/
Disallow /free/
Disallow /sample_bizf/
Disallow /sample-bizf/
Disallow /sample_ir/
Disallow /sample-ir/
Disallow /sample_share/
Disallow /sample-share/
Disallow /sample_fshr/
Disallow /sample-fshr/
Disallow /sample_bizpr/
Disallow /sample-bizpr/

*

Rule Path
Disallow /form/

*

Rule Path
Disallow /form/A11/

*

Rule Path
Disallow /form/A14/

*

Rule Path
Disallow /form/A15/

*

Rule Path
Disallow /form/A17/

*

Rule Path
Disallow /form/A18/

*

Rule Path
Disallow /form/A19/

*

Rule Path
Disallow /form/A20/

*

Rule Path
Disallow /form/A21/

*

Rule Path
Disallow /form/A22/

*

Rule Path
Disallow /form/A23/

*

Rule Path
Disallow /form/A24/

*

Rule Path
Disallow /form/A25/

*

Rule Path
Disallow /form/A26/

*

Rule Path
Disallow /form/A27/

*

Rule Path
Disallow /form/A28/

*

Rule Path
Disallow /form/A29/

*

Rule Path
Disallow /form/A30/

*

Rule Path
Disallow /form/A31/

*

Rule Path
Disallow /bizf/

*

Rule Path
Disallow /buse/

*

Rule Path
Disallow /corp/

*

Rule Path
Disallow /bizpr/

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

discoverybot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot/1.1

Rule Path
Disallow /

exabot

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

netestate ne crawler (+http://www.website-datenbank.de/)

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

buibui-bot

Rule Path
Disallow /

buibui-bot/1.0

Rule Path
Disallow /

stq_bot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

exabot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

vscooter

Rule Path
Disallow /

psbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

yandex

Rule Path
Disallow /

taptubot

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

infopath

Rule Path
Disallow /

infopath.2

Rule Path
Disallow /

swebot

Rule Path
Disallow /

ec2linkfinder

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

searchmetericsbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

ip-web-crawler.com

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

spbot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

a6-indexer

Rule Path
Disallow /

riddler

Rule Path
Disallow /

loadtimebot

Rule Path
Disallow /

obot

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

gtwek

Rule Path
Disallow /

truebot/1.0

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

icc-crawler/2.0

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow /

psbot

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

Comments

  • robots.txt
  • User-agent: *
  • Disallow: /view/
  • -----------end
  • Disallow LIST
  • http://ahrefs.com/robot/
  • User-agent: archive.org_bot
  • Disallow: /
  • https://archive.org/details/archive.org_bot
  • User-agent: ia_archiver
  • Disallow: /
  • User-agent: ia_archiver-web.archive.org
  • Disallow: /
  • http://webmeup-crawler.com/
  • http://www.discoveryengine.com/discoverybot.html
  • http://www.opensiteexplorer.org/dotbot
  • http://www.exalead.com/search/webmasterguide
  • https://asv.informatik.uni-leipzig.de/
  • http://www.majestic12.co.uk/bot.php
  • Block netEstate NE Crawler (+http://www.website-datenbank.de/)
  • http://www.searchmetrics.com/en/searchmetrics-bot/
  • http://www.seokicks.de/robot.html
  • http://www.sogou.com/docs/help/webmasters.htm#07
  • http://help.yandex.com/search/robots/agent.xml
  • http://www.warebay.com/bot.html
  • http://www.searchteq.de/ º¸°í Çϴ°ÍÀº ¾Æ´Ô. ¿ì¼± ¹æÈ­º® Â÷´ÜÀü¿¡ ³Ö¾îµÐ´Ù.
  • http://www.semrush.com/bot.html
  • http://riddler.io/about
  • http://www.exalead.com/search/webmasterguide
  • http://www.exabot.com/go/robot
  • http://www.linkpad.ru
  • https://www.similartech.com/smtbot
  • http://kc.nict.go.jp/project1/crawl.html
  • https://www.linkpad.co.ru
  • https://www.majestic12.co.uk/projects/dsearch/mj12bot.php
  • http://law.di.unimi.it/BUbiNG.html#wc
  • https://www.feitsui.com/en/article/32
  • Image Bot
  • -----------------------------
  • robots.txt
  • http://www.robotstxt.org/robotstxt.html
  • »çÀÌÆ®µî·Ï: http://www.google.co.kr/intl/ko//add_url.html
  • Âü°í: http://www.google.co.kr/intl/ko/remove.html
  • http://store.nikon.co.uk/robots.txt