missings.ca
robots.txt

Robots Exclusion Standard data for missings.ca

Resource Scan

Scan Details

Site Domain missings.ca
Base Domain missings.ca
Scan Status Ok
Last Scan 2024-10-03T19:39:33+00:00
Next Scan 2024-10-10T19:39:33+00:00

Last Scan

Scanned 2024-10-03T19:39:33+00:00
URL https://www.missings.ca/robots.txt
Domain IPs 23.22.5.68, 3.226.182.14, 52.21.227.162, 54.237.159.171
Response IP 23.22.5.68
Found Yes
Hash c3ae3f60c08e9bee87a507986aaf4856d02ea7bce9119f891e52ecec8feb5b8f
SimHash 387752529912

Groups

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /

red

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

seekport

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

scoutjet

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

lipperhey spider

Rule Path
Disallow /

exabot

Rule Path
Disallow /

ncbot

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

pagesinventory

Rule Path
Disallow /

aboundexbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

majestic-12

Rule Path
Disallow /

majestic-seo

Rule Path
Disallow /

dsearch

Rule Path
Disallow /

mj12

Rule Path
Disallow /

blekkobot

Rule Path
Disallow /

nerdybot

Rule Path
Disallow /

jamesbot

Rule Path
Disallow /

tineye

Rule Path
Disallow /

serpstat

Rule Path
Disallow /

spyfu

Rule Path
Disallow /

prlog

Rule Path
Disallow /

*

Rule Path
Disallow /dashboard
Disallow /admin/

Other Records

Field Value
sitemap https://www.missings.ca/sitemap.xml
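
The groups and records above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser. The ROBOTS_TXT string is a hypothetical excerpt rebuilt from a few of the groups listed in this report, not the actual file served by the site, and the allowed() helper and the "SomeOtherBot" agent name are illustrative assumptions.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt rebuilt from the groups in this report.
# The real file may differ in ordering, casing, and comments.
ROBOTS_TXT = """\
User-agent: baiduspider
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: *
Disallow: /dashboard
Disallow: /admin/

Sitemap: https://www.missings.ca/sitemap.xml
"""

BASE_URL = "https://www.missings.ca"

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made here.
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Illustrative helper: may `agent` fetch `path` under the excerpt above?"""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    for agent, path in [
        ("AhrefsBot", "/"),
        ("baiduspider", "/sitemap.xml"),
        ("SomeOtherBot", "/dashboard"),
        ("SomeOtherBot", "/admin/users"),
        ("SomeOtherBot", "/about"),
    ]:
        print(agent, path, allowed(agent, path))
    print(parser.site_maps())
```

Agents with their own group are refused every path, while any other agent falls through to the * group and is refused only paths that begin with /dashboard or /admin/. Note that urllib.robotparser matches agent names case-insensitively and paths by simple prefix, which may not match every crawler's own interpretation of the file.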

Comments

  • Block Baidu
  • http://help.baidu.com/question?prod_en=master&class=Baiduspider
  • Block Yandex
  • https://yandex.com/support/webmaster/controlling-robot/robots-txt.xml
  • Block Redbot
  • https://redbot.org/
  • Block Dotbot
  • https://wowrack.org/
  • Block Semrush
  • https://www.semrush.com/bot/
  • Block TrovitBot
  • https://www.trovit.com/bot.html
  • Block RogerBot
  • https://moz.com/help/moz-procedures/crawlers/rogerbot
  • Block SeekPort
  • http://seekport.com/
  • Generic bot rules