Robots Exclusion Standard data for tarantula.jp

Resource Scan

Scan Details

Site Domain tarantula.jp
Base Domain tarantula.jp
Scan Status Ok
Last Scan 2024-10-19T03:20:12+00:00
Next Scan 2024-11-18T03:20:12+00:00
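
The two timestamps above imply the rescan interval. As a small worked check (a sketch only; the variable names are illustrative and not part of the report), the difference can be computed directly:

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details rows (ISO 8601 with UTC offset).
last_scan = datetime.fromisoformat("2024-10-19T03:20:12+00:00")
next_scan = datetime.fromisoformat("2024-11-18T03:20:12+00:00")

# Subtracting two aware datetimes yields a timedelta.
rescan_interval = next_scan - last_scan
print(rescan_interval.days)  # 30
```

The report does not state how the scanner schedules rescans; the 30-day figure is only what these two values work out to.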

Last Scan

Scanned 2024-10-19T03:20:12+00:00
URL https://tarantula.jp/robots.txt
Domain IPs 133.242.9.122
Response IP 133.242.9.122
Found Yes
Hash 33f6c3222ddc74c2f5849fc84d6cab1b44516ad880c5f38e8311cf0e39d84688
SimHash 4ad762f0e0ea
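
The Hash value is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not say which algorithm or which bytes were hashed, so the following is a minimal sketch under the assumption that it is SHA-256 over the raw response body. The helper names `sha256_hex` and `body_unchanged` are hypothetical.

```python
import hashlib

# Value copied from the Hash row above.
RECORDED_HASH = "33f6c3222ddc74c2f5849fc84d6cab1b44516ad880c5f38e8311cf0e39d84688"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def body_unchanged(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare a freshly fetched body against the recorded hash.

    Assumption: the scanner hashed the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == recorded.lower()
```

A caller would pass the bytes of a newly fetched https://tarantula.jp/robots.txt to `body_unchanged`. A mismatch could mean the file changed, or simply that the assumption about the hashing scheme does not hold.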

Groups

amazonbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

bidswitchbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

careerbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cloudservermarketspider

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

domain re-animator bot

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

fr-crawler

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

iccrawler

Rule Path
Disallow /

implisensebot

Rule Path
Disallow /

integralads

Rule Path
Disallow /

jet-bot

Rule Path
Disallow /

jobboersebot

Rule Path
Disallow /

jobs.de-robot

Rule Path
Disallow /

kraken

Rule Path
Disallow /

linguee

Rule Path
Disallow /

linkstats

Rule Path
Disallow /

lipperhey-kaus-australis

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

mappy

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

metajobbot

Rule Path
Disallow /

mindupbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

obot

Rule Path
Disallow /

openhosebot

Rule Path
Disallow /

paperlibot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

plista

Rule Path
Disallow /

plukkie

Rule Path
Disallow /

proximic

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

r6_commentreader

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

safednsbot

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seodiver

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

seoscanners.net

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

sg-orbiter

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

sougou web spider

Rule Path
Disallow /

spbot

Rule Path
Disallow /

spiderbot

Rule Path
Disallow /

steeler

Rule Path
Disallow /

stratagems kumo

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

thumbsniper

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

um-ic

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yeti

Rule Path
Disallow /

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

y!j-asr

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://tarantula.jp/s/sitemap.php?p=index
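
To illustrate how a crawler would interpret the groups and records listed above, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a hand-written reconstruction of a small subset of the groups (the raw file is not shown in this report, and user-agent names appear here in the lowercase form the report uses), so treat it as an approximation rather than the site's actual file.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed subset of the groups reported above; not the original file.
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /

User-agent: bingbot
Crawl-delay: 3

User-agent: y!j-asr
Crawl-delay: 1

Sitemap: https://tarantula.jp/s/sitemap.php?p=index
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def summarize(agent: str, url: str = "https://tarantula.jp/") -> dict:
    """Report whether `agent` may fetch `url` and its crawl delay, if any."""
    return {
        "agent": agent,
        "allowed": parser.can_fetch(agent, url),
        "crawl_delay": parser.crawl_delay(agent),
    }


for name in ("gptbot", "claudebot", "bingbot", "y!j-asr"):
    print(summarize(name))

print(parser.site_maps())
```

Under this parser, the groups with `Disallow: /` are blocked from every path, while bingbot and y!j-asr are allowed everywhere and only carry a crawl delay. Other parsers may treat `crawl-delay` differently, since it is not part of the core standard.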

Comments

  • -- set_deny_bot.php --
  • -- http://vps1.8jpn.net/blacklist/bot.php?sign=6711d891470a8 (Thu, 26 Sep 2024 01:33:58 GMT)
  • -- /set_deny_bot.php --

Warnings

  • 4 invalid lines.