n0i.net
robots.txt

Robots Exclusion Standard data for n0i.net

Resource Scan

Scan Details

Site Domain n0i.net
Base Domain n0i.net
Scan Status Ok
Last Scan2024-06-21T00:55:03+00:00
Next Scan 2024-07-21T00:55:03+00:00

Last Scan

Scanned2024-06-21T00:55:03+00:00
URL https://www.n0i.net/robots.txt
Domain IPs 79.112.123.129
Response IP 79.112.123.129
Found Yes
Hash 8a5e90ad075f32d38c942251c0f5c8f9f13a0f32af6ec23b617e27ca08a52019
SimHash 511658f0be91

Groups

acapbot

Rule Path
Disallow /

adsbot

Rule Path
Disallow /

advanced email extractor

Rule Path
Disallow /

alphabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

aihitdata

Rule Path
Disallow /

awariobot
awariorssbot
awariosmartbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

builtwith

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

catexplorador

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

checkmarknetwork

Rule Path
Disallow /

cipacrawler

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

cloud mapping experiment

Rule Path
Disallow /

coccocbot

Rule Path
Disallow /

coccocbot-image

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

dataprovider.com

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

daum

Rule Path
Disallow /

deusu

Rule Path
Disallow /

domaincheck.io crawler

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

experibot

Rule Path
Disallow /

extlinksbot

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

findxbot

Rule Path
Disallow /

garlikcrawler

Rule Path
Disallow /

gowikibot

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

iplexx spider

Rule Path
Disallow /

jobboersebot

Rule Path
Disallow /

jorgee

Rule Path
Disallow /

lightspeedsystemscrawler

Rule Path
Disallow /

linguee

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

masscan

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

netcraftsurveyagent

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

nimbostratus-bot

Rule Path
Disallow /

nsrbot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

obot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

plukkie

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

researchscan

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

safednsbot

Rule Path
Disallow /

scan4mail

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

spbot

Rule Path
Disallow /

surdotlybot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

vebidoobot

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

wada.vn

Rule Path
Disallow /

www.probethenet.com scanner

Rule Path
Disallow /

woobot

Rule Path
Disallow /

woorank

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

yahoo! slurp

Rule Path
Disallow /

zgrab

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

*

Rule Path
Disallow /archive
Disallow /gfx
Disallow /.honey
Disallow *.rpm$
Disallow /mediainfo

Other Records

Field Value
crawl-delay 16

Warnings

  • 2 invalid lines.