npir.org
robots.txt

Robots Exclusion Standard data for npir.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	npir.org
Base Domain	npir.org
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-11-14T17:53:38+00:00
Next Scan	2025-01-13T17:53:38+00:00

Last Successful Scan

Scanned	2024-03-20T17:32:03+00:00
URL	https://npir.org/robots.txt
Domain IPs	104.26.10.241, 104.26.11.241, 172.67.69.97, 2606:4700:20::681a:af1, 2606:4700:20::681a:bf1, 2606:4700:20::ac43:4561
Response IP	104.26.11.241
Found	Yes
Hash	93285c842b476ae29e5bf1328dde310f6f0387d4b982ec5ce145745935696ed6
SimHash	535c114446d3

Groups

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

bananabot

Rule	Path
Disallow	/

Rule

Path

Disallow

barkrowler

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

brightbot

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

coccocbot-web

Rule	Path
Disallow	/

Rule

Path

Disallow

criteobot/0.1

Rule	Path
Disallow	/

Rule

Path

Disallow

dataforseobot

Rule	Path
Disallow	/

Rule

Path

Disallow

dataprovider.com

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot

Rule	Path
Disallow	/

Rule

Path

Disallow

femtosearchbot

Rule	Path
Disallow	/

Rule

Path

Disallow

getintent

Rule	Path
Disallow	/

Rule

Path

Disallow

googlezip.net

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

grapeshot

Rule	Path
Disallow	/

Rule

Path

Disallow

gumgum

Rule	Path
Disallow	/

Rule

Path

Disallow

ias_crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

ioncrawl

Rule	Path
Disallow	/

Rule

Path

Disallow

linespider

Rule	Path
Disallow	/

Rule

Path

Disallow

mail.ru_bot

Rule	Path
Disallow	/

Rule

Path

Disallow

mappy

Rule	Path
Disallow	/

Rule

Path

Disallow

mauibot

Rule	Path
Disallow	/

Rule

Path

Disallow

megaindex.ru

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

neevabot

Rule	Path
Disallow	/

Rule

Path

Disallow

scraperbot

Rule	Path
Disallow	/

Rule

Path

Disallow

scrapy

Rule	Path
Disallow	/

Rule

Path

Disallow

seamonkey

Rule	Path
Disallow	/

Rule

Path

Disallow

seekportbot

Rule	Path
Disallow	/

Rule

Path

Disallow

seznambot

Rule	Path
Disallow	/

Rule

Path

Disallow

surdotlybot

Rule	Path
Disallow	/

Rule

Path

Disallow

velenpublicwebcrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

wget

Rule	Path
Disallow	/

Rule

Path

Disallow

yandex

Rule	Path
Disallow	/

Rule

Path

Disallow

yeti

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://npir.org/sitemap.xml

Field

Value

sitemap

https://npir.org/sitemap.xml

Comments

robots.txt for https://npir.org

npir.orgrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

ahrefsbot

bananabot

barkrowler

blexbot

brightbot

bytespider

coccocbot-web

criteobot/0.1

dataforseobot

dataprovider.com

dotbot

femtosearchbot

getintent

googlezip.net

gptbot

grapeshot

gumgum

ias_crawler

ioncrawl

linespider

mail.ru_bot

mappy

mauibot

megaindex.ru

mj12bot

neevabot

scraperbot

scrapy

seamonkey

seekportbot

seznambot

surdotlybot

velenpublicwebcrawler

wget

yandex

yeti

Other Records

Comments

npir.org
robots.txt