npir.org
robots.txt

Robots Exclusion Standard data for npir.org

Resource Scan

Scan Details

Site Domain npir.org
Base Domain npir.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-11-14T17:53:38+00:00
Next Scan 2025-01-13T17:53:38+00:00

Last Successful Scan

Scanned2024-03-20T17:32:03+00:00
URL https://npir.org/robots.txt
Domain IPs 104.26.10.241, 104.26.11.241, 172.67.69.97, 2606:4700:20::681a:af1, 2606:4700:20::681a:bf1, 2606:4700:20::ac43:4561
Response IP 104.26.11.241
Found Yes
Hash 93285c842b476ae29e5bf1328dde310f6f0387d4b982ec5ce145745935696ed6
SimHash 535c114446d3

Groups

ahrefsbot

Rule Path
Disallow /

bananabot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

brightbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

criteobot/0.1

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

dataprovider.com

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

femtosearchbot

Rule Path
Disallow /

getintent

Rule Path
Disallow /

googlezip.net

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

gumgum

Rule Path
Disallow /

ias_crawler

Rule Path
Disallow /

ioncrawl

Rule Path
Disallow /

linespider

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

mappy

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

neevabot

Rule Path
Disallow /

scraperbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

seamonkey

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

surdotlybot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

wget

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yeti

Rule Path
Disallow /

Other Records

Field Value
sitemap https://npir.org/sitemap.xml

Comments

  • robots.txt for https://npir.org