/.well-known/

npckc.site
robots.txt

Robots Exclusion Standard data for npckc.site

Resource Scan

Scan Details

Site Domain	npckc.site
Base Domain	npckc.site
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2025-12-09T11:53:15+00:00
Next Scan	2026-01-08T11:53:15+00:00

Last Successful Scan

Scanned	2025-07-24T16:45:28+00:00
URL	http://npckc.site/robots.txt
Redirect	https://npckc.net/robots.txt
Redirect Domain	npckc.net
Redirect Base	npckc.net
Domain IPs	107.161.23.204, 198.251.81.30, 209.141.38.71
Redirect IPs	198.51.233.1, 2620:2:6000::bad:dab:cafe
Response IP	198.51.233.1
Found	Yes
Hash	ddb68297127bc9f63e405c6db2c907501c395f8ffe7994de084d04677735a0dc
SimHash	769f4941c1c4
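
The 64-hex-digit Hash value above is consistent with a SHA-256 digest, although the page does not name the algorithm. A minimal sketch, assuming SHA-256 over the raw response body; `content_hash` is a hypothetical helper and the sample body is illustrative, not the actual file contents.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


sample_body = b"User-agent: *\nDisallow:\n"  # illustrative bytes only
digest = content_hash(sample_body)
print(len(digest))  # 64 hex characters, the same length as the Hash field
```

Comparing digests between scans is one simple way to detect that a robots.txt file has changed; it says nothing about how much it changed, which is presumably what the shorter SimHash field is for.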

Groups

*

Rule	Path
Disallow	(empty)

ai2bot
ai2bot-dolma
amazonbot
anthropic-ai
applebot
applebot-extended
brightbot 1.0
bytespider
ccbot
chatgpt-user
claude-web
claudebot
cohere-ai
cohere-training-data-crawler
crawlspace
diffbot
duckassistbot
facebookbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
isscyberriskcrawler
kangaroo bot
meta-externalagent
meta-externalfetcher
oai-searchbot
omgili
omgilibot
pangubot
perplexitybot
petalbot
scrapy
semrushbot-ocob
semrushbot-swa
sidetrade indexer bot
timpibot
velenpublicwebcrawler
webzio-extended
youbot

Rule	Path
Disallow	/
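
As a rough illustration of how a crawler might interpret the two groups above, the sketch below rebuilds an equivalent robots.txt and queries it with Python's standard `urllib.robotparser`. The reconstructed file text is an assumption (the original file is not reproduced on this page), the agent list is abbreviated to a few entries from the listing, and the helper name `build_robots_txt` and the example page URL are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated subset of the blocked agents listed above (not the full list).
BLOCKED_AGENTS = ["gptbot", "ccbot", "claudebot", "bytespider", "perplexitybot"]


def build_robots_txt(blocked_agents):
    """Render an assumed robots.txt equivalent to the two groups shown above."""
    # Group 1: every agent, with an empty Disallow (nothing is blocked).
    lines = ["User-agent: *", "Disallow:", ""]
    # Group 2: the named agents share a single "Disallow: /" rule.
    lines += [f"User-agent: {agent}" for agent in blocked_agents]
    lines += ["Disallow: /", "", "Sitemap: https://npckc.net/sitemap.xml"]
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt(BLOCKED_AGENTS).splitlines())

page = "https://npckc.net/some-page"  # hypothetical URL, for illustration only
gptbot_allowed = parser.can_fetch("GPTBot", page)
other_allowed = parser.can_fetch("SomeOtherCrawler", page)
print(gptbot_allowed, other_allowed)  # prints: False True
```

An agent named in the second group matches that group and is denied every path, while any other agent falls through to the `*` group, whose empty Disallow permits everything.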

Other Records

Field	Value
sitemap	https://npckc.net/sitemap.xml
