npckc.site
robots.txt

Robots Exclusion Standard data for npckc.site

Resource Scan

Scan Details

Site Domain npckc.site
Base Domain npckc.site
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-12-09T11:53:15+00:00
Next Scan 2026-01-08T11:53:15+00:00

Last Successful Scan

Scanned2025-07-24T16:45:28+00:00
URL http://npckc.site/robots.txt
Redirect https://npckc.net/robots.txt
Redirect Domain npckc.net
Redirect Base npckc.net
Domain IPs 107.161.23.204, 198.251.81.30, 209.141.38.71
Redirect IPs 198.51.233.1, 2620:2:6000::bad:dab:cafe
Response IP 198.51.233.1
Found Yes
Hash ddb68297127bc9f63e405c6db2c907501c395f8ffe7994de084d04677735a0dc
SimHash 769f4941c1c4

Groups

*

Rule Path
Disallow

ai2bot
ai2bot-dolma
amazonbot
anthropic-ai
applebot
applebot-extended
brightbot 1.0
bytespider
ccbot
chatgpt-user
claude-web
claudebot
cohere-ai
cohere-training-data-crawler
crawlspace
diffbot
duckassistbot
facebookbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
isscyberriskcrawler
kangaroo bot
meta-externalagent
meta-externalfetcher
oai-searchbot
omgili
omgilibot
pangubot
perplexitybot
petalbot
scrapy
semrushbot-ocob
semrushbot-swa
sidetrade indexer bot
timpibot
velenpublicwebcrawler
webzio-extended
youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://npckc.net/sitemap.xml