net14.org
robots.txt

Robots Exclusion Standard data for net14.org

Resource Scan

Scan Details

Site Domain net14.org
Base Domain net14.org
Scan Status Ok
Last Scan2025-09-19T09:58:58+00:00
Next Scan 2025-10-19T09:58:58+00:00

Last Scan

Scanned2025-09-19T09:58:58+00:00
URL https://net14.org/robots.txt
Domain IPs 104.21.88.60, 172.67.173.80, 2606:4700:3037::6815:583c, 2606:4700:3037::ac43:ad50
Response IP 104.21.88.60
Found Yes
Hash 23c37030ccfed3fe460c897caf4c7c7718c6c468e17ee3257e17293f03c63bcf
SimHash 50174300809a

Groups

oai-searchbot
velenpublicwebcrawler
gptbot
chatgpt-user
applebot
facebookexternalhit
peer39_crawler
criteobot

Rule Path
Disallow /

perplexitybot
amazonbot
claudebot
omgilibot
facebookbot
anthropic-ai
bytespider
diffbot
semrushbot
imagesiftbot
omgili
youbot
ccbot
piplbot
senutobot
shortpixel
bytedance
meta-externalagent
petalbot
seznambot
mechanize
mj12bot
dotbot

Rule Path
Disallow /

Comments

  • Block All Other Bots from Entire Site