crux.ninja
robots.txt

Robots Exclusion Standard data for crux.ninja

Resource Scan

Scan Details

Site Domain crux.ninja
Base Domain crux.ninja
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-02-19T19:25:56+00:00
Next Scan 2025-05-20T19:25:56+00:00

Last Successful Scan

Scanned 2024-09-30T19:24:10+00:00
URL https://crux.ninja/robots.txt
Domain IPs 69.164.199.184
Response IP 69.164.199.184
Found Yes
Hash 20284a02d0c9b68c25ec1a8bdc9557904bead1f86914c9d7c28d5098f0e9f92f
SimHash b846d50afaea
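The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not state which algorithm or input was used, so the following is only a sketch under the assumption that the digest is SHA-256 over the raw bytes of the fetched robots.txt. The function name `fingerprint` is hypothetical.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a fetched robots.txt body.

    Assumption: the report's Hash field is SHA-256 over the raw response
    bytes. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Hypothetical body; the real file contents are not reproduced in the report.
    sample = b"User-agent: *\nDisallow: /distfiles/\n"
    print(fingerprint(sample))
```

Comparing such a digest between scans would show whether the file changed byte for byte. The SimHash field presumably serves fuzzy comparison, but its algorithm is not given in the report.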

Groups

*

Rule Path
Disallow /distfiles/

*

Rule Path
Disallow /portdb/

semrushbot

Rule Path
Disallow /

siteauditbot

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

splitsignalbot

Rule Path
Disallow /

semrushbot-ocob

Rule Path
Disallow /

semrush*

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta*

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

facebookexternalhit*

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mj12bot*

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

bytespider*

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot*

Rule Path
Disallow /

botpoke

Rule Path
Disallow /

botpoke*

Rule Path
Disallow /
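How a crawler might evaluate the groups listed above can be sketched as follows. This is a simplified matcher, not the site's or any crawler's actual implementation. It assumes case-insensitive matching on the product token before any `/`, merging of all groups that name the same agent, fallback to the `*` groups, and plain prefix matching on paths. Agent names with a trailing `*` (for example `semrush*`) are treated as literal strings here, since wildcard handling in agent names varies between crawlers. The names `GROUPS`, `disallowed_prefixes`, and `is_allowed` are hypothetical.

```python
# Agents that the report lists with a single "Disallow /" rule.
BLOCKED_AGENTS = [
    "semrushbot", "siteauditbot", "semrushbot-si", "semrushbot-swa",
    "splitsignalbot", "semrushbot-ocob", "semrush*", "petalbot",
    "meta-externalagent", "meta*", "facebookexternalhit",
    "facebookexternalhit*", "mj12bot", "mj12bot*", "bytespider",
    "bytespider*", "dotbot", "dotbot*", "botpoke", "botpoke*",
]

# (user-agent, disallowed path prefixes), transcribed from the Groups section.
GROUPS = [("*", ["/distfiles/"]), ("*", ["/portdb/"])]
GROUPS += [(agent, ["/"]) for agent in BLOCKED_AGENTS]


def disallowed_prefixes(user_agent: str) -> list[str]:
    """Collect Disallow prefixes that apply to the given user agent."""
    # Use only the product token, e.g. "SemrushBot/7" -> "semrushbot".
    token = user_agent.split("/")[0].strip().lower()
    specific = [p for agent, paths in GROUPS
                if agent.lower() == token for p in paths]
    if specific:
        return specific
    # No named group matched: merge every "*" group.
    return [p for agent, paths in GROUPS if agent == "*" for p in paths]


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if no applicable Disallow prefix matches the path."""
    return not any(path.startswith(prefix)
                   for prefix in disallowed_prefixes(user_agent))


if __name__ == "__main__":
    for agent, path in [("examplebot", "/index.html"),
                        ("examplebot", "/portdb/x"),
                        ("SemrushBot/7", "/")]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, an unlisted crawler is kept out of `/distfiles/` and `/portdb/` only, while each named agent is excluded from the whole site. Some parsers honor only the first `*` group rather than merging both, in which case `/portdb/` would not be covered for unlisted crawlers.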

Comments

  • general
  • semrushbot
  • petalbot
  • facebook/meta bot
  • mj12bot?
  • bytespider
  • dotbot
  • BotPoke?