nodeguardians.io
robots.txt

Robots Exclusion Standard data for nodeguardians.io

Resource Scan

Scan Details

Site Domain nodeguardians.io
Base Domain nodeguardians.io
Scan Status Ok
Last Scan 5/15/2025, 1:54:28 AM
Next Scan 6/14/2025, 1:54:28 AM

Last Scan

Scanned 5/15/2025, 1:54:28 AM
URL https://nodeguardians.io/robots.txt
Domain IPs 34.147.6.185
Response IP 34.147.6.185
Found Yes
Hash e7027578a585e9d6645126eeb564eef0c774864f899ad18c8a21c4d67113f588
SimHash 4d0482220eb0

Groups

*

Rule Path
Allow /

openai gpt

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google gemini

Rule Path
Disallow /

anthropic

Rule Path
Disallow /

istariai

Rule Path
Disallow /

Other Records

Field Value
sitemap https://nodeguardians.io/sitemap.xml
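
The groups and sitemap record listed above can be checked with Python's standard-library `urllib.robotparser`. This is a minimal sketch, not the scanner's own method: the `ROBOTS_TXT` text is an assumed reconstruction from the report (the real file may differ in casing, comments, and the unrecognized `host` line, which is left out here), and `build_parser` and `ExampleBrowser` are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned file from the groups in this report.
# The real robots.txt may differ in casing and comments, and it also contains
# a `host` line whose value the report does not show, so it is omitted.
ROBOTS_TXT = """\
User-agent: *
Allow: /

User-agent: openai gpt
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: google gemini
Disallow: /

User-agent: anthropic
Disallow: /

User-agent: istariai
Disallow: /

Sitemap: https://nodeguardians.io/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
page = "https://nodeguardians.io/"

# "ExampleBrowser" is a made-up agent that matches no named group,
# so it falls through to the `*` group.
for agent in ("GPTBot", "anthropic", "ExampleBrowser"):
    print(agent, parser.can_fetch(agent, page))

# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

Under these assumptions, the named AI crawler groups are denied the whole site while any agent without its own group is allowed by the `*` group. Other robots.txt parsers may match user-agent names differently, so results for the group names containing spaces could vary.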

Comments

  • *
  • OpenAI GPT
  • GPTBot
  • Google Gemini
  • Anthropic
  • IstariAI
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.