trainingcred.com
robots.txt

Robots Exclusion Standard data for trainingcred.com

Resource Scan

Scan Details

Site Domain trainingcred.com
Base Domain trainingcred.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2026-02-26T17:25:58+00:00
Next Scan 2026-03-12T17:25:58+00:00

Last Successful Scan

Scanned2026-01-19T17:25:16+00:00
URL https://trainingcred.com/robots.txt
Domain IPs 172.66.40.104, 172.66.43.152, 2606:4700:3108::ac42:2868, 2606:4700:3108::ac42:2b98
Response IP 172.66.40.104
Found Yes
Hash 272463391e45257d542715a459982e18bb242c403823d59de452f1a2fc96f1f0
SimHash 222a885045aa

Groups

*

Rule Path
Disallow /login/
Disallow /register/
Disallow /dashboard/
Allow /

gptbot

Rule Path
Allow /

ccbot

Rule Path
Allow /

bytespider

Rule Path
Allow /

google-extended

Rule Path
Allow /

meta-externalagent

Rule Path
Allow /

Other Records

Field Value
sitemap https://trainingcred.com/sitemap.xml

Comments

  • robots.txt for https://trainingcred.com/
  • This file defines crawling and content-use permissions.
  • (a) "search=yes" allows indexing for traditional search engines.
  • (b) "ai-input=yes" allows AI systems to retrieve or reference content in real time.
  • (c) "ai-train=yes" allows using content for AI model training or fine-tuning.
  • Content-signal: search=yes,ai-input=yes,ai-train=yes
  • Allow high-volume commercial model scrapers
  • Sitemap location