trainingcred.com
robots.txt

Robots Exclusion Standard data for trainingcred.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	trainingcred.com
Base Domain	trainingcred.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2026-02-26T17:25:58+00:00
Next Scan	2026-03-12T17:25:58+00:00

Last Successful Scan

Scanned	2026-01-19T17:25:16+00:00
URL	https://trainingcred.com/robots.txt
Domain IPs	172.66.40.104, 172.66.43.152, 2606:4700:3108::ac42:2868, 2606:4700:3108::ac42:2b98
Response IP	172.66.40.104
Found	Yes
Hash	272463391e45257d542715a459982e18bb242c403823d59de452f1a2fc96f1f0
SimHash	222a885045aa

Groups

*

Rule	Path
Disallow	/login/
Disallow	/register/
Disallow	/dashboard/
Allow	/

Rule

Path

Disallow

/login/

Disallow

/register/

Disallow

/dashboard/

Allow

/

gptbot

Rule	Path
Allow	/

Rule

Path

Allow

/

ccbot

Rule	Path
Allow	/

Rule

Path

Allow

/

bytespider

Rule	Path
Allow	/

Rule

Path

Allow

/

google-extended

Rule	Path
Allow	/

Rule

Path

Allow

/

meta-externalagent

Rule	Path
Allow	/

Rule

Path

Allow

/

Back to top

Other Records

Field	Value
sitemap	https://trainingcred.com/sitemap.xml

Field

Value

sitemap

https://trainingcred.com/sitemap.xml

Back to top

Comments

robots.txt for https://trainingcred.com/
This file defines crawling and content-use permissions.
(a) "search=yes" allows indexing for traditional search engines.
(b) "ai-input=yes" allows AI systems to retrieve or reference content in real time.
(c) "ai-train=yes" allows using content for AI model training or fine-tuning.
Content-signal: search=yes,ai-input=yes,ai-train=yes
Allow high-volume commercial model scrapers
Sitemap location

Back to top

trainingcred.comrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

gptbot

ccbot

bytespider

google-extended

meta-externalagent

Other Records

Comments

trainingcred.com
robots.txt