topia.io
robots.txt

Robots Exclusion Standard data for topia.io

Archived Snapshots

Resource Scan

Scan Details

Site Domain	topia.io
Base Domain	topia.io
Scan Status	Ok
Last Scan	2025-10-27T12:44:43+00:00
Next Scan	2025-11-26T12:44:43+00:00

Last Scan

Scanned	2025-10-27T12:44:43+00:00
URL	https://topia.io/robots.txt
Redirect	https://topia-website.webflow.io/robots.txt
Redirect Domain	topia-website.webflow.io
Redirect Base	webflow.io
Domain IPs	104.26.4.61, 104.26.5.61, 172.67.71.171, 2606:4700:20::681a:43d, 2606:4700:20::681a:53d, 2606:4700:20::ac43:47ab
Redirect IPs	104.18.36.248, 172.64.151.8, 2606:4700:440c::ac40:9708, 2a06:98c1:3100::6812:24f8
Response IP	172.64.151.8
Found	Yes
Hash	8eefc42e6fa6a1b64d2da9d96b2a90564a3b99721b7df605ac6e078856076481
SimHash	05159cd0ede5

Groups

*

Rule	Path
Allow	/

Rule

Path

Allow

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

googleother

Rule	Path
Allow	/

Rule

Path

Allow

bingbot

Rule	Path
Allow	/

Rule

Path

Allow

bingpreview

Rule	Path
Allow	/

Rule

Path

Allow

duckduckbot

Rule	Path
Allow	/

Rule

Path

Allow

applebot

Rule	Path
Allow	/

Rule

Path

Allow

oai-searchbot

Rule	Path
Allow	/

Rule

Path

Allow

chatgpt-user

Rule	Path
Allow	/

Rule

Path

Allow

claudebot

Rule	Path
Allow	/

Rule

Path

Allow

perplexitybot

Rule	Path
Allow	/

Rule

Path

Allow

perplexity-user

Rule	Path
Allow	/

Rule

Path

Allow

ccbot

Rule	Path
Allow	/

Rule

Path

Allow

google-extended

Rule	Path
Allow	/

Rule

Path

Allow

applebot-extended

Rule	Path
Allow	/

Rule

Path

Allow

gptbot

Rule	Path
Allow	/

Rule

Path

Allow

amazonbot-extended

Rule	Path
Allow	/

Rule

Path

Allow

bytespider

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
sitemap	https://topia.io/sitemap.xml
sitemap	https://schoolspace.io/sitemap.xml

Field

Value

sitemap

https://topia.io/sitemap.xml

sitemap

https://schoolspace.io/sitemap.xml

Comments

robots.txt for https://topia.io
Goal: maximize search and AI retrieval crawlability while disallowing AI model training
Default: allow everything for standard web indexing crawlers
Core search crawlers
OpenAI - search and live fetch
Anthropic
Perplexity
Common Crawl and others
Optional sensitive areas
Disallow: /admin/
Disallow: /account/
Disallow: /api/private/

topia.iorobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

googlebot

googleother

bingbot

bingpreview

duckduckbot

applebot

oai-searchbot

chatgpt-user

claudebot

perplexitybot

perplexity-user

ccbot

google-extended

applebot-extended

gptbot

amazonbot-extended

bytespider

Other Records

Comments

topia.io
robots.txt