curiosus.app
robots.txt

Robots Exclusion Standard data for curiosus.app

Archived Snapshots

Resource Scan

Scan Details

Site Domain	curiosus.app
Base Domain	curiosus.app
Scan Status	Ok
Last Scan	2025-12-10T22:13:35+00:00
Next Scan	2025-12-17T22:13:35+00:00

Last Scan

Scanned	2025-12-10T22:13:35+00:00
URL	https://curiosus.app/robots.txt
Domain IPs	185.22.110.122
Response IP	185.22.110.122
Found	Yes
Hash	2803ece8a6c9ce60d5621369e4b8384d30a670a54a9614524063d9024b2ed6f3
SimHash	66128811e5a5

Groups

*

Rule	Path
Allow	/

Rule

Path

Allow

/

Other Records

Field	Value
crawl-delay	1

Field

Value

crawl-delay

1

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

/

bingbot

Rule	Path
Allow	/

Rule

Path

Allow

/

slurp

Rule	Path
Allow	/

Rule

Path

Allow

/

duckduckbot

Rule	Path
Allow	/

Rule

Path

Allow

/

gptbot

Rule	Path
Allow	/

Rule

Path

Allow

/

chatgpt-user

Rule	Path
Allow	/

Rule

Path

Allow

/

ccbot

Rule	Path
Allow	/

Rule

Path

Allow

/

anthropic-ai

Rule	Path
Allow	/

Rule

Path

Allow

/

claude-web

Rule	Path
Allow	/

Rule

Path

Allow

/

google-extended

Rule	Path
Allow	/

Rule

Path

Allow

/

Back to top

Other Records

Field	Value
sitemap	https://curiosus.app/sitemap.xml

Field

Value

sitemap

https://curiosus.app/sitemap.xml

Back to top

Comments

robots.txt for Curiosus
Sitemap location
Crawl-delay for polite bots
Allow all major search engines and AI crawlers
AI Crawlers

Back to top

curiosus.approbots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

googlebot

bingbot

slurp

duckduckbot

gptbot

chatgpt-user

ccbot

anthropic-ai

claude-web

google-extended

Other Records

Comments

curiosus.app
robots.txt