startupworld.com
robots.txt

Robots Exclusion Standard data for startupworld.com

Resource Scan

Scan Details

Site Domain startupworld.com
Base Domain startupworld.com
Scan Status Ok
Last Scan2025-10-13T20:06:23+00:00
Next Scan 2025-10-20T20:06:23+00:00

Last Scan

Scanned2025-10-13T20:06:23+00:00
URL https://startupworld.com/robots.txt
Domain IPs 172.66.40.247, 172.66.43.9, 2606:4700:3108::ac42:28f7, 2606:4700:3108::ac42:2b09
Response IP 172.66.43.9
Found Yes
Hash 4be20d028cd2cd0d2ba818f56229f62826cb29cede6ca1cb67a039174796b94f
SimHash 61114b93f212

Groups

*

Rule Path
Disallow /zmn/
Disallow /api/
Disallow /ref/
Disallow /xgz/
Disallow /pages/
Disallow /go/
Disallow /adminx/
Allow /

oai-searchbot

Rule Path
Allow /

chatgpt-user
chatgpt-user/2.0

Rule Path
Allow /

gptbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

claudebot
claude-web
anthropic-ai

Rule Path
Allow /

grok

Rule Path
Allow /

bytespider

Rule Path
Allow /

youbot

Rule Path
Allow /

cohere-ai

Rule Path
Allow /

amazonbot

Rule Path
Allow /

applebot
applebot-extended

Rule Path
Allow /

facebookbot
meta-externalagent

Rule Path
Allow /

ccbot

Rule Path
Allow /

diffbot

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /

semrushbot

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

bingbot

Rule Path
Allow /

microsoftbot

Rule Path
Allow /

slurp

Product Comment
slurp Yahoo
Rule Path
Allow /

duckduckbot

Rule Path
Allow /

yandex

Rule Path
Allow /