tracxn.com
robots.txt

Robots Exclusion Standard data for tracxn.com

Resource Scan

Scan Details

Site Domain tracxn.com
Base Domain tracxn.com
Scan Status Ok
Last Scan 2024-09-23T21:04:11+00:00
Next Scan 2024-10-07T21:04:11+00:00

Last Scan

Scanned 2024-09-23T21:04:11+00:00
URL https://tracxn.com/robots.txt
Domain IPs 18.155.68.30, 18.155.68.47, 18.155.68.92, 18.155.68.98, 2600:9000:23d2:400:15:54a:8980:93a1, 2600:9000:23d2:6800:15:54a:8980:93a1, 2600:9000:23d2:7a00:15:54a:8980:93a1, 2600:9000:23d2:8a00:15:54a:8980:93a1, 2600:9000:23d2:9e00:15:54a:8980:93a1, 2600:9000:23d2:e00:15:54a:8980:93a1, 2600:9000:23d2:ee00:15:54a:8980:93a1, 2600:9000:23d2:f000:15:54a:8980:93a1
Response IP 18.155.68.92
Found Yes
Hash dbb496f74e5d45773f7a577411da8aa2faba33f8ced6d6c41140473f079590c1
SimHash 4514c335c087

Groups

google-extended
chatgpt-user
gptbot
anthropic-ai
claudebot
claude-web
perplexitybot
cohere-ai
googlebot
bingbot
msnbot
slurp
applebot
baiduspider
duckduckbot
yandexbot
exabot
teoma
aolbuild
twitterbot
rogerbot
deepcrawl
sogou web spider
sogou inst spider
ecosia
naver
naverbot
naverrobot
seznambot
googleother

Rule Path
Allow /$
Allow /b/media$
Allow /callback$
Allow /contactsales$
Allow /contactus$
Allow /contributionspolicy$
Allow /cookiesbythirdpartyservices$
Allow /demo$
Allow /listyourstartup$
Allow /login$
Allow /p/reports$
Allow /pricing
Allow /privacypolicy$
Allow /tracxninvestors$
Allow /gdpr$
Allow /cookiepolicy$
Allow /emailpolicy$
Allow /termsofuse$
Allow /signup$
Allow /sectors
Allow /d*
Allow /d/*/*/*
Allow /d/*/sitemap-*-index.xml$
Allow /d/*/sitemap-*-*.xml.gz$
Disallow /d/*/*
Disallow /
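
The paths in this table use two pattern features: `*` (any run of characters) and a trailing `$` (end of path). The following is a minimal Python sketch of how a few of these rules could be evaluated, assuming the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The function names and sample paths are hypothetical, only a subset of the rules is included, and this is not the scanner's or any particular crawler's implementation.

```python
import re

# A subset of the rules listed for the named crawler group.
RULES = [
    ("Allow", "/$"),
    ("Allow", "/login$"),
    ("Allow", "/pricing"),
    ("Allow", "/d*"),
    ("Allow", "/d/*/*/*"),
    ("Disallow", "/d/*/*"),
    ("Disallow", "/"),
]


def rule_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$' the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules):
    """Return True if `path` may be fetched under `rules`.

    Assumed precedence: the longest matching pattern wins; on equal
    length, Allow beats Disallow. No matching rule means allowed.
    """
    best = None  # (pattern length, is_allow)
    for verdict, pattern in rules:
        if rule_to_regex(pattern).match(path):
            candidate = (len(pattern), verdict == "Allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    # Hypothetical sample paths, chosen only to exercise the patterns.
    for sample in ["/", "/login", "/login/reset", "/d/a/b", "/d/a/b/c"]:
        print(sample, is_allowed(sample, RULES))
```

Under these assumptions, `/login` is allowed but `/login/reset` is not, because `/login$` only matches the exact path and the catch-all `Disallow /` applies otherwise. Similarly, a path with two segments under `/d/` is caught by `Disallow /d/*/*`, while a third segment makes the longer `Allow /d/*/*/*` pattern win.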

*

Rule Path
Disallow /
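
The report lists two groups: the named crawlers, which share the Allow/Disallow table, and `*`, which applies to every other crawler and disallows everything. A rough sketch of that selection step follows, assuming case-insensitive matching on the crawler's product token as described in RFC 9309. The set holds only a subset of the listed tokens, and `select_group` and `examplebot` are hypothetical names used for illustration.

```python
# Subset of the user-agent tokens listed in the named group.
NAMED_AGENTS = {
    "googlebot",
    "bingbot",
    "gptbot",
    "claudebot",
    "duckduckbot",
}


def select_group(product_token):
    """Return which group applies to a crawler's product token.

    "named" means the group with the Allow/Disallow table;
    "*" means the catch-all group (Disallow /).
    """
    token = product_token.strip().lower()
    return "named" if token in NAMED_AGENTS else "*"


if __name__ == "__main__":
    print(select_group("Googlebot"))   # a token from the listed group
    print(select_group("examplebot"))  # hypothetical unlisted crawler
```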