tracxn.com
robots.txt
Robots Exclusion Standard data for tracxn.com
Resource Scan
Scan Details
Site Domain | tracxn.com |
Base Domain | tracxn.com |
Scan Status | Ok |
Last Scan | 2024-09-23T21:04:11+00:00 |
Next Scan | 2024-10-07T21:04:11+00:00 |
Last Scan
Scanned | 2024-09-23T21:04:11+00:00 |
URL | https://tracxn.com/robots.txt |
Domain IPs | 18.155.68.30, 18.155.68.47, 18.155.68.92, 18.155.68.98, 2600:9000:23d2:400:15:54a:8980:93a1, 2600:9000:23d2:6800:15:54a:8980:93a1, 2600:9000:23d2:7a00:15:54a:8980:93a1, 2600:9000:23d2:8a00:15:54a:8980:93a1, 2600:9000:23d2:9e00:15:54a:8980:93a1, 2600:9000:23d2:e00:15:54a:8980:93a1, 2600:9000:23d2:ee00:15:54a:8980:93a1, 2600:9000:23d2:f000:15:54a:8980:93a1 |
Response IP | 18.155.68.92 |
Found | Yes |
Hash | dbb496f74e5d45773f7a577411da8aa2faba33f8ced6d6c41140473f079590c1 |
SimHash | 4514c335c087 |
Groups
google-extended
chatgpt-user
gptbot
anthropic-ai
claudebot
claude-web
perplexitybot
cohere-ai
googlebot
bingbot
msnbot
slurp
applebot
baiduspider
duckduckbot
yandexbot
exabot
teoma
aolbuild
twitterbot
rogerbot
deepcrawl
sogou web spider
sogou inst spider
ecosia
naver
naverbot
naverrobot
seznambot
googleother
Rule | Path |
---|---|
Allow | /$ |
Allow | /b/media$ |
Allow | /callback$ |
Allow | /contactsales$ |
Allow | /contactus$ |
Allow | /contributionspolicy$ |
Allow | /cookiesbythirdpartyservices$ |
Allow | /demo$ |
Allow | /listyourstartup$ |
Allow | /login$ |
Allow | /p/reports$ |
Allow | /pricing |
Allow | /privacypolicy$ |
Allow | /tracxninvestors$ |
Allow | /gdpr$ |
Allow | /cookiepolicy$ |
Allow | /emailpolicy$ |
Allow | /termsofuse$ |
Allow | /signup$ |
Allow | /sectors |
Allow | /d* |
Allow | /d/*/*/* |
Allow | /d/*/sitemap-*-index.xml$ |
Allow | /d/*/sitemap-*-*.xml.gz$ |
Disallow | /d/*/* |
Disallow | / |
*
Rule | Path |
---|---|
Disallow | / |