cacompanyregistry.com
robots.txt

Robots Exclusion Standard data for cacompanyregistry.com

Resource Scan

Scan Details

Site Domain cacompanyregistry.com
Base Domain cacompanyregistry.com
Scan Status Ok
Last Scan2025-10-15T05:13:47+00:00
Next Scan 2025-11-14T05:13:47+00:00

Last Scan

Scanned2025-10-15T05:13:47+00:00
URL https://cacompanyregistry.com/robots.txt
Domain IPs 104.21.84.62, 172.67.187.167, 2606:4700:3035::6815:543e, 2606:4700:3035::ac43:bba7
Response IP 172.67.187.167
Found Yes
Hash de4633808eb3aa8c63daa9d7055464fa05fa83a1805106a1945c6295f6eaa4b5
SimHash 501d4850e8da

Groups

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

brightbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /companies/hit-solution-limited/