credenceresearch.com
robots.txt

Robots Exclusion Standard data for credenceresearch.com

Resource Scan

Scan Details

Site Domain credenceresearch.com
Base Domain credenceresearch.com
Scan Status Ok
Last Scan2025-09-30T07:38:37+00:00
Next Scan 2025-10-07T07:38:37+00:00

Last Scan

Scanned2025-09-30T07:38:37+00:00
URL https://credenceresearch.com/robots.txt
Domain IPs 34.233.30.223
Response IP 34.233.30.223
Found Yes
Hash 2d1af8fec444fa000e7f03b5f3e52add4ef8fd05a4c250051810cfb1944697fe
SimHash 6d105b50fc37

Groups

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

google-extended

Rule Path
Allow /

claudebot

Rule Path
Allow /

anthropic-user-agent

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

cohere-ai

Rule Path
Allow /

aicodebot

Rule Path
Allow /

youbot

Rule Path
Allow /

*

Rule Path
Allow /report/*
Allow /news/*
Disallow /checkout
Disallow /cart
Disallow /inquiry/*
Disallow /ja/*

Other Records

Field Value
sitemap https://www.credenceresearch.com/sitemap_index.xml