ccearkade.com
robots.txt

Robots Exclusion Standard data for ccearkade.com

Resource Scan

Scan Details

Site Domain ccearkade.com
Base Domain ccearkade.com
Scan Status Ok
Last Scan 2025-10-23T03:51:46+00:00
Next Scan 2025-11-06T03:51:46+00:00

Last Scan

Scanned 2025-10-23T03:51:46+00:00
URL https://ccearkade.com/robots.txt
Redirect https://www.ccearkade.com/robots.txt
Redirect Domain www.ccearkade.com
Redirect Base ccearkade.com
Domain IPs 178.32.112.217
Redirect IPs 178.32.112.217
Response IP 178.32.112.217
Found Yes
Hash ea245df17cbc17d9ade494204f2357dcdb42b887818aa3d81d6d0bc4de427257
SimHash 480f2af0e20b

Groups

amazonbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /public/modules/
Disallow /public/tools/
Disallow /admin/
Disallow /classes/
Disallow /controllers/
Disallow /functions/
Disallow /modeles/
Disallow /tools/
Disallow /private/
Disallow /cron/
Disallow /acl/
Disallow /api/

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /public/modules/
Disallow /public/tools/
Disallow /admin/
Disallow /classes/
Disallow /controllers/
Disallow /functions/
Disallow /modeles/
Disallow /tools/
Disallow /private/
Disallow /cron/
Disallow /acl/
Disallow /api/
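
The groups above can be checked programmatically. The following is a minimal sketch, not the site's actual file: it rebuilds a robots.txt from the rules listed in this report and queries it with Python's standard urllib.robotparser. The directive order, the placement of the crawl-delay record inside the bingbot group, and the agent name "examplebot" are assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser

# Agents the report lists with a single "Disallow /" rule.
BLOCKED_AGENTS = [
    "amazonbot", "gptbot", "chatgpt-user",
    "semrushbot", "meta-externalagent", "ahrefsbot",
]

# Paths the report lists for both the bingbot group and the "*" group.
SHARED_PATHS = [
    "/public/modules/", "/public/tools/", "/admin/", "/classes/",
    "/controllers/", "/functions/", "/modeles/", "/tools/",
    "/private/", "/cron/", "/acl/", "/api/",
]


def build_robots_txt() -> str:
    """Rebuild an approximate robots.txt from the report (layout is assumed)."""
    groups = [f"User-agent: {agent}\nDisallow: /\n" for agent in BLOCKED_AGENTS]
    shared = "".join(f"Disallow: {path}\n" for path in SHARED_PATHS)
    # Assumption: the crawl-delay record belongs to the bingbot group.
    groups.append(f"User-agent: bingbot\n{shared}Crawl-delay: 10\n")
    groups.append(f"User-agent: *\n{shared}")
    # Each group ends with a newline, so joining with "\n" leaves a blank
    # line between groups.
    return "\n".join(groups)


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

BASE = "https://www.ccearkade.com"

# Fully blocked agent: no URL is fetchable.
print("gptbot /        ->", parser.can_fetch("gptbot", BASE + "/"))
# bingbot: only the listed directories are off limits.
print("bingbot /admin/ ->", parser.can_fetch("bingbot", BASE + "/admin/"))
print("bingbot /       ->", parser.can_fetch("bingbot", BASE + "/"))
print("bingbot delay   ->", parser.crawl_delay("bingbot"))
# "examplebot" is a hypothetical agent that falls through to the "*" group.
print("examplebot /api/->", parser.can_fetch("examplebot", BASE + "/api/"))
print("examplebot delay->", parser.crawl_delay("examplebot"))
```

Note that urllib.robotparser matches rules by simple path prefix and agent-name substring, so results for edge cases may differ from how a particular crawler interprets the same file.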