philippekacou.org
robots.txt

Robots Exclusion Standard data for philippekacou.org

Resource Scan

Scan Details

Site Domain philippekacou.org
Base Domain philippekacou.org
Scan Status Ok
Last Scan 2026-02-19T22:51:55+00:00
Next Scan 2026-03-05T22:51:55+00:00

Last Scan

Scanned 2026-02-19T22:51:55+00:00
URL https://www.philippekacou.org/robots.txt
Domain IPs 66.33.60.129, 76.76.21.22
Response IP 66.33.60.67
Found Yes
Hash f80a5b20a085348abcf7b2e535b97e7f087a8eda10e01329bbff015e092684e0
SimHash 45086873e5b2
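
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a locally fetched copy of the file could be compared against the reported value as follows. The helper name `matches_reported` is hypothetical, and the report does not say whether the digest covers the raw response bytes or a normalized form.

```python
import hashlib

# Value copied from the "Hash" field of the scan report.
REPORTED_HASH = "f80a5b20a085348abcf7b2e535b97e7f087a8eda10e01329bbff015e092684e0"

def matches_reported(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Return True if the SHA-256 digest of `body` equals `reported`.

    Assumption: the scanner hashes the response body with SHA-256.
    """
    return hashlib.sha256(body).hexdigest() == reported.lower()
```

A mismatch would not necessarily mean the file changed; it could also mean the scanner hashes something other than the raw body.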

Groups

*

Rule Path
Allow /
Disallow /api/
Disallow /admin/
Disallow /_next/
Disallow /private/
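
The `*` group mixes a broad `Allow /` with narrower `Disallow` paths, so the outcome for a given URL depends on how a crawler resolves overlapping rules. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific matching path wins, and a tie goes to `allow`). It handles plain path prefixes only, not `*` or `$` patterns, and the function name `is_allowed` is hypothetical.

```python
# Rules of the `*` group, copied from the table above.
STAR_GROUP = [
    ("allow", "/"),
    ("disallow", "/api/"),
    ("disallow", "/admin/"),
    ("disallow", "/_next/"),
    ("disallow", "/private/"),
]

def is_allowed(rules, path):
    """Evaluate `path` against `rules` using longest-match precedence.

    Assumption: RFC 9309 semantics, prefix matching only.
    """
    best_len = -1
    allowed = True  # a path matched by no rule is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed

print(is_allowed(STAR_GROUP, "/about"))               # True
print(is_allowed(STAR_GROUP, "/api/users"))           # False
print(is_allowed(STAR_GROUP, "/private/notes.html"))  # False
```

Parsers that apply the first matching rule in file order instead would stop at `Allow /` and permit every path, so the result depends on the crawler's implementation.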

gptbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

claude-web

Rule Path
Allow /

ccbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

deepseekbot

Rule Path
Allow /

cohere-ai

Rule Path
Allow /

youbot

Rule Path
Allow /

bingbot

Rule Path
Allow /

amazonbot

Rule Path
Allow /

applebot-extended

Rule Path
Allow /

facebookbot

Rule Path
Allow /

meta-externalagent

Rule Path
Allow /

yandexbot

Rule Path
Allow /
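
Each named crawler above has its own group containing only `Allow /`. Under RFC 9309, a crawler that finds a group matching its product token uses that group and ignores the `*` group, so the `Disallow` paths listed under `*` would not apply to these crawlers. The sketch below illustrates that selection step. It assumes exact, case-insensitive token matching, and the function name `select_group` is hypothetical.

```python
# Rules copied from the groups listed above.
STAR_RULES = [
    ("allow", "/"),
    ("disallow", "/api/"),
    ("disallow", "/admin/"),
    ("disallow", "/_next/"),
    ("disallow", "/private/"),
]
ALLOW_ALL = [("allow", "/")]

NAMED_AGENTS = [
    "gptbot", "google-extended", "claude-web", "ccbot", "perplexitybot",
    "deepseekbot", "cohere-ai", "youbot", "bingbot", "amazonbot",
    "applebot-extended", "facebookbot", "meta-externalagent", "yandexbot",
]

GROUPS = {"*": STAR_RULES, **{name: ALLOW_ALL for name in NAMED_AGENTS}}

def select_group(groups, product_token):
    """Return the rules for `product_token`, falling back to the `*` group.

    Assumption: exact, case-insensitive match on the product token.
    """
    return groups.get(product_token.lower(), groups.get("*", []))

print(select_group(GROUPS, "GPTBot"))        # [('allow', '/')]
print(select_group(GROUPS, "SomeOtherBot"))  # the five `*` rules
```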

Other Records

Field Value
sitemap https://www.philippekacou.org/sitemap.xml

Warnings

  • `host` is not a known field.
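
The warning indicates that the file contains a `host` line that the scanner does not recognize. `Host` was historically a Yandex-specific directive and is not part of RFC 9309. The report does not show the line's value, so the sketch below uses a hypothetical sample file to illustrate how such a check might work. The set `KNOWN_FIELDS` is an assumption and is not the scanner's actual list.

```python
# Assumption: fields treated as known; the scanner's real list is not shown.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}

def unknown_fields(text):
    """Return field names in a robots.txt body that are not in KNOWN_FIELDS."""
    found = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in found:
            found.append(field)
    return found

# Hypothetical sample; not the contents of the scanned file.
SAMPLE = """\
User-agent: *
Allow: /
Host: example.org
Sitemap: https://example.org/sitemap.xml
"""

print(unknown_fields(SAMPLE))  # ['host']
```

Crawlers that follow RFC 9309 ignore unrecognized fields, so a warning of this kind is informational and does not by itself invalidate the file.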