corporate-codes.com
robots.txt

Robots Exclusion Standard data for corporate-codes.com

Resource Scan

Scan Details

Site Domain corporate-codes.com
Base Domain corporate-codes.com
Scan Status Ok
Last Scan2025-10-21T22:28:08+00:00
Next Scan 2025-10-28T22:28:08+00:00

Last Scan

Scanned2025-10-21T22:28:08+00:00
URL https://corporate-codes.com/robots.txt
Domain IPs 104.21.86.141, 172.67.220.145, 2606:4700:3031::ac43:dc91, 2606:4700:3036::6815:568d
Response IP 104.21.86.141
Found Yes
Hash ef714d045041801b5b195102eeb3fff33b3c8211820db57755758e1328573757
SimHash 5d058f68e493

Groups

*

Rule Path
Allow /
Disallow /App/
Disallow /ThinkPHP/

gptbot
claude-web
anthropic-ai
perplexitybot
googleother
duckassistbot

Rule Path
Allow /
Disallow /App/
Disallow /ThinkPHP/

Other Records

Field Value
sitemap https://corporate-codes.com/sitemaps/sitemapindex.xml

Warnings

  • `llm-content` is not a known field.
  • `llm-full-content` is not a known field.