trongraulamvuon.com
robots.txt

Robots Exclusion Standard data for trongraulamvuon.com

Resource Scan

Scan Details

Site Domain trongraulamvuon.com
Base Domain trongraulamvuon.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2025-09-19T14:18:14+00:00
Next Scan 2025-10-19T14:18:14+00:00

Last Successful Scan

Scanned2025-08-21T09:52:48+00:00
URL http://trongraulamvuon.com/robots.txt
Redirect https://trongraulamvuon.com/robots.txt
Domain IPs 104.21.1.162, 172.67.129.159
Response IP 172.67.129.159
Found Yes
Hash 9d60cac61029b1b911129a2f4efe2178f2e8b51d0190245aeddbdea7a9cf5879
SimHash 48105dd2c097

Groups

*

Rule Path
Allow /
Disallow /api/
Disallow /_next/
Disallow /admin/

gptbot

Rule Path
Disallow /20*/
Disallow /archive/

Other Records

Field Value
crawl-delay 10

chatgpt-user

Rule Path
Disallow /20*/
Disallow /archive/

Other Records

Field Value
crawl-delay 10

ccbot

Rule Path
Disallow /20*/
Disallow /archive/

Other Records

Field Value
crawl-delay 10

anthropic-ai

Rule Path
Disallow /20*/
Disallow /archive/

Other Records

Field Value
crawl-delay 10

claude-web

Rule Path
Disallow /20*/
Disallow /archive/

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://yourdomain.com/sitemap.xml

Comments

  • Specific rules for AI bots
  • Allow sitemap access