hoangtienphat.com
robots.txt

Robots Exclusion Standard data for hoangtienphat.com

Resource Scan

Scan Details

Site Domain hoangtienphat.com
Base Domain hoangtienphat.com
Scan Status Ok
Last Scan 2026-02-08T00:55:42+00:00
Next Scan 2026-03-10T00:55:42+00:00

Last Scan

Scanned 2026-02-08T00:55:42+00:00
URL https://hoangtienphat.com/robots.txt
Domain IPs 103.255.237.127
Response IP 103.255.237.127
Found Yes
Hash 5f740cb721abaccf226a32628edcc09fda00238716f58115a51b9dcaa4a1d77f
SimHash 62b078d5cfb6

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

oai-searchbot
chatgpt-user
perplexitybot
firecrawlagent
andibot
exabot
phindbot
youbot

Rule Path
Allow /

gptbot
ccbot
google-extended

Rule Path
Disallow /

googlebot
bingbot

Rule Path
Allow /

*

Rule Path
Disallow /admin/
Disallow /internal/
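
The report lists two separate `*` groups. Under RFC 9309, groups that match the same user agent are combined, and the rule with the longest matching path wins, with Allow preferred on a tie. The sketch below illustrates that evaluation for the merged `*` rules only. The names `STAR_RULES` and `is_allowed` are hypothetical, and wildcard (`*`, `$`) handling is left out because none of the listed paths use it.

```python
# Rules from both "*" groups in the report, merged into one list.
# Each entry is (is_allow, path_prefix). Hypothetical helper data.
STAR_RULES = [
    (False, "/wp-admin/"),
    (True, "/wp-admin/admin-ajax.php"),
    (False, "/admin/"),
    (False, "/internal/"),
]


def is_allowed(path: str, rules=STAR_RULES) -> bool:
    """Longest-prefix match; Allow wins ties; no match means allowed."""
    best_len = -1
    best_allow = True
    for allow, prefix in rules:
        if not path.startswith(prefix):
            continue
        # Prefer the longer prefix, or Allow when lengths are equal.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            best_allow = allow
    return best_allow


for p in ["/", "/wp-admin/options.php", "/wp-admin/admin-ajax.php", "/admin/"]:
    print(p, "allowed" if is_allowed(p) else "disallowed")
```

A crawler that does not merge duplicate groups, or that applies rules in file order rather than by longest match, may reach different answers for `/admin/` and `/wp-admin/admin-ajax.php`.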

Other Records

Field Value
sitemap https://hoangtienphat.com/sitemap_index.xml

Comments

  • Allow AI search and agent use
  • Disallow AI training data collection
  • Allow traditional search indexing
  • Disallow access to admin areas for all bots
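
The policy summarized above can be spot-checked with Python's standard-library `urllib.robotparser`. The `ROBOTS_TXT` text below is a reconstruction from the groups listed in this report, not the file as served: the layout, the token casing, and the omission of comments are assumptions, and the `check` helper and the `SomeOtherBot` agent name are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report's groups; exact layout is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

User-agent: oai-searchbot
User-agent: chatgpt-user
User-agent: perplexitybot
User-agent: firecrawlagent
User-agent: andibot
User-agent: exabot
User-agent: phindbot
User-agent: youbot
Allow: /

User-agent: gptbot
User-agent: ccbot
User-agent: google-extended
Disallow: /

User-agent: googlebot
User-agent: bingbot
Allow: /

User-agent: *
Disallow: /admin/
Disallow: /internal/

Sitemap: https://hoangtienphat.com/sitemap_index.xml
"""

SITE = "https://hoangtienphat.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def check(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, SITE + path)


for agent, path in [
    ("OAI-SearchBot", "/"),      # AI search and agent use
    ("GPTBot", "/"),             # AI training data collection
    ("Googlebot", "/"),          # traditional search indexing
    ("SomeOtherBot", "/wp-admin/"),  # any other bot, admin area
]:
    print(f"{agent:15} {path:12} {'allowed' if check(agent, path) else 'disallowed'}")

print("Sitemaps:", parser.site_maps())
```

Python's parser evaluates rules in file order and may not merge the two `*` groups, so results for `/admin/`, `/internal/`, and `/wp-admin/admin-ajax.php` can differ from a crawler that follows RFC 9309 strictly. The cases checked here do not depend on that difference.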