thailandtdac.pages.dev
robots.txt

Robots Exclusion Standard data for thailandtdac.pages.dev

Resource Scan

Scan Details

Site Domain thailandtdac.pages.dev
Base Domain thailandtdac.pages.dev
Scan Status Ok
Last Scan2026-02-27T02:36:45+00:00
Next Scan 2026-03-29T02:36:45+00:00

Last Scan

Scanned2026-02-27T02:36:45+00:00
URL https://thailandtdac.pages.dev/robots.txt
Domain IPs 172.66.44.144, 172.66.47.112, 2606:4700:310c::ac42:2c90, 2606:4700:310c::ac42:2f70
Response IP 172.66.47.112
Found Yes
Hash fc8fcaaadc6cde8442eec6d67296bdc2fb35ce4f18b699eca908ec2b7ff7a4e5
SimHash 66132f524d35

Groups

*

Rule Path
Allow /
Allow /ads.txt

Other Records

Field Value
sitemap https://thailandtdac.pages.dev/sitemap.xml

Comments

  • Allow all crawlers to access the entire site
  • Sitemap location
  • Explicitly allow ads.txt
  • Although not required, this clarifies access for ad crawlers