thethingstech.com
robots.txt

Robots Exclusion Standard data for thethingstech.com

Resource Scan

Scan Details

Site Domain thethingstech.com
Base Domain thethingstech.com
Scan Status Ok
Last Scan2026-02-15T14:42:34+00:00
Next Scan 2026-02-22T14:42:34+00:00

Last Scan

Scanned2026-02-15T14:42:34+00:00
URL https://thethingstech.com/robots.txt
Domain IPs 104.21.21.249, 172.67.201.118, 2606:4700:3035::6815:15f9, 2606:4700:3037::ac43:c976
Response IP 104.21.21.249
Found Yes
Hash b02e2193bb7ca8556e532b4ffe862cc5684d85cf77c91eb76426c5a526dfe293
SimHash a910cc52a48e

Groups

*

Rule Path
Allow /
Disallow /tmp/
Disallow /dev-tests/
Disallow /test-buttons.html
Disallow /blog/admin
Disallow /blog/admin.html
Disallow /blog/editor
Disallow /blog/editor.html
Disallow /games/gobang/node_modules/

Other Records

Field Value
sitemap https://thethingstech.com/sitemap.xml

Comments

  • 避免暫存/測試頁被收錄(也會在部署時排除)
  • 避免後台頁被搜尋引擎收錄
  • Prevent accidental indexing of bundled dependencies
  • Sitemap helps search engines discover your pages