thdigital.cl
robots.txt

Robots Exclusion Standard data for thdigital.cl

Resource Scan

Scan Details

Site Domain thdigital.cl
Base Domain thdigital.cl
Scan Status Ok
Last Scan2025-11-19T08:20:17+00:00
Next Scan 2025-11-26T08:20:17+00:00

Last Scan

Scanned2025-11-19T08:20:17+00:00
URL https://thdigital.cl/robots.txt
Domain IPs 104.21.14.45, 172.67.157.188, 2606:4700:3035::ac43:9dbc, 2606:4700:3036::6815:e2d
Response IP 104.21.14.45
Found Yes
Hash 2c0027d1d0dcbc166d439a4dd82771eded92c757a91c4300d14124b858bf4c50
SimHash ab3d5844aaf3

Groups

*

Rule Path
Disallow /getlinks*
Disallow /adminbox/

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://thdigital.cl/sitemap.xml