thuonline.com
robots.txt
Robots Exclusion Standard data for thuonline.com
Resource Scan
Scan Details
| Site Domain | thuonline.com |
| Base Domain | thuonline.com |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Request timed out. |
| Last Scan | 2026-01-14T05:53:53+00:00 |
| Next Scan | 2026-04-14T05:53:53+00:00 |
Last Successful Scan
| Scanned | 2025-03-20T20:12:52+00:00 |
| URL | https://thuonline.com/robots.txt |
| Domain IPs | 104.21.92.73, 172.67.188.160, 2606:4700:3036::6815:5c49, 2606:4700:3036::ac43:bca0 |
| Response IP | 104.21.92.73 |
| Found | Yes |
| Hash | e1e306e5fe7afd9cbbe0e3056e5fdb376e71231f3dad1112d021ae1613922d7d |
| SimHash | 505c9002ef58 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /feed/ |
| Disallow | /author/ |
| Disallow | /tags/ |
| Disallow | /ajax/ |