sina.com.tw
robots.txt
Robots Exclusion Standard data for sina.com.tw
Resource Scan
Scan Details
Site Domain | sina.com.tw |
Base Domain | sina.com.tw |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-04-08T21:13:37+00:00 |
Next Scan | 2024-07-07T21:13:37+00:00 |
Last Successful Scan
Scanned | 2022-07-30T16:13:14+00:00 |
URL | https://www.sina.com.tw/robots.txt |
Response IP | 210.17.38.45 |
Found | Yes |
Hash | fb2806b02a3741f08014e8084ab6ace898a3783b467e1374bc41df938d517718 |
SimHash | 94401d5461d1 |
Groups
*
Rule | Path |
---|---|
Disallow | /tpl_cache/ |
Disallow | /tpl_templates/ |
Disallow | /tpl_templates_c/ |
Disallow | /include/ |
Disallow | /_common/ |
Disallow | /images/ |
Disallow | /_data/ |
Disallow | /css/ |
Disallow | /js/ |