digitimes.com
robots.txt
Robots Exclusion Standard data for digitimes.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | digitimes.com |
Base Domain | digitimes.com |
Scan Status | Ok |
Last Scan | 2024-05-15T05:48:25+00:00 |
Next Scan | 2024-05-22T05:48:25+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-15T05:48:25+00:00 |
URL | https://digitimes.com/robots.txt |
Redirect | https://www.digitimes.com/robots.txt |
Redirect Domain | www.digitimes.com |
Redirect Base | digitimes.com |
Domain IPs | 108.157.254.105, 108.157.254.118, 108.157.254.13, 108.157.254.64 |
Redirect IPs | 108.157.254.105, 108.157.254.118, 108.157.254.13, 108.157.254.64 |
Response IP | 108.157.254.118 |
Found | Yes |
Hash | 7e6d65c779f61865421919756e2ab624a6aad085610e8bce4184bf32d3144994 |
SimHash | 6110cd40ad13 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /update/ |
Disallow | /Update/ |
Disallow | /DailyMail/ |
Disallow | /dailymail/ |
Disallow | /Tornado/ |
Disallow | /tornado/ |
Disallow | /webad/ |
Disallow | /Webad/ |
Disallow | /Inc/ |
Disallow | /inc/ |
Disallow | /chart/ |
Disallow | /computexfiles/ |
Disallow | /ComputexFiles/ |
Disallow | /newssites.asp |
Disallow | /Newssites.asp |
Other Records
Field | Value |
---|---|
sitemap | https://www.digitimes.com/rss/daily.xml |
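The records above can be checked programmatically. The following is a minimal sketch, assuming the scanned file is equivalent to a plain robots.txt rebuilt from the tables in this report (the `ROBOTS_TXT` string is a reconstruction, not the fetched file, and `is_allowed` is a hypothetical helper name). It uses Python's standard `urllib.robotparser`, which matches paths by case-sensitive prefix, which is presumably why most rules are listed in two capitalizations.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the tables in this report (assumption: the real file
# contains no other directives than the ones listed here).
ROBOTS_TXT = """\
User-agent: *
Disallow: /update/
Disallow: /Update/
Disallow: /DailyMail/
Disallow: /dailymail/
Disallow: /Tornado/
Disallow: /tornado/
Disallow: /webad/
Disallow: /Webad/
Disallow: /Inc/
Disallow: /inc/
Disallow: /chart/
Disallow: /computexfiles/
Disallow: /ComputexFiles/
Disallow: /newssites.asp
Disallow: /Newssites.asp
Sitemap: https://www.digitimes.com/rss/daily.xml
"""

# Parse the text offline; no network request is made.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, "https://www.digitimes.com" + path)


if __name__ == "__main__":
    for path in ("/update/index.asp", "/chart/", "/Chart/", "/news/"):
        print(path, "allowed" if is_allowed(path) else "disallowed")
    # site_maps() requires Python 3.8 or later.
    print("sitemaps:", parser.site_maps())
```

With this parser, `/chart/` is disallowed while `/Chart/` is allowed, because only the lowercase variant appears in the group. Other crawlers may apply different matching rules, so the result reflects Python's parser rather than any particular search engine.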