digitimes.com
robots.txt

Robots Exclusion Standard data for digitimes.com

Resource Scan

Scan Details

Site Domain digitimes.com
Base Domain digitimes.com
Scan Status Ok
Last Scan2024-05-15T05:48:25+00:00
Next Scan 2024-05-22T05:48:25+00:00

Last Scan

Scanned2024-05-15T05:48:25+00:00
URL https://digitimes.com/robots.txt
Redirect https://www.digitimes.com/robots.txt
Redirect Domain www.digitimes.com
Redirect Base digitimes.com
Domain IPs 108.157.254.105, 108.157.254.118, 108.157.254.13, 108.157.254.64
Redirect IPs 108.157.254.105, 108.157.254.118, 108.157.254.13, 108.157.254.64
Response IP 108.157.254.118
Found Yes
Hash 7e6d65c779f61865421919756e2ab624a6aad085610e8bce4184bf32d3144994
SimHash 6110cd40ad13

Groups

*

Rule Path
Disallow /update/
Disallow /Update/
Disallow /DailyMail/
Disallow /dailymail/
Disallow /Tornado/
Disallow /tornado/
Disallow /webad/
Disallow /Webad/
Disallow /Inc/
Disallow /inc/
Disallow /chart/
Disallow /computexfiles/
Disallow /ComputexFiles/
Disallow /newssites.asp
Disallow /Newssites.asp

gptbot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.digitimes.com/rss/daily.xml