news.tvb.com
robots.txt

Robots Exclusion Standard data for news.tvb.com

Resource Scan

Scan Details

Site Domain news.tvb.com
Base Domain tvb.com
Scan Status Ok
Last Scan 2025-12-26T06:26:34+00:00
Next Scan 2026-01-02T06:26:34+00:00

Last Scan

Scanned 2025-12-26T06:26:34+00:00
URL https://news.tvb.com/robots.txt
Domain IPs 35.186.198.41
Response IP 35.186.198.41
Found Yes
Hash 6e9d5dcd85a54b3c7e39c48cdf4e65a6233aab8d6409537058825d268ae6d63b
SimHash a10747026b91
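
The Hash value above is 64 hexadecimal digits long, which is the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the following is only a sketch under the assumption that it is SHA-256 over the raw response body. The helper names `body_digest` and `matches_report` are hypothetical.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "6e9d5dcd85a54b3c7e39c48cdf4e65a6233aab8d6409537058825d268ae6d63b"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """Check whether a freshly fetched body matches the reported hash."""
    return body_digest(body) == REPORTED_HASH
```

If a later fetch of https://news.tvb.com/robots.txt yields a different digest, either the file has changed since the scan or the assumption about the algorithm is wrong.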

Groups

*

Rule Path
Allow /
Allow /tc
Disallow /tc/focus/
Disallow /tc/instant/
Disallow /tc/search?*
Disallow /sc/focus/
Disallow /sc/instant/
Disallow /sc/search?*

petalbot

Rule Path
Allow /
Disallow /tc/focus/
Disallow /tc/instant/
Disallow /tc/search?*
Disallow /sc/focus/
Disallow /sc/instant/
Disallow /sc/search?*
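
How the two groups above apply to a given URL can be sketched with the longest-match rule from RFC 9309: the matching rule with the longest path wins, Allow wins a tie, and a path with no matching rule is allowed. This is a minimal illustration, not the behaviour of any particular crawler. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical, and group selection is simplified to an exact, case-insensitive lookup of the user-agent token.

```python
import re

# Rules copied from the Groups tables above.
SHARED_DISALLOWS = [
    ("disallow", "/tc/focus/"),
    ("disallow", "/tc/instant/"),
    ("disallow", "/tc/search?*"),
    ("disallow", "/sc/focus/"),
    ("disallow", "/sc/instant/"),
    ("disallow", "/sc/search?*"),
]
GROUPS = {
    "*": [("allow", "/"), ("allow", "/tc")] + SHARED_DISALLOWS,
    "petalbot": [("allow", "/")] + SHARED_DISALLOWS,
}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a prefix-matching regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent: str, path: str) -> bool:
    """Apply longest-match precedence; `path` includes any query string."""
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, verdict = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, verdict = len(pattern), allow
    return verdict


print(is_allowed("Googlebot", "/tc/local/abc"))    # True
print(is_allowed("Googlebot", "/tc/focus/123"))    # False
print(is_allowed("PetalBot", "/sc/search?q=news")) # False
```

Under this rule the order of the lines within a group does not matter. A parser that instead takes the first matching rule would let the leading `Allow /` override every Disallow, so precedence is worth checking in whichever library is used.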

Other Records

Field Value
sitemap https://news.tvb.com/sitemap.xml
sitemap https://news.tvb.com/sitemap_category.xml
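
The parsed records in this report can be rendered back into robots.txt syntax, as in the sketch below. The report does not preserve the original file's comments, blank lines, field casing, or the position of the sitemap lines, so the output approximates the live file rather than copying it. The function name `render_robots_txt` is hypothetical.

```python
# Groups and sitemap records copied from the report above.
SHARED_DISALLOWS = [
    ("Disallow", "/tc/focus/"),
    ("Disallow", "/tc/instant/"),
    ("Disallow", "/tc/search?*"),
    ("Disallow", "/sc/focus/"),
    ("Disallow", "/sc/instant/"),
    ("Disallow", "/sc/search?*"),
]
GROUPS = [
    ("*", [("Allow", "/"), ("Allow", "/tc")] + SHARED_DISALLOWS),
    ("petalbot", [("Allow", "/")] + SHARED_DISALLOWS),
]
SITEMAPS = [
    "https://news.tvb.com/sitemap.xml",
    "https://news.tvb.com/sitemap_category.xml",
]


def render_robots_txt(groups, sitemaps) -> str:
    """Render parsed groups and sitemap records as robots.txt text."""
    lines = []
    for agent, rules in groups:
        lines.append(f"User-agent: {agent}")
        for kind, path in rules:
            lines.append(f"{kind}: {path}")
        lines.append("")  # blank line between groups
    for url in sitemaps:
        lines.append(f"Sitemap: {url}")
    return "\n".join(lines) + "\n"


print(render_robots_txt(GROUPS, SITEMAPS))
```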