sites.thestar.com.my
robots.txt

Robots Exclusion Standard data for sites.thestar.com.my

Resource Scan

Scan Details

Site Domain sites.thestar.com.my
Base Domain thestar.com.my
Scan Status Ok
Last Scan2024-04-22T11:38:46+00:00
Next Scan 2024-05-22T11:38:46+00:00

Last Scan

Scanned2024-04-22T11:38:46+00:00
URL https://sites.thestar.com.my/robots.txt
Domain IPs 13.228.188.75
Response IP 13.228.188.75
Found Yes
Hash 0cb31239832bd4b80f24ae6a8adedf43c8a0636411c8c557dd22d9ae6ce045a1
SimHash 3415b168eff6

Groups

*

Rule Path
Disallow /search
Disallow /audio/printerfriendly.asp
Disallow /starbuys/
Disallow /mobile-2013

Comments

  • This file is to block the /search directory from being crawled by web-robots.
  • Please do not remove this file.
  • All bots
  • Are disallowed from getting any directory or file beginning with search