life.mingpao.com
robots.txt

Robots Exclusion Standard data for life.mingpao.com

Resource Scan

Scan Details

Site Domain life.mingpao.com
Base Domain mingpao.com
Scan Status Ok
Last Scan2025-03-09T06:29:23+00:00
Next Scan 2025-03-23T06:29:23+00:00

Last Scan

Scanned2025-03-09T06:29:23+00:00
URL https://life.mingpao.com/robots.txt
Domain IPs 104.22.10.84, 104.22.11.84, 172.67.12.106, 2606:4700:10::6816:a54, 2606:4700:10::6816:b54, 2606:4700:10::ac43:c6a
Response IP 104.22.11.84
Found Yes
Hash bd9754ae5026d3cea80fd63decfd0104b211115f525301a4afb85d688c48a067
SimHash ee07e86ada96

Groups

petalbot

Rule Path
Disallow /

*

Rule Path
Disallow /cfm/
Disallow /php/class/
Disallow /php/manage/
Disallow /php/nocache/
Disallow *dummy%3Dtrue$
Disallow /dat/
Disallow /htm/dummy/
Disallow /php/log*.php
Disallow *page%3D%5B0-9%5D%5B0-9%5D%5B0-9%5D$
Disallow /search