m.mingpao.com
robots.txt

Robots Exclusion Standard data for m.mingpao.com

Resource Scan

Scan Details

Site Domain m.mingpao.com
Base Domain mingpao.com
Scan Status Ok
Last Scan 2024-05-22T09:38:45+00:00
Next Scan 2024-06-05T09:38:45+00:00

Last Scan

Scanned 2024-05-22T09:38:45+00:00
URL https://m.mingpao.com/robots.txt
Domain IPs 202.80.6.228, 202.80.6.244
Response IP 202.80.6.244
Found Yes
Hash 402dfb5b7b86b190212e54572bd214b38c0bb57f9d03a46779f649d569ea83d0
SimHash c406c840ca11
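The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the scan record does not name the algorithm. The sketch below assumes it is SHA-256 over the raw response body and shows how a freshly fetched copy of robots.txt could be compared against the recorded value. The function names robots_digest and matches_scan are hypothetical and used only for illustration.

```python
import hashlib

# Value copied from the "Hash" field of the scan record.
RECORDED_HASH = "402dfb5b7b86b190212e54572bd214b38c0bb57f9d03a46779f649d569ea83d0"


def robots_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a raw robots.txt body."""
    return hashlib.sha256(body).hexdigest()


def matches_scan(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Return True if the body hashes to the recorded value.

    Assumption: the scanner hashes the unmodified response bytes with SHA-256.
    """
    return robots_digest(body) == recorded.lower()
```

If the assumption holds, passing the bytes returned for https://m.mingpao.com/robots.txt to matches_scan would indicate whether the file has changed since the scan. A mismatch may also simply mean the scanner normalizes the body or uses a different algorithm.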

Groups

trade desk ads.txt & sellers.json crawler

Rule Path
Disallow (empty)

grapeshot

Rule Path
Disallow (empty)

petalbot

Rule Path
Disallow /

*

Rule Path
Disallow /cfm/
Disallow /dummy%3D*$
Disallow /submit%3D
Disallow /php/manage/
Disallow /php/class/
Disallow /dat/
Disallow /htm/dummy/
Disallow /login0
Disallow /login3
Disallow /logout
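
The groups above can be checked with Python's standard urllib.robotparser module. The robots.txt text below is a reconstruction from the listed groups, not the original file: the exact casing, ordering, and any comments in the real file are assumptions. An empty Disallow path means nothing is disallowed for that group, while "Disallow /" blocks the whole site. Note that the standard-library parser matches paths by prefix only and does not interpret the * and $ wildcards, so the /dummy%3D*$ rule is included for completeness but not exercised here. The agent name examplebot is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in the scan (an assumption, not the
# verbatim file served by m.mingpao.com).
ROBOTS_TXT = """\
User-agent: trade desk ads.txt & sellers.json crawler
Disallow:

User-agent: grapeshot
Disallow:

User-agent: petalbot
Disallow: /

User-agent: *
Disallow: /cfm/
Disallow: /dummy%3D*$
Disallow: /submit%3D
Disallow: /php/manage/
Disallow: /php/class/
Disallow: /dat/
Disallow: /htm/dummy/
Disallow: /login0
Disallow: /login3
Disallow: /logout
"""


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser()

# petalbot is blocked from the entire site.
print(parser.can_fetch("petalbot", "https://m.mingpao.com/"))

# grapeshot has its own group with an empty Disallow, so the "*" rules
# do not apply to it.
print(parser.can_fetch("grapeshot", "https://m.mingpao.com/cfm/page"))

# Any other agent falls back to the "*" group.
print(parser.can_fetch("examplebot", "https://m.mingpao.com/php/manage/x"))
print(parser.can_fetch("examplebot", "https://m.mingpao.com/news"))
```

Under this reconstruction, the first and third checks report that fetching is not allowed, and the second and fourth report that it is.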