happypama.mingpao.com
robots.txt

Robots Exclusion Standard data for happypama.mingpao.com

Resource Scan

Scan Details

Site Domain happypama.mingpao.com
Base Domain mingpao.com
Scan Status Ok
Last Scan2024-04-13T05:35:39+00:00
Next Scan 2024-05-13T05:35:39+00:00

Last Scan

Scanned2024-04-13T05:35:39+00:00
URL https://happypama.mingpao.com/robots.txt
Domain IPs 104.22.10.84, 104.22.11.84, 172.67.12.106, 2606:4700:10::6816:a54, 2606:4700:10::6816:b54, 2606:4700:10::ac43:c6a
Response IP 104.22.10.84
Found Yes
Hash 41ec68c0558448ef77bae578a649303d0a59633e6ed44cbe5f6870e52bdc0144
SimHash 4905c860cb91

Groups

trade desk ads.txt & sellers.json crawler

Rule Path
Disallow

*

Rule Path
Disallow /wp-admin/
Disallow /*/feed/
Disallow /feed/
Disallow /htm/dummy/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://happypama.mingpao.com/sitemap.xml