jump2.mingpao.com
robots.txt
Robots Exclusion Standard data for jump2.mingpao.com
Resource Scan
Scan Details
Site Domain | jump2.mingpao.com |
Base Domain | mingpao.com |
Scan Status | Ok |
Last Scan | 2024-06-07T09:38:17+00:00 |
Next Scan | 2024-07-07T09:38:17+00:00 |
Last Scan
Scanned | 2024-06-07T09:38:17+00:00 |
URL | https://jump2.mingpao.com/robots.txt |
Redirect | https://jump.mingpao.com/robots.txt |
Redirect Domain | jump.mingpao.com |
Redirect Base | mingpao.com |
Domain IPs | 202.80.6.28 |
Redirect IPs | 202.80.6.28 |
Response IP | 202.80.6.28 |
Found | Yes |
Hash | 063379bb1fb2ad97c42423792483702d65c317e453b25f6743262cf2f4bf94a1 |
SimHash | 245cf651c543 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /tmp/ |
Disallow | /htm/dummy/ |
Disallow | /m/ |
Disallow | */api/ |
Other Records
Field | Value |
---|---|
sitemap | https://jump.mingpao.com/sitemap2/static.xml |
sitemap | https://jump.mingpao.com/career-news/sitemap_index.xml |
sitemap | https://jump.mingpao.com/career-news/post-sitemap.xml |
sitemap | https://jump.mingpao.com/sitemap2/courses/sitemap.xml |
sitemap | https://jump.mingpao.com/sitemap2/job/sitemap.xml |
Warnings
- 2 invalid lines.