archive.shine.cn
robots.txt
Robots Exclusion Standard data for archive.shine.cn
Resource Scan
Scan Details
Site Domain | archive.shine.cn |
Base Domain | shine.cn |
Scan Status | Ok |
Last Scan | 2024-10-30T08:08:01+00:00 |
Next Scan | 2024-11-29T08:08:01+00:00 |
Last Scan
Scanned | 2024-10-30T08:08:01+00:00 |
URL | https://archive.shine.cn/robots.txt |
Domain IPs | 138.113.115.37, 2a01:53c0:ffcc::55 |
Response IP | 138.113.115.37 |
Found | Yes |
Hash | cc6fb104628871ce3ca1a43a8a7c4762adee6edc82525f5a2d533d5d4e015eb5 |
SimHash | 088460448712 |
Groups
*
Rule | Path |
---|---|
Disallow | /bin/ |
Disallow | /ucenter/ |
Disallow | /UserCenter/ |
Disallow | /Style/ |
Disallow | /article/sl.aspx* |
Other Records
Field | Value |
---|---|
sitemap | https://archive.shine.cn/sitemapindex.xml |
sitemap | https://archive.shine.cn/sitemap-metro.xml |
sitemap | https://archive.shine.cn/sitemap-biz.xml |
sitemap | https://archive.shine.cn/sitemap-nation.xml |
sitemap | https://archive.shine.cn/sitemap-world.xml |
sitemap | https://archive.shine.cn/sitemap-sports.xml |
sitemap | https://archive.shine.cn/sitemap-others.xml |
sitemap | https://archive.shine.cn/sitemap-shd.xml |