nytchina.com
robots.txt
Robots Exclusion Standard data for nytchina.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | nytchina.com |
Base Domain | nytchina.com |
Scan Status | Ok |
Last Scan | 2024-05-02T11:47:15+00:00 |
Next Scan | 2024-06-01T11:47:15+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-02T11:47:15+00:00 |
URL | https://www.nytchina.com/robots.txt |
Domain IPs | 104.21.24.208, 172.67.220.203, 2606:4700:3031::ac43:dccb, 2606:4700:3033::6815:18d0 |
Response IP | 104.21.24.208 |
Found | Yes |
Hash | cd30faf3d560d984c7b83db629624e70332749a014c3fdcbb1238fa671614e58 |
SimHash | fc11111b8f31 |
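The report does not name the algorithm behind the Hash field. Its 64 hexadecimal digits are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. That is an assumption: the scanner may normalize the body before hashing, in which case a freshly fetched copy would not reproduce the value. The function names `sha256_hex` and `matches_reported_hash` are illustrative only.

```python
import hashlib

# Value copied from the Hash row of the report.
REPORTED_HASH = "cd30faf3d560d984c7b83db629624e70332749a014c3fdcbb1238fa671614e58"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched robots.txt body against the reported hash.

    Assumes (not confirmed by the report) that the hash is SHA-256
    over the unmodified response body.
    """
    return sha256_hex(body) == reported.lower()
```

To use it, fetch the URL listed above with any HTTP client and pass the response bytes to `matches_reported_hash`. A mismatch may mean the file changed since the scan, or that the hashing assumption is wrong.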
Groups
*
Rule | Path |
---|---|
Disallow | /ad-test/ |
Disallow | /helix-test/ |
Disallow | /async/ |
Disallow | /users/ |
Disallow | /sso/ |
Disallow | /tools/ |
Disallow | /search/ |
Disallow | /email/ |
Disallow | /*?*changeLang=zh-hant |
Disallow | /*?*changeLang=zh-hans |
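The last two rules use `*` wildcards, so a plain prefix comparison is not enough to evaluate them. The following is a minimal sketch of how a crawler might test a URL path (including its query string) against this group, assuming the wildcard semantics of RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end, and every rule is matched from the start of the path. The names `DISALLOW_RULES`, `rule_to_regex`, and `is_disallowed` are hypothetical, and the sketch ignores Allow rules and longest-match precedence because this group contains only Disallow entries.

```python
import re

# Disallow paths copied from the "*" group in the table above.
DISALLOW_RULES = [
    "/ad-test/",
    "/helix-test/",
    "/async/",
    "/users/",
    "/sso/",
    "/tools/",
    "/search/",
    "/email/",
    "/*?*changeLang=zh-hant",
    "/*?*changeLang=zh-hans",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored_end = rule.endswith("$")
    body = rule[:-1] if anchored_end else rule
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored_end else ""))


def is_disallowed(path_and_query: str, rules=DISALLOW_RULES) -> bool:
    """Return True if any rule matches from the start of the path."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(rule_to_regex(rule).match(path_and_query) for rule in rules)
```

Under these assumptions, `is_disallowed("/users/profile")` and `is_disallowed("/world/article?changeLang=zh-hant")` are both true, while `is_disallowed("/world/article")` is false. A path such as `/news/search/` is also not blocked, because `/search/` must match at the start of the path.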
Other Records
Field | Value |
---|---|
sitemap | https://cn.nytimes.com/sitemap.xml |
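The parsed groups and other records can be turned back into robots.txt text. The sketch below does this for the data in this report, with `GROUPS`, `OTHER_RECORDS`, and `render_robots` as hypothetical names. The original file's line order, blank lines, comments, and field capitalization are not recoverable from the report, so the output is an equivalent file rather than a byte-for-byte copy and should not be expected to reproduce the reported hash.

```python
# Parsed data copied from the Groups and Other Records tables.
GROUPS = {
    "*": [
        ("Disallow", "/ad-test/"),
        ("Disallow", "/helix-test/"),
        ("Disallow", "/async/"),
        ("Disallow", "/users/"),
        ("Disallow", "/sso/"),
        ("Disallow", "/tools/"),
        ("Disallow", "/search/"),
        ("Disallow", "/email/"),
        ("Disallow", "/*?*changeLang=zh-hant"),
        ("Disallow", "/*?*changeLang=zh-hans"),
    ],
}
OTHER_RECORDS = [("sitemap", "https://cn.nytimes.com/sitemap.xml")]


def render_robots(groups, other_records) -> str:
    """Render parsed groups and records as robots.txt text."""
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{field}: {path}" for field, path in rules)
        lines.append("")  # blank line separates a group from what follows
    lines.extend(f"{field}: {value}" for field, value in other_records)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_robots(GROUPS, OTHER_RECORDS), end="")
```

Note that the sitemap record points to cn.nytimes.com rather than nytchina.com; the sketch copies the value as reported.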