m.news.nate.com
robots.txt
Robots Exclusion Standard data for m.news.nate.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | m.news.nate.com |
Base Domain | nate.com |
Scan Status | Ok |
Last Scan | 2024-05-09T09:59:09+00:00 |
Next Scan | 2024-06-08T09:59:09+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-09T09:59:09+00:00 |
URL | https://m.news.nate.com/robots.txt |
Domain IPs | 117.53.117.21 |
Response IP | 117.53.117.21 |
Found | Yes |
Hash | 9a8f694fc99e8541992c252479d8d7c5cdfd0ad28d8b8c9cd6c5a95c8335f4de |
SimHash | 314578900678 |
Groups
*
Rule | Path |
---|---|
Disallow | / |
Allow | /ads.txt |
Disallow | /view/summary* |
grapeshot
mediapartners-google
twitterbot
Rule | Path |
---|---|
Allow | /view/* |
Allow | /View/* |
Disallow | /view/summary* |
googlebot
Rule | Path |
---|---|
Disallow | /apollo/ |
Disallow | /search?*&page= |
Allow | /search?*&page=1$ |
Allow | /search?*&page=2$ |
Allow | /search?*&page=3$ |
Allow | /search?*&page=4$ |
Allow | /search?*&page=5$ |
Allow | /search?*&page=6$ |
Allow | /search?*&page=7$ |
Allow | /search?*&page=8$ |
Allow | /search?*&page=9$ |
Disallow | /view/summary* |
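The googlebot rules above combine a broad Disallow on paginated search URLs with Allow entries that end in `$`. The sketch below shows one way such rules could resolve. It assumes the longest-match precedence described in RFC 9309, with Allow winning a tie. The scan itself does not state which semantics any crawler applies. The names `pattern_to_regex`, `is_allowed`, and `GOOGLEBOT_RULES` are hypothetical and exist only for this illustration.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern: '*' matches any run of
    characters, and a trailing '$' anchors the end of the URL path."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, path: str) -> bool:
    """Assumed semantics: the longest matching pattern wins; on a tie,
    Allow beats Disallow; a path that matches no rule is allowed."""
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]

# The googlebot group as listed in the table above.
GOOGLEBOT_RULES = (
    [("disallow", "/apollo/"), ("disallow", "/search?*&page=")]
    + [("allow", f"/search?*&page={n}$") for n in range(1, 10)]
    + [("disallow", "/view/summary*")]
)

for path in ("/search?q=news&page=1",
             "/search?q=news&page=12",
             "/view/summary/123",
             "/view/123"):
    print(path, is_allowed(GOOGLEBOT_RULES, path))
```

Under these assumptions, search result pages 1 through 9 remain crawlable because the anchored Allow patterns are longer than the Disallow they override, while page 10 and beyond fall back to the Disallow. In this sketch `/search?q=news&page=1` is allowed only when `page` is the last query parameter, since `$` requires the URL to end there.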
Other Records
Field | Value |
---|---|
sitemap | https://m.news.nate.com/sitemap?data=index |
sitemap | https://m.news.nate.com/sitemap?data=index |