m.news.nate.com
robots.txt

Robots Exclusion Standard data for m.news.nate.com

Resource Scan

Scan Details

Site Domain m.news.nate.com
Base Domain nate.com
Scan Status Ok
Last Scan 2024-05-09T09:59:09+00:00
Next Scan 2024-06-08T09:59:09+00:00

Last Scan

Scanned 2024-05-09T09:59:09+00:00
URL https://m.news.nate.com/robots.txt
Domain IPs 117.53.117.21
Response IP 117.53.117.21
Found Yes
Hash 9a8f694fc99e8541992c252479d8d7c5cdfd0ad28d8b8c9cd6c5a95c8335f4de
SimHash 314578900678

Groups

*

Rule Path
Disallow /
Allow /ads.txt
Disallow /view/summary*

grapeshot
mediapartners-google
twitterbot

Rule Path
Allow /view/*
Allow /View/*
Disallow /view/summary*

googlebot

Rule Path
Disallow /apollo/
Disallow /search?*&page=
Allow /search?*&page=1$
Allow /search?*&page=2$
Allow /search?*&page=3$
Allow /search?*&page=4$
Allow /search?*&page=5$
Allow /search?*&page=6$
Allow /search?*&page=7$
Allow /search?*&page=8$
Allow /search?*&page=9$
Disallow /view/summary*
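
The googlebot rules above combine a broad Disallow on paginated search URLs with narrower Allow rules for pages 1 through 9. As a rough illustration of how such a group might be evaluated, the sketch below assumes the precedence described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The helper names (`pattern_to_regex`, `is_allowed`) and the sample paths are hypothetical, and a real crawler may resolve edge cases differently.

```python
import re

# Rules copied from the googlebot group listed above.
GOOGLEBOT_RULES = [
    ("disallow", "/apollo/"),
    ("disallow", "/search?*&page="),
    *[("allow", f"/search?*&page={n}$") for n in range(1, 10)],
    ("disallow", "/view/summary*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape literal text; each '*' becomes "match anything".
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))


def is_allowed(path, rules=GOOGLEBOT_RULES):
    """Return True if `path` may be fetched under `rules` (assumed RFC 9309 precedence)."""
    best_len, best_allow = -1, True  # no matching rule means the path is allowed
    for verdict, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = verdict == "allow"
            # Longest pattern wins; on equal length, prefer Allow.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


# Hypothetical paths, for illustration only.
for sample in ("/search?q=news&page=3", "/search?q=news&page=12", "/apollo/feed"):
    print(sample, is_allowed(sample))
```

Under these assumptions, a search URL ending in `&page=3` is allowed because the Allow pattern is longer than the Disallow pattern it overlaps, while `&page=12` matches only the Disallow rule and is blocked.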

yeti
daum
bingbot
msnbot
zumbot
facebookexternalhit

Rule Path
Allow /
Disallow /apollo/
Disallow /view/summary*

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://m.news.nate.com/sitemap?data=index