news.nate.com
robots.txt

Robots Exclusion Standard data for news.nate.com

Resource Scan

Scan Details

Site Domain news.nate.com
Base Domain nate.com
Scan Status Ok
Last Scan 2024-11-07T14:11:42+00:00
Next Scan 2024-12-07T14:11:42+00:00

Last Scan

Scanned 2024-11-07T14:11:42+00:00
URL https://news.nate.com/robots.txt
Domain IPs 117.53.117.12
Response IP 117.53.117.12
Found Yes
Hash 5fcf9fd6b070bb2b3ba11181fb76a8978b5b07e25e6d0100e39a15d144e97a23
SimHash 3155589002d8

Groups

*

Rule Path
Disallow /
Allow /ads.txt
Disallow /view/summary*

mediapartners-google
twitterbot

Rule Path
Allow /view/*
Allow /View/*
Disallow /view/summary*

googlebot

Rule Path
Disallow /search?*&page=
Allow /search?*&page=1$
Allow /search?*&page=2$
Allow /search?*&page=3$
Allow /search?*&page=4$
Allow /search?*&page=5$
Allow /search?*&page=6$
Allow /search?*&page=7$
Allow /search?*&page=8$
Allow /search?*&page=9$
Disallow /view/summary*
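
The googlebot group above relies on pattern matching with `*` and `$`, where a more specific Allow overrides a broader Disallow. A minimal sketch of how such rules are commonly evaluated follows, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The helper names `pattern_to_regex` and `is_allowed` and the sample query strings are hypothetical and are not part of the scan data. Pattern length is used here as a simple stand-in for specificity.

```python
import re

def pattern_to_regex(path_pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = path_pattern.endswith("$")
    body = path_pattern[:-1] if anchored else path_pattern
    # Escape literal pieces and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))

def is_allowed(rules, path: str) -> bool:
    """Return True if `path` may be fetched under `rules`.

    Assumption: longest matching pattern wins, Allow wins ties,
    and a path matched by no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow

# Rules transcribed from the googlebot group in this report.
GOOGLEBOT_RULES = [
    ("disallow", "/search?*&page="),
    *[("allow", f"/search?*&page={n}$") for n in range(1, 10)],
    ("disallow", "/view/summary*"),
]

# Hypothetical request paths, used only for illustration.
print(is_allowed(GOOGLEBOT_RULES, "/search?q=news&page=3"))   # allowed: page 1-9
print(is_allowed(GOOGLEBOT_RULES, "/search?q=news&page=10"))  # blocked: only the Disallow matches
print(is_allowed(GOOGLEBOT_RULES, "/view/summary/123"))       # blocked: summary pages
```

Under these assumptions, the nine Allow lines open up result pages 1 through 9 only: a path ending in `page=10` fails every `$`-anchored Allow pattern and falls back to the broader Disallow.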

yeti
daum
bingbot
msnbot
zumbot
facebookexternalhit

Rule Path
Allow /
Disallow /view/summary*

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://news.nate.com/sitemap?data=index