mediakhan.com
robots.txt

Robots Exclusion Standard data for mediakhan.com

Resource Scan

Scan Details

Site Domain mediakhan.com
Base Domain mediakhan.com
Scan Status Ok
Last Scan2024-11-09T04:14:39+00:00
Next Scan 2024-11-16T04:14:39+00:00

Last Scan

Scanned2024-11-09T04:14:39+00:00
URL http://www.mediakhan.com/robots.txt
Domain IPs 220.230.126.51, 49.50.169.147
Response IP 49.50.169.147
Found Yes
Hash 84021250c437566c63393a638d16956603590191c6d7a009675d9a9eb143fee0
SimHash 781cb8a0eca3

Groups

ahrefsbot
semrushbot
claudebot
gptbot
chatgpt-user
google-extended

Rule Path
Disallow /

*

Rule Path
Disallow /login
Disallow /SecListData.html
Disallow /national/national-general/article/201709021732001
Disallow /national/national-general/article/201609241422011
Disallow /article/201709021732001

Other Records

Field Value
sitemap https://www.khan.co.kr/sitemap.xml
sitemap https://www.khan.co.kr/sitemap/latest-articles.xml
sitemap https://www.khan.co.kr/sitemap/daily-images.xml
sitemap https://www.khan.co.kr/sitemap/authors.xml

Comments

  • DaumWebMasterTool:35bc55071cd98ae09fdf97e6c96768225c02c9f6ab5b8fe3c1118aefe5139d74:QyRvaXUU8jb2tE+VnsReWg==