mediakhan.com
robots.txt

Robots Exclusion Standard data for mediakhan.com

Resource Scan

Scan Details

Site Domain	mediakhan.com
Base Domain	mediakhan.com
Scan Status	Ok
Last Scan	2024-11-09T04:14:39+00:00
Next Scan	2024-11-16T04:14:39+00:00

Last Scan

Scanned	2024-11-09T04:14:39+00:00
URL	http://www.mediakhan.com/robots.txt
Domain IPs	220.230.126.51, 49.50.169.147
Response IP	49.50.169.147
Found	Yes
Hash	84021250c437566c63393a638d16956603590191c6d7a009675d9a9eb143fee0
SimHash	781cb8a0eca3

Groups

ahrefsbot
semrushbot
claudebot
gptbot
chatgpt-user
google-extended

Rule	Path
Disallow	/

*

Rule	Path
Disallow	/login
Disallow	/SecListData.html
Disallow	/national/national-general/article/201709021732001
Disallow	/national/national-general/article/201609241422011
Disallow	/article/201709021732001
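
The two groups above can be checked with Python's standard urllib.robotparser. This is a minimal sketch: the robots.txt text below is reconstructed from the scan tables, so its exact layout (ordering, blank lines, casing) is an assumption, and the helper name allowed and the sample crawler name "Googlebot" are illustrative only.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this scan. The layout of the
# live file is an assumption; only the agents and paths come from the report.
ROBOTS_TXT = """\
User-agent: ahrefsbot
User-agent: semrushbot
User-agent: claudebot
User-agent: gptbot
User-agent: chatgpt-user
User-agent: google-extended
Disallow: /

User-agent: *
Disallow: /login
Disallow: /SecListData.html
Disallow: /national/national-general/article/201709021732001
Disallow: /national/national-general/article/201609241422011
Disallow: /article/201709021732001
"""

BASE = "http://www.mediakhan.com"

# Parse the text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above (hypothetical helper)."""
    return parser.can_fetch(agent, BASE + path)


# Agents named in the first group are blocked from the whole site.
print(allowed("GPTBot", "/"))
# Any other agent falls through to the "*" group and is blocked only
# on the listed paths.
print(allowed("Googlebot", "/"))
print(allowed("Googlebot", "/login"))
```

Note that urllib.robotparser matches rules by simple path prefix, so a rule such as Disallow /login also covers longer paths that begin with /login.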

Other Records

Field	Value
sitemap	https://www.khan.co.kr/sitemap.xml
sitemap	https://www.khan.co.kr/sitemap/latest-articles.xml
sitemap	https://www.khan.co.kr/sitemap/daily-images.xml
sitemap	https://www.khan.co.kr/sitemap/authors.xml
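
The sitemap records above can be read back with the same standard-library parser. This is a sketch under stated assumptions: the "Sitemap:" line form is reconstructed from the table, and RobotFileParser.site_maps() requires Python 3.8 or later. It also makes visible that all four sitemaps are hosted on www.khan.co.kr rather than on mediakhan.com.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

# Sitemap records as listed in the scan; the "Sitemap:" prefix is the
# usual robots.txt form and is assumed here.
SITEMAP_LINES = [
    "Sitemap: https://www.khan.co.kr/sitemap.xml",
    "Sitemap: https://www.khan.co.kr/sitemap/latest-articles.xml",
    "Sitemap: https://www.khan.co.kr/sitemap/daily-images.xml",
    "Sitemap: https://www.khan.co.kr/sitemap/authors.xml",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns the declared sitemap URLs, or None if there are none.
sitemaps = parser.site_maps() or []

# Collect the distinct hosts the sitemaps point to.
hosts = {urlparse(url).netloc for url in sitemaps}

for url in sitemaps:
    print(url)
print(hosts)
```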

Comments

DaumWebMasterTool:35bc55071cd98ae09fdf97e6c96768225c02c9f6ab5b8fe3c1118aefe5139d74:QyRvaXUU8jb2tE+VnsReWg==
