mediakhan.com
robots.txt

Robots Exclusion Standard data for mediakhan.com

Resource Scan

Scan Details

Site Domain mediakhan.com
Base Domain mediakhan.com
Scan Status Ok
Last Scan2024-05-17T17:26:45+00:00
Next Scan 2024-05-24T17:26:45+00:00

Last Scan

Scanned2024-05-17T17:26:45+00:00
URL http://www.mediakhan.com/robots.txt
Domain IPs 220.230.126.51, 49.50.169.147
Response IP 49.50.169.147
Found Yes
Hash bdff8d080faf6ffa97af560fbd57d1679b625788a7c07c3fb6a80347e95e211a
SimHash 781dbaa0ec72

Groups

ahrefsbot
semrushbot
claudebot

Rule Path
Disallow /

*

Rule Path
Disallow /login
Disallow /SecListData.html
Disallow /national/national-general/article/201709021732001
Disallow /national/national-general/article/201609241422011
Disallow /article/201709021732001

Other Records

Field Value
sitemap https://www.khan.co.kr/sitemap.xml
sitemap https://www.khan.co.kr/latest-articles.xml
sitemap https://www.khan.co.kr/daily-articles.xml
sitemap https://www.khan.co.kr/daily-images.xml
sitemap https://www.khan.co.kr/authors.xml

Comments

  • DaumWebMasterTool:35bc55071cd98ae09fdf97e6c96768225c02c9f6ab5b8fe3c1118aefe5139d74:0E6OJlJ9yrgU7KPu0EEXQg==