udn.news
robots.txt

Robots Exclusion Standard data for udn.news

Resource Scan

Scan Details

Site Domain udn.news
Base Domain udn.news
Scan Status Ok
Last Scan 2024-05-17T00:23:13+00:00
Next Scan 2024-05-24T00:23:13+00:00

Last Scan

Scanned 2024-05-17T00:23:13+00:00
URL http://udn.news/robots.txt
Redirect https://udn.com/robots.txt
Redirect Domain udn.com
Redirect Base udn.com
Domain IPs 210.243.166.144
Redirect IPs 23.210.100.170
Response IP 23.202.129.93
Found Yes
Hash f65b605689f5131f363ec79f2fdee6c07133c4bb735888b2ff222dd9e8c3a30d
SimHash 2328db70a793

Groups

*

Rule Path
Disallow /BT/*
Disallow /fcm/*
Disallow /NEWS/*
Disallow /NEWS/BREAKINGNEWS/*
Disallow /Project/*
Disallow /UDN/FOUNDER/*
Disallow /UDN/UDNENGLISH/*
Disallow /morakot/*
Allow /page/topic/184
Allow /page/topic/495
Allow /page/topic/496
Disallow /page/topic/*
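
The Allow and Disallow rules in the * group overlap for /page/topic/ paths, so the outcome depends on how a crawler resolves conflicts. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) and treats "*" as a wildcard. The names RULES, pattern_to_regex, and is_allowed are illustrative and do not come from the report, and individual crawlers may resolve conflicts differently.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("disallow", "/BT/*"),
    ("disallow", "/fcm/*"),
    ("disallow", "/NEWS/*"),
    ("disallow", "/NEWS/BREAKINGNEWS/*"),
    ("disallow", "/Project/*"),
    ("disallow", "/UDN/FOUNDER/*"),
    ("disallow", "/UDN/UDNENGLISH/*"),
    ("disallow", "/morakot/*"),
    ("allow", "/page/topic/184"),
    ("allow", "/page/topic/495"),
    ("allow", "/page/topic/496"),
    ("disallow", "/page/topic/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under the given rules."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as path prefixes.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and is_allow):
                best_len = length
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for path in ("/page/topic/184", "/page/topic/200", "/NEWS/BREAKINGNEWS/1"):
        print(path, is_allowed(path))
```

Under these assumptions, /page/topic/184 is fetchable because its Allow pattern is longer than /page/topic/*, while any other topic page is blocked. Matching is case-sensitive, so /NEWS/ rules do not apply to a lowercase /news/ path.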

gptbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /
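
Each group above applies only to the crawler it names, and the * group is the fallback for everyone else. A minimal sketch of that selection step, assuming case-insensitive exact matching on the crawler's product token as RFC 9309 describes; GROUP_NAMES and select_group are hypothetical names, and real crawlers may match more loosely.

```python
# Group names as listed in the report.
GROUP_NAMES = ["*", "gptbot", "applebot", "amazonbot", "trendictionbot"]


def select_group(product_token, group_names=GROUP_NAMES):
    """Pick the group a crawler should obey, falling back to "*"."""
    token = product_token.lower()
    for name in group_names:
        if name != "*" and name == token:
            return name
    return "*"


if __name__ == "__main__":
    for agent in ("GPTBot", "Applebot", "Googlebot"):
        print(agent, "->", select_group(agent))
```

With the groups above, GPTBot, Applebot, Amazonbot, and trendictionbot each land in a group that disallows the whole site, while any other crawler falls through to the * rules.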

Other Records

Field Value
sitemap https://udn.com/sitemapxml/news/mapindex.xml
sitemap https://udn.com/sitemap/gnews/2
sitemap https://udn.com/sitemap/gnews/1013
sitemap https://udn.com/sitemap/gnews/1015
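
Sitemap records are independent of the user-agent groups and apply to every crawler. The sketch below shows one way such records could be pulled out of raw robots.txt text. The SAMPLE string is a hypothetical fragment written for illustration, not the actual file contents, and extract_sitemaps is an illustrative helper name.

```python
# Hypothetical fragment for illustration; the real file is not reproduced here.
SAMPLE = """\
User-agent: *
Disallow: /BT/*
# another bot
Sitemap: https://udn.com/sitemapxml/news/mapindex.xml
sitemap: https://udn.com/sitemap/gnews/2
"""


def extract_sitemaps(robots_txt):
    """Return the values of all sitemap records, in file order."""
    sitemaps = []
    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        # Split on the first colon only, so the URL's own colon is kept.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            sitemaps.append(value.strip())
    return sitemaps


if __name__ == "__main__":
    for url in extract_sitemaps(SAMPLE):
        print(url)
```

The field name is compared case-insensitively, which is why both spellings in the sample are picked up.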

Comments

  • robots.txt
  • Disallow: /.well-known/amphtml/apikey.pub
  • chat bot
  • another bot