udn.com
robots.txt

Robots Exclusion Standard data for udn.com

Resource Scan

Scan Details

Site Domain udn.com
Base Domain udn.com
Scan Status Ok
Last Scan 2024-04-20T19:14:31+00:00
Next Scan 2024-04-27T19:14:31+00:00

Last Scan

Scanned 2024-04-20T19:14:31+00:00
URL https://udn.com/robots.txt
Domain IPs 23.210.100.170
Response IP 23.210.100.170
Found Yes
Hash fd166620eb6e55def791bc182c92f2761831646d892371b2d68055590744082b
SimHash 0328dd703791

Groups

*

Rule Path
Disallow /BT/*
Disallow /fcm/*
Disallow /NEWS/*
Disallow /NEWS/BREAKINGNEWS/*
Disallow /Project/*
Disallow /UDN/FOUNDER/*
Disallow /UDN/UDNENGLISH/*
Disallow /morakot/*
Allow /page/topic/184
Allow /page/topic/495
Allow /page/topic/496
Disallow /page/topic/*
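
The Allow and Disallow entries in the * group overlap for /page/topic/, so the outcome depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the crawler applies the longest-match rule described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The names RULES, _pattern_to_regex, and is_allowed are hypothetical helpers for illustration, not part of the scan output.

```python
import re

# Rules copied from the "*" group listed above.
RULES = [
    ("disallow", "/BT/*"),
    ("disallow", "/fcm/*"),
    ("disallow", "/NEWS/*"),
    ("disallow", "/NEWS/BREAKINGNEWS/*"),
    ("disallow", "/Project/*"),
    ("disallow", "/UDN/FOUNDER/*"),
    ("disallow", "/UDN/UDNENGLISH/*"),
    ("disallow", "/morakot/*"),
    ("allow", "/page/topic/184"),
    ("allow", "/page/topic/495"),
    ("allow", "/page/topic/496"),
    ("disallow", "/page/topic/*"),
]


def _pattern_to_regex(path):
    """Translate a robots.txt path into a regex anchored at the start.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be fetched under the longest-match rule."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, path in rules:
        if _pattern_to_regex(path).match(url_path):
            length = len(path)
            # Longer pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


print(is_allowed("/page/topic/184"))  # matched by both Allow and Disallow
print(is_allowed("/page/topic/200"))  # matched only by Disallow
```

Under this assumption, the three Allow entries carve exceptions out of the broader /page/topic/* block because their paths are longer. Matching is case-sensitive, so /NEWS/* does not cover a lowercase /news/ path.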

gptbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /
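
The named groups above apply only to crawlers that identify with the matching product token; every other crawler falls back to the * group. A minimal sketch of that selection step, assuming case-insensitive token matching as described in RFC 9309. GROUPS and select_group are hypothetical names used for illustration, and the dictionary values are only short summaries of each group.

```python
# Group names as listed in the scan; values summarize each group's effect.
GROUPS = {
    "*": "path-specific rules",
    "gptbot": "disallow /",
    "applebot": "disallow /",
    "amazonbot": "disallow /",
}


def select_group(product_token, groups=GROUPS):
    """Pick the group for a crawler's product token, else fall back to "*"."""
    token = product_token.strip().lower()
    return token if token in groups else "*"


for bot in ("GPTBot", "Applebot", "Googlebot"):
    name = select_group(bot)
    print(bot, "->", name, "->", GROUPS[name])
```

With this reading, GPTBot, Applebot, and Amazonbot are barred from the whole site, while a crawler without its own group (Googlebot in this example) is subject only to the path rules in the * group.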

Other Records

Field Value
sitemap https://udn.com/sitemapxml/news/mapindex.xml
sitemap https://udn.com/sitemap/gnews/2
sitemap https://udn.com/sitemap/gnews/1013
sitemap https://udn.com/sitemap/gnews/1015
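
Sitemap records sit outside any user-agent group and apply to all crawlers. The sketch below shows one way such records could be pulled from raw robots.txt text. SAMPLE is an assumed reconstruction containing only the four sitemap lines reported above (the original file's exact formatting is not shown in this scan), and extract_sitemaps is a hypothetical helper.

```python
# Assumed reconstruction of the sitemap lines; not the verbatim file.
SAMPLE = """\
Sitemap: https://udn.com/sitemapxml/news/mapindex.xml
Sitemap: https://udn.com/sitemap/gnews/2
Sitemap: https://udn.com/sitemap/gnews/1013
Sitemap: https://udn.com/sitemap/gnews/1015
"""


def extract_sitemaps(text):
    """Collect the values of all "sitemap" fields, ignoring comments."""
    urls = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


for url in extract_sitemaps(SAMPLE):
    print(url)
```

The field name is compared case-insensitively, and only the first colon separates field from value, so the colon inside https:// stays part of the URL.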

Comments

  • robots.txt
  • Disallow: /.well-known/amphtml/apikey.pub
  • chatbot