reading.udn.com
robots.txt

Robots Exclusion Standard data for reading.udn.com

Resource Scan

Scan Details

Site Domain reading.udn.com
Base Domain udn.com
Scan Status Ok
Last Scan 2024-05-11T17:30:01+00:00
Next Scan 2024-06-10T17:30:01+00:00

Last Scan

Scanned 2024-05-11T17:30:01+00:00
URL https://reading.udn.com/robots.txt
Domain IPs 210.243.166.93
Response IP 210.243.166.93
Found Yes
Hash 27869d2ab050152845bc4249ff6c73356607cd05db51cefd5be9f1a31341ab7d
SimHash af3c65a1ec02
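The report does not say which algorithm produced the Hash value. Its 64 hexadecimal digits are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function name `robots_fingerprint` is hypothetical, and the sample body is not the real file. Comparing fingerprints between scans is one simple way to detect that a robots.txt has changed.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Illustrative input only; this is not the content of the scanned file.
sample_body = b"User-agent: *\nDisallow: /pod/*\n"
digest = robots_fingerprint(sample_body)
print(len(digest))  # a SHA-256 hex digest always has 64 characters
```

A digest computed this way would match the reported Hash only if the assumed algorithm and byte-for-byte input are the same as the scanner's.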

Groups

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 200
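Crawl-delay is not part of RFC 9309, and the report gives no unit for the value. Crawlers that honor the field conventionally read it as seconds between requests. Under that assumption, the sketch below shows what a delay of 200 implies for a polite crawler; the function names are hypothetical.

```python
# Value taken from the record above; treating it as seconds is an assumption.
CRAWL_DELAY_SECONDS = 200

SECONDS_PER_DAY = 24 * 60 * 60


def max_requests_per_day(delay_seconds: int) -> int:
    """Upper bound on requests per day when waiting delay_seconds between fetches."""
    return SECONDS_PER_DAY // delay_seconds


def fetch_offsets(count: int, delay_seconds: int) -> list[int]:
    """Start times, in seconds from the first request, for `count` spaced fetches."""
    return [i * delay_seconds for i in range(count)]


print(max_requests_per_day(CRAWL_DELAY_SECONDS))
print(fetch_offsets(3, CRAWL_DELAY_SECONDS))
```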

*

Rule Path
Disallow /reading/*

*

Rule Path
Disallow /library/*

*

Rule Path
Disallow /pod/*

*

Rule Path
Disallow /store/store/store_search.do

gigabot

Rule Path
Disallow /

liebaofast

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

mauibot (crawler.feedback+wc@gmail.com)

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

mqqbrowser

Rule Path
Disallow /

nimbostratus-bot/v1.3.2

Rule Path
Disallow /

qwant-news

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sputnikbot/2.3

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

tinytestbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

ucbrowser

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /
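The groups above can be read as a small matcher. The sketch below is not the scanner's implementation. It assumes RFC 9309 semantics: the repeated `*` groups are merged, `*` in a path matches any run of characters, and a rule matches as a prefix. Python's standard `urllib.robotparser` is deliberately not used, because it does not interpret `*` inside paths. The function names, and the choice to list only a few of the blocked agents, are illustrative.

```python
import re

# Disallow paths transcribed from the "*" groups in this report.
WILDCARD_DISALLOWS = [
    "/reading/*",
    "/library/*",
    "/pod/*",
    "/store/store/store_search.do",
]

# A subset of the agents listed above with "Disallow /" (not the full list).
BLOCKED_AGENTS = {"gptbot", "semrushbot", "yandexbot", "petalbot", "applebot", "amazonbot"}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts and let each "*" match any sequence of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules listed above."""
    if agent.lower() in BLOCKED_AGENTS:
        return False  # these agents are disallowed from "/", i.e. everything
    # re.match anchors at the start, which gives the prefix semantics.
    return not any(pattern_to_regex(p).match(path) for p in WILDCARD_DISALLOWS)


print(is_allowed("examplebot", "/reading/book/1"))
print(is_allowed("examplebot", "/"))
print(is_allowed("GPTBot", "/"))
```

Agent matching here is a plain case-insensitive lookup of the product token. A fuller implementation would also handle `Allow` rules and longest-match precedence, neither of which this file needs.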

Other Records

Field Value
sitemap https://reading.udn.com/sitemap/gnews/1014
sitemap https://reading.udn.com/sitemapxml/read/mapindex.xml

Comments

  • go away
  • siteamp
  • RDH, 06.30.21: Very temporary to get some relief.
  • User-Agent: Googlebot
  • Disallow: /
  • chat bot
  • another bot