rcn.com.co
robots.txt

Robots Exclusion Standard data for rcn.com.co

Resource Scan

Scan Details

Site Domain rcn.com.co
Base Domain rcn.com.co
Scan Status Ok
Last Scan2024-09-23T22:04:55+00:00
Next Scan 2024-09-30T22:04:55+00:00

Last Scan

Scanned2024-09-23T22:04:55+00:00
URL https://rcn.com.co/robots.txt
Redirect https://www.rcnradio.com/robots.txt
Redirect Domain www.rcnradio.com
Redirect Base rcnradio.com
Domain IPs 54.192.18.126, 54.192.18.56, 54.192.18.78, 54.192.18.99
Redirect IPs 108.156.133.101, 108.156.133.104, 108.156.133.2, 108.156.133.40
Response IP 108.156.133.40
Found Yes
Hash 0313cbc6b50a77d2e8c2111215bb7664fe7bee887425a60c8b15b55bd4bfddf8
SimHash 505c2b04b812

Groups

*

Rule Path
Disallow /mcontent/*
Disallow /amp/
Disallow /amp/*
Disallow /*?*edicion=
Disallow /profiles/
Disallow /README.txt
Disallow /web.config
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips/
Disallow /node/add/
Disallow /user/*
Disallow /user/register/
Disallow /user/password/
Disallow /user/login/
Disallow /user/logout/
Disallow /index.php/admin/
Disallow /index.php/comment/reply/
Disallow /index.php/filter/tips/
Disallow /index.php/node/add/
Disallow /index.php/search/
Disallow /index.php/user/password/
Disallow /index.php/user/register/
Disallow /index.php/user/login/
Disallow /index.php/user/logout/
Disallow /wp-admin
Disallow /wp-admin*
Disallow /wp-admin/*
Disallow /search/node?keys=*

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

seekport

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.rcnradio.com/sitemap.xml
sitemap https://www.rcnradio.com/post-sitemap-index.xml
sitemap https://www.rcnradio.com/google-news-sitemap.xml
sitemap https://www.rcnradio.com/articles-currents.xml

Comments

  • Sitemap:
  • Paths (clean URLs)
  • Agentes AI