rcn.com.co
robots.txt

Robots Exclusion Standard data for rcn.com.co

Resource Scan

Scan Details

Site Domain rcn.com.co
Base Domain rcn.com.co
Scan Status Ok
Last Scan2024-11-11T22:08:51+00:00
Next Scan 2024-11-18T22:08:51+00:00

Last Scan

Scanned2024-11-11T22:08:51+00:00
URL https://rcn.com.co/robots.txt
Redirect https://www.rcnradio.com/robots.txt
Redirect Domain www.rcnradio.com
Redirect Base rcnradio.com
Domain IPs 13.225.4.103, 13.225.4.31, 13.225.4.61, 13.225.4.73
Redirect IPs 108.156.133.101, 108.156.133.104, 108.156.133.2, 108.156.133.40
Response IP 108.156.133.2
Found Yes
Hash 223996650192247095c72ba916bffa022c1c9a6c3963f93f4504e03a78f6f48e
SimHash 505cab04b812

Groups

*

Rule Path
Disallow /mcontent/*
Disallow /amp/
Disallow /amp/*
Disallow /*?*edicion=
Disallow /profiles/
Disallow /README.txt
Disallow /web.config
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips/
Disallow /node/add/
Disallow /user/*
Disallow /user/register/
Disallow /user/password/
Disallow /user/login/
Disallow /user/logout/
Disallow /index.php/admin/
Disallow /index.php/comment/reply/
Disallow /index.php/filter/tips/
Disallow /index.php/node/add/
Disallow /index.php/search/
Disallow /index.php/user/password/
Disallow /index.php/user/register/
Disallow /index.php/user/login/
Disallow /index.php/user/logout/
Disallow /wp-admin
Disallow /wp-admin*
Disallow /wp-admin/*
Disallow /search/node?keys=*

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

seekport

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.rcnradio.com/sitemap.xml
sitemap https://www.rcnradio.com/google-news-sitemap.xml

Comments

  • Sitemap:
  • Paths (clean URLs)
  • Agentes AI