colombia.co
robots.txt

Robots Exclusion Standard data for colombia.co

Resource Scan

Scan Details

Site Domain colombia.co
Base Domain colombia.co
Scan Status Ok
Last Scan2024-05-08T12:33:01+00:00
Next Scan 2024-06-07T12:33:01+00:00

Last Scan

Scanned2024-05-08T12:33:01+00:00
URL https://colombia.co/robots.txt
Redirect https://www.colombia.co/robots.txt
Redirect Domain www.colombia.co
Redirect Base colombia.co
Domain IPs 104.26.8.6, 104.26.9.6, 172.67.69.54, 2606:4700:20::681a:806, 2606:4700:20::681a:906, 2606:4700:20::ac43:4536
Redirect IPs 104.26.8.6, 104.26.9.6, 172.67.69.54, 2606:4700:20::681a:806, 2606:4700:20::681a:906, 2606:4700:20::ac43:4536
Response IP 172.67.69.54
Found Yes
Hash 95845db29af7a187e8a4139ee6dfb789bc2c359aeaffe51acf268910f1aa46e3
SimHash 3c961d090564

Groups

*

Rule Path
Disallow /wp-content/uploads/*.pdf
Disallow /wp-content/uploads/*.html
Disallow /wp-content/uploads/*.mp4
Disallow /wp-content/uploads/*.xls
Disallow /wp-content/uploads/*.xlsx
Disallow /wp-content/uploads/*.doc
Disallow /wp-content/uploads/*.docx
Disallow /wp-content/uploads/*.ppt
Disallow /wp-content/uploads/*.pptx
Disallow /wp-content/uploads/*.exe
Disallow /wp-content/uploads/*.swf
Disallow /wp-login
Disallow /wp-admin
Disallow /*/feed/
Disallow /*/trackback/
Disallow /*/attachment/
Disallow /author/
Disallow /*/page/
Disallow /*/feed/
Disallow /page/
Disallow /comments/
Disallow /xmlrpc.php
Disallow /*?s=
Disallow /*/*/*/feed.xml
Disallow /?attachment_id*
Disallow /tp_eventos/
Disallow /home-slide/
Disallow /homebox/
Disallow /faq/
Disallow /download/
Disallow /tp_lugares_unicos/
Disallow /banners/
Disallow en/*/feed/
Disallow en/feed/*
Disallow /cdn-cgi/
Disallow /*/amp$
Disallow /*https%3A//qa.colombia.co/
Disallow /*https%3A//qa-4.colombia.co/
Disallow /*https%3A//qa-6.colombia.co/

Comments

  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/robotstxt.html
  • For syntax checking, see:
  • http://www.frobee.com/robots-txt-check
  • Multimedia
  • url nuevas

Warnings

  • 1 invalid line.