Robots Exclusion Standard data for infobit.com.ar

Resource Scan

Scan Details

Site Domain infobit.com.ar
Base Domain infobit.com.ar
Scan Status Ok
Last Scan 2025-06-25T18:32:11+00:00
Next Scan 2025-07-25T18:32:11+00:00

Last Scan

Scanned 2025-06-25T18:32:11+00:00
URL https://infobit.com.ar/robots.txt
Response IP 66.70.158.197
Found Yes
Hash 737f946a29a7587c3e855654cd2fb75513cc2ab1fc2b335c63d8c1aea6dd8a6d
SimHash 208e7fdd590b
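
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the scan does not name the algorithm or say which bytes were hashed. A minimal sketch for comparing a freshly downloaded copy against the recorded value, assuming SHA-256 over the raw response body (the function names are hypothetical):

```python
import hashlib

# Hash value recorded by the scan above.
RECORDED_HASH = "737f946a29a7587c3e855654cd2fb75513cc2ab1fc2b335c63d8c1aea6dd8a6d"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value.

    Assumption: the scanner used SHA-256 on the unmodified response body.
    If it normalised line endings or used another algorithm, this check
    will report a mismatch even for an unchanged file.
    """
    return sha256_hex(body) == recorded.lower()
```

To use it, pass the bytes of a downloaded robots.txt to `matches_recorded`. A mismatch may simply mean the assumption about the algorithm is wrong, not that the file changed.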

Groups

googlebot
infonavirobot
tv33_mercator
avsearch
mercator
scooter
slurp
searchenginelicencesheep
shadow
multitext
fast-webcrawler
lycos_spider
atomz
htdig
spider00.logika.net
netmechanic
libwww-perl
teleport pro

Rule Path
Disallow /searchtools-rss.xml

*

Rule Path
Disallow /%21OLD/
Disallow /admin/
Disallow /classes/
Disallow /cms-panel/
Disallow /demos/
Disallow /download/
Disallow /ga/
Disallow /include/
Disallow /inscripcion/
Disallow /logos_dineromail/
Disallow /pago_con_tarjeta_credito/
Disallow /paypal/
Disallow /sitemap/
Disallow /speedtest/
Disallow /upload/
Disallow /vistas/
Disallow /ws/
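
The two groups above can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt from the listed agents and paths (the exact layout of the live file is an assumption) and asks which URLs each kind of crawler may fetch. `examplebot` is a made-up agent name standing in for any crawler not named in the first group.

```python
from urllib.robotparser import RobotFileParser

# User-agents of the first group, copied from the listing above.
NAMED_AGENTS = [
    "googlebot", "infonavirobot", "tv33_mercator", "avsearch", "mercator",
    "scooter", "slurp", "searchenginelicencesheep", "shadow", "multitext",
    "fast-webcrawler", "lycos_spider", "atomz", "htdig",
    "spider00.logika.net", "netmechanic", "libwww-perl", "teleport pro",
]

# Paths disallowed in the "*" group.
WILDCARD_PATHS = [
    "/%21OLD/", "/admin/", "/classes/", "/cms-panel/", "/demos/",
    "/download/", "/ga/", "/include/", "/inscripcion/", "/logos_dineromail/",
    "/pago_con_tarjeta_credito/", "/paypal/", "/sitemap/", "/speedtest/",
    "/upload/", "/vistas/", "/ws/",
]


def build_robots_txt() -> str:
    """Reassemble a robots.txt equivalent to the two groups (layout assumed)."""
    lines = [f"User-agent: {agent}" for agent in NAMED_AGENTS]
    lines.append("Disallow: /searchtools-rss.xml")
    lines.append("")
    lines.append("User-agent: *")
    lines += [f"Disallow: {path}" for path in WILDCARD_PATHS]
    return "\n".join(lines) + "\n"


def load_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = load_parser(build_robots_txt())
BASE = "https://infobit.com.ar"

# A named agent is matched by the first group only.
feed_for_googlebot = parser.can_fetch("googlebot", BASE + "/searchtools-rss.xml")
admin_for_googlebot = parser.can_fetch("googlebot", BASE + "/admin/")

# Any other agent falls through to the "*" group.
feed_for_other = parser.can_fetch("examplebot", BASE + "/searchtools-rss.xml")
admin_for_other = parser.can_fetch("examplebot", BASE + "/admin/")

print(feed_for_googlebot, admin_for_googlebot, feed_for_other, admin_for_other)
```

Under this parser a crawler named in the first group is governed by that group alone, so the "*" paths such as /admin/ are not blocked for it. Real crawlers may interpret the file differently.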

Comments

  • don't let search engines see the RSS feed, it's just confusing.
  • updated 2010-10-09 (disallow rtestprob links)
  • updated 2010-10-09 (disallow info/slides links, info/robots/)
  • updated 2010-10-09 (disallow /searchtools/ which is an alias)
  • updated 2010-10-09 (rearranged as per Enrico's advice)