actc.org.ar
robots.txt

Robots Exclusion Standard data for actc.org.ar

Resource Scan

Scan Details

Site Domain actc.org.ar
Base Domain actc.org.ar
Scan Status Ok
Last Scan2025-05-17T01:38:44+00:00
Next Scan 2025-06-16T01:38:44+00:00

Last Scan

Scanned2025-05-17T01:38:44+00:00
URL https://actc.org.ar/robots.txt
Domain IPs 66.70.158.205
Response IP 66.70.158.205
Found Yes
Hash 39283d8f0164eb448d6f93077881823740b7edd805cc8b0b4b34e430bf74bd08
SimHash ac8f7ed55951

Groups

googlebot
infonavirobot
tv33_mercator
avsearch
mercator
scooter
slurp
searchenginelicencesheep
shadow
multitext
fast-webcrawler
lycos_spider
atomz
htdig
spider00.logika.net
netmechanic
libwww-perl
teleport pro

Rule Path
Disallow /searchtools-rss.xml

*

Rule Path
Disallow /admin/
Disallow /acredita/
Disallow /db/
Disallow /migracion/
Disallow /classes/
Disallow /include/
Disallow /sitemap/
Disallow /modules/login/
Disallow /vistas/

Comments

  • don't let search engines see the RSS feed, it's just confusing.
  • updated 2019-05-05 (disallow rtestprob links)
  • updated 2019-05-05 (disallow info/slides links, info/robots/)
  • updated 2019-05-05 (disallow /searchtools/ which is an alias)
  • updated 2019-05-05 (rearranged as per Enrico's advice)