actc.org.ar
robots.txt

Robots Exclusion Standard data for actc.org.ar

Resource Scan

Scan Details

Site Domain	actc.org.ar
Base Domain	actc.org.ar
Scan Status	Ok
Last Scan	2025-05-17T01:38:44+00:00
Next Scan	2025-06-16T01:38:44+00:00

Last Scan

Scanned	2025-05-17T01:38:44+00:00
URL	https://actc.org.ar/robots.txt
Domain IPs	66.70.158.205
Response IP	66.70.158.205
Found	Yes
Hash	39283d8f0164eb448d6f93077881823740b7edd805cc8b0b4b34e430bf74bd08
SimHash	ac8f7ed55951

Groups

googlebot
infonavirobot
tv33_mercator
avsearch
mercator
scooter
slurp
searchenginelicencesheep
shadow
multitext
fast-webcrawler
lycos_spider
atomz
htdig
spider00.logika.net
netmechanic
libwww-perl
teleport pro

Rule	Path
Disallow	/searchtools-rss.xml

*

Rule	Path
Disallow	/admin/
Disallow	/acredita/
Disallow	/db/
Disallow	/migracion/
Disallow	/classes/
Disallow	/include/
Disallow	/sitemap/
Disallow	/modules/login/
Disallow	/vistas/
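
As a rough illustration of how the two groups above interact, the sketch below rebuilds an approximate robots.txt from the listed agents and rules and queries it with Python's standard `urllib.robotparser`. The rebuilt file is an assumption: the report lists groups and rules but not the raw file, so line order and spacing are guesses. The helper `allowed` and the agent name `examplebot` are hypothetical, introduced only for this example.

```python
from urllib.robotparser import RobotFileParser

# Agents that share the first group in the scan report.
NAMED_AGENTS = [
    "googlebot", "infonavirobot", "tv33_mercator", "avsearch", "mercator",
    "scooter", "slurp", "searchenginelicencesheep", "shadow", "multitext",
    "fast-webcrawler", "lycos_spider", "atomz", "htdig",
    "spider00.logika.net", "netmechanic", "libwww-perl", "teleport pro",
]

# Paths disallowed for the wildcard (*) group in the scan report.
WILDCARD_DISALLOWS = [
    "/admin/", "/acredita/", "/db/", "/migracion/", "/classes/",
    "/include/", "/sitemap/", "/modules/login/", "/vistas/",
]

# Assumed reconstruction of the file; the real robots.txt may differ in layout.
lines = [f"User-agent: {agent}" for agent in NAMED_AGENTS]
lines.append("Disallow: /searchtools-rss.xml")
lines.append("")  # a blank line separates the two groups
lines.append("User-agent: *")
lines += [f"Disallow: {path}" for path in WILDCARD_DISALLOWS]

parser = RobotFileParser()
parser.parse(lines)


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on actc.org.ar?"""
    return parser.can_fetch(agent, "https://actc.org.ar" + path)


if __name__ == "__main__":
    for agent in ("googlebot", "examplebot"):
        for path in ("/searchtools-rss.xml", "/admin/"):
            print(agent, path, allowed(agent, path))
```

Under this parser a crawler that matches a named group is checked against that group only, so in this reconstruction `googlebot` is blocked from `/searchtools-rss.xml` but not from `/admin/`, while an unlisted agent falls through to the `*` group. Real crawlers may resolve groups differently, so treat the result as indicative only.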

Comments

don't let search engines see the RSS feed, it's just confusing.
updated 2019-05-05 (disallow rtestprob links)
updated 2019-05-05 (disallow info/slides links, info/robots/)
updated 2019-05-05 (disallow /searchtools/ which is an alias)
updated 2019-05-05 (rearranged as per Enrico's advice)
