sistemas.clacso.org
robots.txt
Robots Exclusion Standard data for sistemas.clacso.org
Resource Scan
Scan Details
Site Domain | sistemas.clacso.org |
Base Domain | clacso.org |
Scan Status | Ok |
Last Scan | 2025-08-19T21:52:14+00:00 |
Next Scan | 2025-09-18T21:52:14+00:00 |
Last Scan
Scanned | 2025-08-19T21:52:14+00:00 |
URL | https://sistemas.clacso.org/robots.txt |
Domain IPs | 190.7.60.20 |
Response IP | 190.7.60.20 |
Found | Yes |
Hash | 2795dc1f0f9aeda437bb53b7c91f514a7871bb2557755857fff9cc8c0743e588 |
SimHash | 6040779056f5 |
Groups
*
Rule | Path |
---|---|
Disallow | /inscripciones/becas/cuenta_CLACSO/ |
Disallow | /inscripciones/becas/cuenta_CLACSO/* |
Disallow | /*/attachment/ |
Disallow | xmlrpc.php |
Disallow | /*/*/*.pdf$ |
Disallow | /*/*/*/*.pdf$ |
Disallow | /*/*/*/*/*.pdf$ |
Disallow | /*? |
*
Rule | Path |
---|---|
Disallow | /?s= |
Disallow | /search |
libwww
Rule | Path |
---|---|
Disallow | / |
Disallow | /?s= |
Disallow | /search |
Disallow | /*? |
Disallow | /*.php$ |
Disallow | /*.js$ |
Disallow | /*.inc$ |
Disallow | /*.css$ |
Disallow | */feed/ |
Disallow | */trackback/ |
Disallow | /page/ |
Disallow | /tag/ |
Disallow | /category/ |
Disallow | /*.sql$ |
Disallow | /*.tgz$ |
Disallow | /*.gz$ |
Disallow | /*.tar$ |
Disallow | /*.svn$ |
Comments