repositorio.cuc.edu.co
robots.txt
Robots Exclusion Standard data for repositorio.cuc.edu.co
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | repositorio.cuc.edu.co |
Base Domain | cuc.edu.co |
Scan Status | Ok |
Last Scan | 2025-03-04T14:27:18+00:00 |
Next Scan | 2025-04-03T14:27:18+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-03-04T14:27:18+00:00 |
URL | https://repositorio.cuc.edu.co/robots.txt |
Domain IPs | 178.32.61.149 |
Response IP | 178.32.61.149 |
Found | Yes |
Hash | e994f93b3ef34d70bb075b37c5a7e122874f1422ec6d2fe0a91080272f00dd30 |
SimHash | af14d53fc1b1 |
Groups
*
Rule | Path |
---|---|
Disallow | /search |
Disallow | /admin/* |
Disallow | /processes |
Disallow | /submit |
Disallow | /workspaceitems |
Disallow | /profile |
Disallow | /workflowitems |
Disallow | /simple-search |
*
No rules defined. All paths allowed.
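As a rough illustration of how a crawler might interpret the Disallow paths listed for the `*` group, the sketch below treats each path as a prefix pattern in which `*` matches any run of characters and a trailing `$` anchors the end. The names `DISALLOW`, `pattern_to_regex`, and `is_allowed` are hypothetical, and the `/handle/...` path in the usage lines is an assumed example that does not come from the scan. This is a minimal matcher under those assumptions, not the logic the scanner or any particular crawler actually uses.

```python
import re

# Disallow paths copied from the "*" group in the scan report.
DISALLOW = [
    "/search",
    "/admin/*",
    "/processes",
    "/submit",
    "/workspaceitems",
    "/profile",
    "/workflowitems",
    "/simple-search",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Turn a robots.txt path pattern into a regex anchored at the start.

    "*" matches any sequence of characters; a trailing "$" anchors the end.
    Everything else is matched literally, as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_allowed(path: str) -> bool:
    """Return True when no Disallow rule matches the given URL path."""
    return not any(rule.match(path) for rule in RULES)


# Usage: the handle path below is an assumed example, not taken from the scan.
print(is_allowed("/search?query=test"))   # blocked by the /search prefix
print(is_allowed("/admin/registries"))    # blocked by /admin/*
print(is_allowed("/handle/11323/1"))      # no rule matches
```

Because the rules are prefixes, `/search` also covers `/search?query=...`, whereas `/admin/*` does not cover the bare path `/admin`, since the pattern requires the trailing slash.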
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
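The `crawl-delay` record is a non-standard directive that some crawlers read as a minimum number of seconds between requests. The sketch below shows one way a client could honor the reported value of 10. The `Throttle` class and its injectable `clock` and `sleep` parameters are hypothetical names introduced for illustration, and treating the value as seconds is an assumption.

```python
import time

# Value reported in the scan; interpreting it as seconds is an assumption.
CRAWL_DELAY_SECONDS = 10


class Throttle:
    """Enforce a minimum gap between consecutive requests."""

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self.clock = clock    # injectable so the logic can be tested without waiting
        self.sleep = sleep
        self.last = None      # time at which the previous request was released

    def wait(self):
        """Pause until `delay` seconds have passed since the previous call.

        Returns the number of seconds actually waited.
        """
        now = self.clock()
        pause = 0.0
        if self.last is not None:
            pause = max(0.0, self.delay - (now - self.last))
            if pause:
                self.sleep(pause)
        self.last = now + pause
        return pause


# Usage: call wait() before each request to the host.
throttle = Throttle(CRAWL_DELAY_SECONDS)
throttle.wait()  # the first call does not pause
```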
Other Records
Field | Value |
---|---|
sitemap | https://repositorio.cuc.edu.co/sitemap_index.html |
Warnings
- 4 invalid lines.
Comments