www.dspace.uce.edu.ec
robots.txt
Robots Exclusion Standard data for www.dspace.uce.edu.ec
Resource Scan
Scan Details
Site Domain | www.dspace.uce.edu.ec |
Base Domain | uce.edu.ec |
Scan Status | Ok |
Last Scan | 2025-02-18T00:44:45+00:00 |
Next Scan | 2025-03-20T00:44:45+00:00 |
Last Scan
Scanned | 2025-02-18T00:44:45+00:00 |
URL | https://www.dspace.uce.edu.ec/robots.txt |
Response IP | 54.39.87.130 |
Found | Yes |
Hash | 1309ff16892542661938a838ae30362bc0e6a675fe037d84382df4e1ae82f058 |
SimHash | ef14d53bc1b5 |
Groups
*
Rule | Path |
---|---|
Disallow | /search |
Disallow | /admin/* |
Disallow | /processes |
Disallow | /submit |
Disallow | /workspaceitems |
Disallow | /profile |
Disallow | /workflowitems |
Disallow | /simple-search |
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Other Records
Field | Value |
---|---|
sitemap | http://www.dspace.uce.edu.ec/sitemap_index.xml |
sitemap | http://www.dspace.uce.edu.ec/sitemap_index.html |
Warnings
- 4 invalid lines.
Comments