repositorio.usil.edu.pe
robots.txt
Robots Exclusion Standard data for repositorio.usil.edu.pe
Resource Scan
Scan Details
Site Domain | repositorio.usil.edu.pe |
Base Domain | usil.edu.pe |
Scan Status | Ok |
Last Scan | 2025-03-04T19:49:08+00:00 |
Next Scan | 2025-04-03T19:49:08+00:00 |
Last Scan
Scanned | 2025-03-04T19:49:08+00:00 |
URL | https://repositorio.usil.edu.pe/robots.txt |
Domain IPs | 54.39.90.202 |
Response IP | 54.39.90.202 |
Found | Yes |
Hash | c028dbeb73b3e7c14ae30f18d9d9137b68b889da4860be269f94df430449e50d |
SimHash | 2f14d53fc1b5 |
Groups
*
Rule | Path |
---|---|
Disallow | /search |
Disallow | /admin/* |
Disallow | /processes |
Disallow | /submit |
Disallow | /workspaceitems |
Disallow | /profile |
Disallow | /workflowitems |
Disallow | /simple-search |
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Other Records
Field | Value |
---|---|
sitemap | https://repositorio.usil.edu.pe/sitemap_index.xml |
sitemap | https://repositorio.usil.edu.pe/sitemap_index.html |
Warnings
- 4 invalid lines.
Comments