minerva.usc.es
robots.txt
Robots Exclusion Standard data for minerva.usc.es
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | minerva.usc.es |
Base Domain | usc.es |
Scan Status | Ok |
Last Scan | 2025-09-23T03:44:11+00:00 |
Next Scan | 2025-10-23T03:44:11+00:00 |
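The Last Scan and Next Scan timestamps are ISO 8601 values in UTC, so the gap between them can be checked directly. The sketch below only does that arithmetic; the variable names are illustrative, and treating the result as the scanner's rescan interval is an assumption, not something the report states.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2025-09-23T03:44:11+00:00")
next_scan = datetime.fromisoformat("2025-10-23T03:44:11+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)
```

For these two values the difference is exactly 30 days, which suggests (but does not prove) a 30-day rescan schedule.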
Last Scan
Field | Value |
---|---|
Scanned | 2025-09-23T03:44:11+00:00 |
URL | https://minerva.usc.es/robots.txt |
Redirect | https://minerva.usc.gal/robots.txt |
Redirect Domain | minerva.usc.gal |
Redirect Base | usc.gal |
Domain IPs | 52.18.147.28 |
Redirect IPs | 52.18.147.28 |
Response IP | 52.18.147.28 |
Found | Yes |
Hash | b7ff5b9127aa7e9cbc7aaa854abafc6f27bf443aaa1a47891654e23f094d702c |
SimHash | 3694c109e5b7 |
Groups
*
Rule | Path |
---|---|
Disallow | /entities/*?f |
Disallow | /search |
Disallow | /admin/* |
Disallow | /processes |
Disallow | /submit |
Disallow | /workspaceitems |
Disallow | /profile |
Disallow | /workflowitems |
Disallow | /statistics |
Disallow | /browse/* |
Disallow | /contact |
Disallow | /feedback |
Disallow | /forgot |
Disallow | /login |
Disallow | /register |
Disallow | /contact |
Other Records
Field | Value |
---|---|
crawl-delay | 6 |
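To make the effect of the `*` group concrete, here is a minimal sketch of how a crawler might evaluate these rules. It assumes the common interpretation in which each pattern is matched against the start of the path plus query string, `*` stands for any run of characters, and a trailing `$` anchors the end. The function names (`pattern_to_regex`, `is_allowed`, `earliest_next_request`) and the example paths are hypothetical, and the sketch ignores Allow rules and longest-match precedence because this group contains only Disallow rules.

```python
import re

# Disallow patterns copied from the "*" group in the table above.
DISALLOW = [
    "/entities/*?f", "/search", "/admin/*", "/processes", "/submit",
    "/workspaceitems", "/profile", "/workflowitems", "/statistics",
    "/browse/*", "/contact", "/feedback", "/forgot", "/login", "/register",
]

# crawl-delay value from the same group, read here as seconds.
CRAWL_DELAY_SECONDS = 6


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex used with match()."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and turn each "*" into ".*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_allowed(path_and_query: str) -> bool:
    """Return True when no Disallow pattern matches from the start."""
    return not any(rule.match(path_and_query) for rule in RULES)


def earliest_next_request(last_request_time: float) -> float:
    """Earliest time (in seconds) for the next request under the delay."""
    return last_request_time + CRAWL_DELAY_SECONDS


# Hypothetical paths, chosen only to exercise the patterns.
print(is_allowed("/entities/publication/123"))             # no rule matches
print(is_allowed("/entities/publication/123?f.author=x"))  # hits /entities/*?f
print(is_allowed("/search?query=thesis"))                  # hits /search
```

Because matching is by prefix, `/search` also blocks paths such as `/searching`, while `/admin/*` blocks `/admin/users` but not the bare `/admin`. Note that `crawl-delay` is not part of RFC 9309, so whether a given crawler honors it is up to that crawler.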
Other Records
Field | Value |
---|---|
sitemap | https://minerva.usc.gal/sitemap_index.xml |
sitemap | https://minerva.usc.gal/sitemap_index.html |
Warnings
- `disaluser-agent` is not a known field.
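A warning like this can be produced by checking each field name against a list of recognised ones. The sketch below is an assumption about how such a check might work, not the scanner's actual code: `KNOWN_FIELDS` is a guess at the recognised set, `unknown_fields` is a hypothetical helper, and `SAMPLE` is made-up input for illustration rather than the contents of the scanned file.

```python
# Field names a scanner might recognise (an assumed list).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_fields(robots_txt: str) -> list[str]:
    """Return unrecognised field names in order of first appearance."""
    found = []
    for raw in robots_txt.splitlines():
        # Drop comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        # The field name is everything before the first colon.
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in found:
            found.append(field)
    return found


# Made-up sample input, not the real minerva.usc.es file.
SAMPLE = """\
User-agent: *
Disallow: /login
Disaluser-agent: *
Crawl-delay: 6
Sitemap: https://minerva.usc.gal/sitemap_index.xml
"""

for field in unknown_fields(SAMPLE):
    print(f"`{field}` is not a known field.")
```

A name such as `disaluser-agent` looks like two directives run together, so a line flagged this way is usually worth inspecting in the source file by hand.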