unitesi.unive.it
robots.txt
Robots Exclusion Standard data for unitesi.unive.it
Resource Scan
Scan Details
Site Domain | unitesi.unive.it |
Base Domain | unive.it |
Scan Status | Ok |
Last Scan | 2025-08-13T03:33:50+00:00 |
Next Scan | 2025-09-12T03:33:50+00:00 |
Last Scan
Scanned | 2025-08-13T03:33:50+00:00 |
URL | https://unitesi.unive.it/robots.txt |
Domain IPs | 130.186.6.62 |
Response IP | 130.186.6.62 |
Found | Yes |
Hash | 1f417854e63ee5c16e78d6581b7d978a196d8ba9d0a6098be6bffe191ee68e18 |
SimHash | a5945f15e5bd |
Groups
*
Rule | Path |
---|---|
Disallow | /discover |
Disallow | /simple-search |
Disallow | /itemExternal |
Disallow | /itemExternalCitation |
Disallow | /itemExternalPercentile |
Other Records
Field | Value |
---|---|
sitemap | https://unitesi.unive.it/sitemap |
sitemap | https://unitesi.unive.it/htmlmap |
Warnings
- 1 invalid line.
Comments