docta.ucm.es
robots.txt
Robots Exclusion Standard data for docta.ucm.es
Resource Scan
Scan Details
Site Domain | docta.ucm.es |
Base Domain | ucm.es |
Scan Status | Ok |
Last Scan | 2024-11-03T08:15:55+00:00 |
Next Scan | 2024-12-03T08:15:55+00:00 |
Last Scan
Scanned | 2024-11-03T08:15:55+00:00 |
URL | https://docta.ucm.es/robots.txt |
Domain IPs | 147.96.2.167 |
Response IP | 147.96.2.167 |
Found | Yes |
Hash | 874df9370dbbda3c8353052b002ba1a0a3203471cc5c03583507cf856af81afd |
SimHash | 369c5f09e5b5 |
Groups
*
Rule | Path |
---|---|
Disallow | /search |
Disallow | /admin/* |
Disallow | /processes |
Disallow | /submit |
Disallow | /workspaceitems |
Disallow | /profile |
Disallow | /workflowitems |
Disallow | /entities/*?f |
Disallow | /statistics |
Disallow | /browse/* |
Disallow | /contact |
Disallow | /feedback |
Disallow | /forgot |
Disallow | /login |
Disallow | /register |
Disallow | /browse/* |
Disallow | /statistics |
Disallow | /contact |
Disallow | /feedback |
Disallow | /forgot |
Disallow | /login |
Disallow | /register |
Other Records
Field | Value |
---|---|
crawl-delay | 4 |
Other Records
Field | Value |
---|---|
sitemap | https://docta.ucm.es/sitemap_index.xml |
sitemap | https://docta.ucm.es/sitemap_index.html |
Comments