cv.hal.science
robots.txt
Robots Exclusion Standard data for cv.hal.science
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | cv.hal.science |
| Base Domain | hal.science |
| Scan Status | Ok |
| Last Scan | 2025-11-28T16:13:11+00:00 |
| Next Scan | 2025-12-28T16:13:11+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-28T16:13:11+00:00 |
| URL | https://cv.hal.science/robots.txt |
| Domain IPs | 193.48.96.72 |
| Response IP | 193.48.96.72 |
| Found | Yes |
| Hash | 464a5200d998322bf933bfa52dcafab1ac3ab267b61356568583bb1e7ff1a79a |
| SimHash | 290d0a40d383 |
Groups
User-agent: * (all crawlers)
| Rule | Path |
|---|---|
| Disallow | /user/* |
| Disallow | /error/* |
| Disallow | /*/authFullName_t/* |
| Disallow | /*/primaryDomain_s/* |
| Disallow | /*/producedDateY_i/* |
| Disallow | /*/journalId_i/* |
| Disallow | /*/keyword_s/* |
| Disallow | /*/structId_i/* |
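
The rules above use `*` wildcards, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how a crawler might test a path against them, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end of the path, and every rule is matched from the start of the path. The helper names and the sample paths are hypothetical and are not taken from the scanned site.

```python
import re

# Disallow paths copied from the table above (group "*").
DISALLOW_RULES = [
    "/user/*",
    "/error/*",
    "/*/authFullName_t/*",
    "/*/primaryDomain_s/*",
    "/*/producedDateY_i/*",
    "/*/journalId_i/*",
    "/*/keyword_s/*",
    "/*/structId_i/*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into a compiled regex (hypothetical helper)."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with ".*" where the rule had "*".
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the path from its start."""
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW_RULES)


if __name__ == "__main__":
    # Sample paths are made up for illustration.
    for sample in ("/user/login", "/jane-doe/keyword_s/physics", "/jane-doe"):
        print(sample, is_disallowed(sample))
```

Because this group contains no Allow rules, the sketch skips the longest-match precedence between Allow and Disallow that a full implementation would need.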
Other Records
| Field | Value |
|---|---|
| sitemap | https://cv.hal.science/robots/sitemap |
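
The split between Groups and Other Records can be illustrated with a small parser. This is a sketch under stated assumptions: `ROBOTS_TXT` is a reconstruction from the tables in this report, not the fetched file itself (so it will not reproduce the hash listed above), and `parse_robots` is a hypothetical helper, not the scanner's actual code.

```python
# Reconstructed from the report's tables; the real file may differ in
# ordering, spacing, or comments.
ROBOTS_TXT = """\
User-agent: *
Disallow: /user/*
Disallow: /error/*
Disallow: /*/authFullName_t/*
Disallow: /*/primaryDomain_s/*
Disallow: /*/producedDateY_i/*
Disallow: /*/journalId_i/*
Disallow: /*/keyword_s/*
Disallow: /*/structId_i/*
Sitemap: https://cv.hal.science/robots/sitemap
"""


def parse_robots(text):
    """Split robots.txt text into per-agent rule groups and other records."""
    groups = {}       # user-agent -> list of (rule, path)
    others = []       # (field, value) records that belong to no group
    agents = []       # user-agents of the group currently being read
    in_rules = False  # True once the current group has at least one rule
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = (s.strip() for s in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:  # a user-agent line after rules starts a new group
                agents = []
                in_rules = False
            agents.append(value)
            groups.setdefault(value, [])
        elif field in ("allow", "disallow"):
            in_rules = True
            for agent in agents:
                groups[agent].append((field.capitalize(), value))
        else:
            others.append((field, value))
    return groups, others


if __name__ == "__main__":
    groups, others = parse_robots(ROBOTS_TXT)
    print(groups)
    print(others)
```

Splitting each line on the first colon only keeps the `https://` in the sitemap URL intact.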