cv.hal.science
robots.txt
Robots Exclusion Standard data for cv.hal.science
Resource Scan
Scan Details
Site Domain | cv.hal.science |
Base Domain | hal.science |
Scan Status | Ok |
Last Scan | 2025-03-03T13:54:43+00:00 |
Next Scan | 2025-04-02T13:54:43+00:00 |
Last Scan
Scanned | 2025-03-03T13:54:43+00:00 |
URL | https://cv.hal.science/robots.txt |
Domain IPs | 193.48.96.10 |
Response IP | 193.48.96.10 |
Found | Yes |
Hash | c3c618afea34858c3130111d275b27aecbc66ecc79cac1c3780911c30902f4fa |
SimHash | 391d0e4dc781 |
Groups
*
Rule | Path |
---|---|
Disallow | /user/* |
Disallow | /error/* |
Disallow | /*/authFullName_t/* |
Disallow | /*/authIdHal_s/* |
Disallow | /*/primaryDomain_s/* |
Disallow | /*/producedDateY_i/* |
Disallow | /*/journalId_i/* |
Disallow | /*/keyword_s/* |
Disallow | /*/structId_i/* |
Other Records
Field | Value |
---|---|
sitemap | http://cv.hal.science/robots/sitemap |
Comments