hal.science
robots.txt
Robots Exclusion Standard data for hal.science
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | hal.science |
Base Domain | hal.science |
Scan Status | Ok |
Last Scan | 2025-02-26T04:22:54+00:00 |
Next Scan | 2025-03-28T04:22:54+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-02-26T04:22:54+00:00 |
URL | https://hal.science/robots.txt |
Redirect | https://hal.science/robots |
Domain IPs | 193.48.96.10 |
Response IP | 193.48.96.10 |
Found | Yes |
Hash | 0448039ee295f72c4f4f198cdcc6818e37b8b59557a66ce32e8c50f299ae8b99 |
SimHash | 700d7159c282 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /RePEc/ |
Disallow | /search/ |
Disallow | /*/search/ |
Disallow | /*/browse/last |
Disallow | /browse/last |
Disallow | /*/browse/latest-publications |
Disallow | /browse/latest-publications |
Disallow | /browse/domain |
Disallow | /*/browse/domain |
Disallow | /browse/author-structure |
Disallow | /*/browse/author-structure |
Disallow | /browse/laboratory |
Disallow | /*/browse/laboratory |
Disallow | /browse/author |
Disallow | /*/browse/author |
Disallow | */tei |
Disallow | */rdf |
Disallow | */bibtex |
Disallow | */dc |
Disallow | */datacite |
Disallow | */openaire |
Disallow | */dcterms |
Disallow | */endnote |
Disallow | */json |
Disallow | /ping |
Disallow | /login |
Disallow | /submit |
Disallow | /user |
Disallow | /*/user/* |
Disallow | /error |
Disallow | */preview/* |
Disallow | /view/resolver/* |
Disallow | */ajax* |
Disallow | */widget* |
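
Several of the rules above use the `*` wildcard, which simple prefix matchers (including, to my knowledge, Python's standard `urllib.robotparser`) do not interpret. The following is a minimal sketch of how a crawler might test a path against these rules, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end of the path, and everything else is matched as a prefix. The `DISALLOW` list is a subset copied from the table; the function names and the example paths (such as `/hal-01234567`) are hypothetical and used for illustration only. Because this group contains no `Allow` rules, any single matching `Disallow` pattern is enough to block a path, so no longest-match precedence logic is needed here.

```python
import re

# Subset of the Disallow patterns listed for the "*" group above.
DISALLOW = [
    "/RePEc/",
    "/search/",
    "/*/search/",
    "/*/browse/author",
    "*/bibtex",
    "*/json",
    "/login",
    "*/preview/*",
    "*/ajax*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    Assumed semantics: '*' matches any run of characters, a trailing '$'
    anchors the end of the path, and all other characters are literal.
    Without '$', the pattern only has to match a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise the patterns.
    for path in ["/search/index", "/hal-01234567/bibtex", "/hal-01234567"]:
        print(path, is_disallowed(path))
```

Under these assumptions, a hypothetical export path such as `/hal-01234567/bibtex` is blocked by `*/bibtex`, while the bare record path `/hal-01234567` matches none of the listed patterns.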
Other Records
Field | Value |
---|---|
sitemap | http://hal.science/robots/sitemap |