cv.hal.science
robots.txt

Robots Exclusion Standard data for cv.hal.science

Resource Scan

Scan Details

Site Domain cv.hal.science
Base Domain hal.science
Scan Status Ok
Last Scan 2025-03-03T13:54:43+00:00
Next Scan 2025-04-02T13:54:43+00:00

Last Scan

Scanned 2025-03-03T13:54:43+00:00
URL https://cv.hal.science/robots.txt
Domain IPs 193.48.96.10
Response IP 193.48.96.10
Found Yes
Hash c3c618afea34858c3130111d275b27aecbc66ecc79cac1c3780911c30902f4fa
SimHash 391d0e4dc781

Groups

*

Rule Path
Disallow /user/*
Disallow /error/*
Disallow /*/authFullName_t/*
Disallow /*/authIdHal_s/*
Disallow /*/primaryDomain_s/*
Disallow /*/producedDateY_i/*
Disallow /*/journalId_i/*
Disallow /*/keyword_s/*
Disallow /*/structId_i/*
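
The rules above use `*` wildcards in the middle of paths, so a plain prefix comparison is not enough to evaluate them. The following is a minimal sketch of how a crawler might test a URL path against this group, assuming the wildcard semantics of RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end of the path). The helper names `pattern_to_regex` and `is_disallowed` and the sample paths are hypothetical and are not taken from the scanned site.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/user/*",
    "/error/*",
    "/*/authFullName_t/*",
    "/*/authIdHal_s/*",
    "/*/primaryDomain_s/*",
    "/*/producedDateY_i/*",
    "/*/journalId_i/*",
    "/*/keyword_s/*",
    "/*/structId_i/*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regular expression.

    Assumed semantics (RFC 9309): '*' matches any run of characters and a
    trailing '$' anchors the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    # re.match anchors at the beginning of the string, which mirrors how
    # robots.txt rules are compared against the start of the URL path.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise the patterns.
    for sample in ["/user/login", "/jdoe/keyword_s/physics", "/jdoe"]:
        print(sample, is_disallowed(sample))
```

Because this group contains only Disallow rules, the sketch does not implement the longest-match precedence that would be needed to arbitrate between Allow and Disallow rules.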

Other Records

Field Value
sitemap http://cv.hal.science/robots/sitemap

Comments

  • CV robots.txt
  • Sitemap