cv.hal.science
robots.txt

Robots Exclusion Standard data for cv.hal.science

Resource Scan

Scan Details

Site Domain cv.hal.science
Base Domain hal.science
Scan Status Ok
Last Scan2025-11-28T16:13:11+00:00
Next Scan 2025-12-28T16:13:11+00:00

Last Scan

Scanned2025-11-28T16:13:11+00:00
URL https://cv.hal.science/robots.txt
Domain IPs 193.48.96.72
Response IP 193.48.96.72
Found Yes
Hash 464a5200d998322bf933bfa52dcafab1ac3ab267b61356568583bb1e7ff1a79a
SimHash 290d0a40d383

Groups

*

Rule Path
Disallow /user/*
Disallow /error/*
Disallow /*/authFullName_t/*
Disallow /*/primaryDomain_s/*
Disallow /*/producedDateY_i/*
Disallow /*/journalId_i/*
Disallow /*/keyword_s/*
Disallow /*/structId_i/*

Other Records

Field Value
sitemap https://cv.hal.science/robots/sitemap

Comments

  • CV robots.txt
  • Sitemap