livres-concours.cap-public.fr
robots.txt

Robots Exclusion Standard data for livres-concours.cap-public.fr

Resource Scan

Scan Details

Site Domain livres-concours.cap-public.fr
Base Domain cap-public.fr
Scan Status Ok
Last Scan2024-10-03T21:04:03+00:00
Next Scan 2024-11-02T21:04:03+00:00

Last Scan

Scanned2024-10-03T21:04:03+00:00
URL https://livres-concours.cap-public.fr/robots.txt
Domain IPs 188.72.70.85, 2a00:b6e0:1:20:16::1
Response IP 188.72.70.85
Found Yes
Hash 89c91b86e61481c856a0547d691f9f3e1b2092460b4e32c341c38dcc5ed5130f
SimHash 6f079c55bf91

Groups

*

Rule Path
Allow /local/cache-css/
Allow /local/cache-js/
Disallow /ecrire/
Disallow /lib/
Disallow /prive/
Disallow /spip.php?action=*
Disallow /spip.php?page=login*
Disallow /*.api/

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://livres-concours.cap-public.fr/sitemap.xml

Comments

  • robots.txt
  • @url: https://livres-concours.cap-public.fr
  • @generator: SPIP 4.3.2
  • @template: squelettes-dist/robots.txt.html

Warnings

  • `noindex` is not a known field.