cfa-roosevelt.fr
robots.txt

Robots Exclusion Standard data for cfa-roosevelt.fr

Resource Scan

Scan Details

Site Domain cfa-roosevelt.fr
Base Domain cfa-roosevelt.fr
Scan Status Ok
Last Scan2026-02-09T00:29:19+00:00
Next Scan 2026-03-11T00:29:19+00:00

Last Scan

Scanned2026-02-09T00:29:19+00:00
URL https://cfa-roosevelt.fr/robots.txt
Domain IPs 104.21.41.113, 172.67.164.115, 2606:4700:3033::6815:2971, 2606:4700:3033::ac43:a473
Response IP 172.67.164.115
Found Yes
Hash 1e75f562ea6443accc031da15e0213e1db13e745da0f641e84e4d74662cc3c4d
SimHash 4b105972a7b3

Groups

*

Rule Path
Disallow /?
Disallow /*?
Disallow /*?page=
Disallow /cgi-bin*
Disallow /functions/sitemap-generation.php
Allow /*.css
Allow /*.js

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://cfa-roosevelt.fr/sitemap.xml

Warnings

  • `host` is not a known field.