cfa-roosevelt.fr
robots.txt
Robots Exclusion Standard data for cfa-roosevelt.fr
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | cfa-roosevelt.fr |
| Base Domain | cfa-roosevelt.fr |
| Scan Status | Ok |
| Last Scan | 2026-02-09T00:29:19+00:00 |
| Next Scan | 2026-03-11T00:29:19+00:00 |
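The two timestamps above can be compared directly. The short sketch below parses them and computes the gap; the 30-day result suggests, but does not confirm, a fixed 30-day rescan interval. The variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2026-02-09T00:29:19+00:00")
next_scan = datetime.fromisoformat("2026-03-11T00:29:19+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)  # 30
```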
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-02-09T00:29:19+00:00 |
| URL | https://cfa-roosevelt.fr/robots.txt |
| Domain IPs | 104.21.41.113, 172.67.164.115, 2606:4700:3033::6815:2971, 2606:4700:3033::ac43:a473 |
| Response IP | 172.67.164.115 |
| Found | Yes |
| Hash | 1e75f562ea6443accc031da15e0213e1db13e745da0f641e84e4d74662cc3c4d |
| SimHash | 4b105972a7b3 |
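The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming it is SHA-256 over the raw response body, a fetched copy of the file could be checked against the report roughly as follows. `body_digest` and `matches_report` are hypothetical helper names, and the SimHash value is not covered here.

```python
import hashlib

# Hash value copied from the Last Scan table.
REPORTED_HASH = "1e75f562ea6443accc031da15e0213e1db13e745da0f641e84e4d74662cc3c4d"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """True if the body hashes to the value listed in the report."""
    return body_digest(body) == reported.lower()
```

A mismatch would not necessarily mean the file changed: the scanner may hash a normalized form of the body rather than the raw bytes.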
Groups
User-agent: `*`
| Rule | Path |
|---|---|
| Disallow | /? |
| Disallow | /*? |
| Disallow | /*?page= |
| Disallow | /cgi-bin* |
| Disallow | /functions/sitemap-generation.php |
| Allow | /*.css |
| Allow | /*.js |
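To see how these rules interact, here is a minimal sketch of the matching procedure described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and an Allow rule wins a tie. It is an approximation for illustration, not the scanner's or any particular crawler's implementation; `pattern_to_regex` and `is_allowed` are hypothetical names, and percent-encoding normalization is omitted.

```python
import re

# Rules copied from the table for the `*` group.
RULES = [
    ("disallow", "/?"),
    ("disallow", "/*?"),
    ("disallow", "/*?page="),
    ("disallow", "/cgi-bin*"),
    ("disallow", "/functions/sitemap-generation.php"),
    ("allow", "/*.css"),
    ("allow", "/*.js"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Longest matching pattern wins; Allow wins ties; no match means allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


for path in ["/", "/index.php?page=2", "/style.css?v=3", "/cgi-bin/test"]:
    print(path, is_allowed(path))
```

Under these assumptions, `/style.css?v=3` stays crawlable because `/*.css` is a longer pattern than `/*?`, while `/a.js?page=1` is blocked because `/*?page=` is longer than `/*.js`. Note also that `/*?` already matches everything that `/?` and `/*?page=` match.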
Other Records
| Field | Value |
|---|---|
| sitemap | https://cfa-roosevelt.fr/sitemap.xml |
Warnings
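- `host` is not a known field. It is not defined by the Robots Exclusion Protocol (RFC 9309); it is a non-standard directive historically recognized by Yandex.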
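A warning of this kind can be produced by a simple line-by-line pass over the file. The sketch below is one way to do it and is not the scanner's actual logic: the set of known fields is an assumption, and the sample input is hypothetical (the report does not show the original file or the value of its `host` line, so a placeholder is used).

```python
# Assumed set of recognized fields; a real scanner may accept more.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}

# Hypothetical sample input; the Host value is a placeholder, not from the report.
SAMPLE = """\
User-agent: *
Disallow: /cgi-bin*
Host: example.invalid
Sitemap: https://cfa-roosevelt.fr/sitemap.xml
"""


def scan_fields(text):
    """Split robots.txt text into known records and warnings for unknown fields."""
    records, warnings = [], []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field, value = line.split(":", 1)
        field = field.strip().lower()  # field names are case-insensitive
        if field in KNOWN_FIELDS:
            records.append((field, value.strip()))
        else:
            message = f"`{field}` is not a known field."
            if message not in warnings:
                warnings.append(message)
    return records, warnings


records, warnings = scan_fields(SAMPLE)
print(warnings)
```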