Robots Exclusion Standard data for ceea.edu

Resource Scan

Scan Details

Site Domain ceea.edu
Base Domain ceea.edu
Scan Status Ok
Last Scan 2024-09-14T05:43:31+00:00
Next Scan 2024-10-14T05:43:31+00:00

Last Scan

Scanned 2024-09-14T05:43:31+00:00
URL https://www.ceea.edu/robots.txt
Domain IPs 213.186.33.24
Response IP 213.186.33.24
Found Yes
Hash 1eb245ac97eccf0fed038194e5c9a0d174fd90fd625b63c1e1811748fb8f78ab
SimHash 2a8edcb49bb3

Groups

*

Rule Path
Disallow /local/
Disallow /ecrire/
Disallow /plugins-dist/
Disallow /lib/
Disallow /plugins/
Disallow /prive/
Disallow /squelettes-dist/
Disallow /squelettes/

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.ceea.edu/sitemap.xml
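
As a rough illustration of how the groups listed above might be evaluated, the sketch below rebuilds a robots.txt from the scan data and queries it with Python's standard `urllib.robotparser`. The exact line order and spacing of the original file are not part of the scan, so `ROBOTS_TXT` is an assumed reconstruction; `build_parser` and the `ExampleBot` user agent are hypothetical names used only for this example.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned file from the groups listed in the scan.
# The original line order and blank lines are not known.
ROBOTS_TXT = """\
User-agent: *
Disallow: /local/
Disallow: /ecrire/
Disallow: /plugins-dist/
Disallow: /lib/
Disallow: /plugins/
Disallow: /prive/
Disallow: /squelettes-dist/
Disallow: /squelettes/

User-agent: *
Crawl-delay: 1

User-agent: ccbot
Disallow: /

User-agent: chatgpt-user
Disallow: /

Sitemap: https://www.ceea.edu/sitemap.xml
"""


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text from memory (hypothetical helper, no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    rp = build_parser()
    base = "https://www.ceea.edu"
    # "ExampleBot" is a made-up agent that should fall under the "*" group.
    checks = [
        ("ExampleBot", "/"),
        ("ExampleBot", "/ecrire/"),
        ("CCBot", "/"),
        ("ChatGPT-User", "/"),
    ]
    for agent, path in checks:
        print(agent, path, rp.can_fetch(agent, base + path))
    # site_maps() requires Python 3.8 or newer.
    print(rp.site_maps())
```

How a parser treats the two separate `*` groups may differ between implementations, so the crawl-delay value is not queried here.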

Comments

  • robots.txt
  • @url: https://www.ceea.edu
  • @generator: SPIP 3.0.21 [22462]
  • @template: squelettes/robots.txt.html