chusj.org
robots.txt

Robots Exclusion Standard data for chusj.org

Resource Scan

Scan Details

Site Domain chusj.org
Base Domain chusj.org
Scan Status Ok
Last Scan2025-05-18T21:33:52+00:00
Next Scan 2025-06-17T21:33:52+00:00

Last Scan

Scanned2025-05-18T21:33:52+00:00
URL https://chusj.org/robots.txt
Domain IPs 206.167.252.145
Response IP 206.167.252.145
Found Yes
Hash 832cbb701c7c7526373de5ad22c365992071eb504922b61f9068101ace68e676
SimHash 4b8fd5f0d233

Groups

*

Rule Path
Disallow /Infolettres/
Disallow *?searchtext=*
Disallow /Admin/
Disallow /CMSHelp/
Disallow /Divers/
Disallow /Infolettres/Reflexe/

Other Records

Field Value
crawl-delay 120

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

clark-crawler2

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

obot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

semanticscholarbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

yahoo

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

netestate

Rule Path
Disallow /

applewebkit

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

yahoo! slurp

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.chusj.org/Ressources/Divers/Sitemaps/SitemapPagesSujet
sitemap https://www.chusj.org/Ressources/Divers/sitemap-profil-bio.aspx
sitemap https://www.chusj.org/googlesitemap-xml
sitemap https://www.chusj.org/sitemap.xml