
Robots Exclusion Standard data for dial.uclouvain.be

Resource Scan

Scan Details

Site Domain dial.uclouvain.be
Base Domain uclouvain.be
Scan Status Ok
Last Scan 2025-09-13T18:59:12+00:00
Next Scan 2025-10-13T18:59:12+00:00

Last Scan

Scanned 2025-09-13T18:59:12+00:00
URL https://dial.uclouvain.be/robots.txt
Domain IPs 130.104.33.19, 2001:6a8:3081:1010:a::13
Response IP 130.104.33.19
Found Yes
Hash 08b0c29fc9ed8c40688512883a83b333b68107881ddabb1934109b8ed0a1c5c7
SimHash c31306f0ea80

Groups

*

Rule Path
Allow /handle/*
Allow /assets/*
Allow /styles/*
Allow /DialExport/*
Allow /sitemap/*
Allow /downloader/*
Disallow /*adre%3A*
Disallow /*relatedLinks.cgi
Disallow /*SaxonServlet*
Disallow /*?f0=*
Disallow /*?f1=*
Disallow /*mesh%3D*
Disallow /*site_name%3DADRE*
Disallow /*site_name%3DSPIRES*
Disallow /*site_name%3DNUMERISATION*
Disallow /vital/access/services/Feed*
Disallow /vital/access/services/Browse*
Disallow /vital/access/services/Advanced*
Disallow /vital/access/manager/Repository?sort=ss_dateNormalized*
Disallow /vital/access/manager/Browse/*
Disallow /cgi-bin/*
Disallow /ebook/*
Disallow /solr/*
Disallow /valet/*
Disallow /Rebulous/*
Disallow /vital/access/services/*
Disallow /fop/*
Disallow /fedora/*
Disallow */storage/*
Disallow /oai/*
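
The Allow and Disallow paths above use `*` wildcards, so a crawler has to evaluate them as patterns rather than plain prefixes. The following is a minimal sketch of one way to do that, assuming the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, `RULES` holds only a subset of the listed paths, and percent-encoding normalization is left out.

```python
import re

# Hypothetical subset of the "*" group rules listed above.
RULES = [
    ("allow", "/handle/*"),
    ("allow", "/sitemap/*"),
    ("disallow", "/*?f0=*"),
    ("disallow", "/solr/*"),
    ("disallow", "*/storage/*"),
    ("disallow", "/vital/access/services/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions a path such as `/handle/x/storage/y` is refused, because `*/storage/*` is a longer pattern than `/handle/*`.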

Other Records

Field Value
crawl-delay 20
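
The `crawl-delay` record is not part of RFC 9309, and crawlers that honor it commonly read the value as a minimum number of seconds between requests. The sketch below assumes that interpretation. `PoliteScheduler` is a hypothetical helper, and the clock and sleep functions are injectable only so the behavior can be checked without waiting.

```python
import time


class PoliteScheduler:
    """Space out requests by at least `delay_seconds` (assumed meaning of crawl-delay)."""

    def __init__(self, delay_seconds=20.0, clock=time.monotonic, sleep=time.sleep):
        self._delay = delay_seconds
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous request, if any

    def wait(self):
        """Block until the delay since the previous request has elapsed."""
        if self._last is not None:
            remaining = self._delay - (self._clock() - self._last)
            if remaining > 0:
                self._sleep(remaining)
        self._last = self._clock()
```

A crawler would call `wait()` immediately before each request to the host.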

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /
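
The named groups above apply only to crawlers whose product token matches, and every other crawler falls back to the `*` group. A minimal sketch of that selection, assuming case-insensitive matching on the product token as in RFC 9309, is shown here. `GROUPS` and `select_group` are hypothetical names, and the `*` entry is abbreviated.

```python
# Hypothetical in-memory view of the groups listed in this scan.
GROUPS = {
    "*": [("allow", "/handle/*"), ("disallow", "/solr/*")],  # abbreviated
    "ahrefsbot": [("disallow", "/")],
    "semrushbot": [("disallow", "/")],
    "turnitinbot": [("disallow", "/")],
    "petalbot": [("disallow", "/")],
}


def select_group(product_token, groups=GROUPS):
    """Return the key of the group that applies to `product_token`.

    Assumption: tokens are compared case-insensitively, and "*" is used
    when no named group matches.
    """
    token = product_token.lower()
    return token if token in groups else "*"
```

For example, `select_group("AhrefsBot")` selects the `ahrefsbot` group, whose single rule disallows the whole site.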

Other Records

Field Value
sitemap http://dial.uclouvain.be/sitemap/index_UCL.xml
sitemap https://dial.uclouvain.be/sitemap/index_UCL.xml