jahia.com
robots.txt

Robots Exclusion Standard data for jahia.com

Resource Scan

Scan Details

Site Domain jahia.com
Base Domain jahia.com
Scan Status Ok
Last Scan2024-05-30T21:34:20+00:00
Next Scan 2024-06-29T21:34:20+00:00

Last Scan

Scanned2024-05-30T21:34:20+00:00
URL https://www.jahia.com/robots.txt
Domain IPs 18.155.68.111, 18.155.68.26, 18.155.68.38, 18.155.68.79
Response IP 18.155.68.38
Found Yes
Hash 3410273abbb3091115f8296f39d314e40277151db69c81ea8ae55afba73dd24a
SimHash 4969e45045b3

Groups

*

Rule Path
Disallow /modules/
Disallow /cms/
Disallow /downloads/diffs/
Disallow /jahia/ajaxaction/
Disallow /cms/contribute/
Disallow /*/login
Disallow /*?reply=
Disallow /*?filter=
Disallow /*?pagesize=
Disallow /*?jahia_url_web_clipping=
Disallow /*/captcha
Disallow /*/jahias-blog.html?N-blog-posts=
Disallow /*newPost.html
Disallow /en/users/
Disallow /fr/users/
Disallow /*newPost?

Other Records

Field Value
sitemap https://www.jahia.com/cms/render/live/en/sites/www/home.full-sitemap.xml
sitemap https://www.jahia.com/cms/render/live/fr/sites/www/home.full-sitemap.xml