jiwh.org
robots.txt

Robots Exclusion Standard data for jiwh.org

Resource Scan

Scan Details

Site Domain jiwh.org
Base Domain jiwh.org
Scan Status Ok
Last Scan2025-10-02T15:19:10+00:00
Next Scan 2025-11-01T15:19:10+00:00

Last Scan

Scanned2025-10-02T15:19:10+00:00
URL https://jiwh.org/robots.txt
Domain IPs 104.21.86.91, 172.67.217.102, 2606:4700:3031::6815:565b, 2606:4700:3036::ac43:d966
Response IP 104.21.86.91
Found Yes
Hash f655f4aee2ff5a1eec027e29c212d9078f216b4c749794bb18f77dedd8bf7989
SimHash 6518a9419731

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /?
Disallow /wp-
Disallow /wp/
Disallow *?s=
Disallow *%26s%3D
Disallow */embed
Disallow */wlwmanifest.xml
Disallow /xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Allow */uploads
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif

Other Records

Field Value
sitemap https://www.jiwh.org/sitemap_index.xml