jonsguide.org
robots.txt

Robots Exclusion Standard data for jonsguide.org

Resource Scan

Scan Details

Site Domain jonsguide.org
Base Domain jonsguide.org
Scan Status Ok
Last Scan 2025-10-08T21:07:22+00:00
Next Scan 2025-11-07T21:07:22+00:00

Last Scan

Scanned 2025-10-08T21:07:22+00:00
URL https://jonsguide.org/robots.txt
Domain IPs 104.21.0.144, 172.67.128.20, 2606:4700:3032::ac43:8014, 2606:4700:3033::6815:90
Response IP 172.67.128.20
Found Yes
Hash 3ea8362f4b670903b53117397443a41b54f98144e5b4d9e6b45548b1e0589102
SimHash 2110b9738638

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /?
Disallow /wp-
Disallow /wp/
Disallow *?s=
Disallow *%26s%3D
Disallow /*?*
Disallow /issue/
Disallow /search/
Disallow /author/
Disallow /users/
Disallow */trackback
Disallow */feed
Disallow */rss
Disallow */embed
Disallow */page
Disallow /xmlrpc.php
Allow */uploads
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
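
Several of the rules above overlap (for example `Disallow /wp-` and `Allow /wp-*.png`), so the outcome for a given path depends on how a crawler resolves conflicts. The sketch below assumes the precedence described in RFC 9309: the longest matching pattern wins, and `Allow` wins a tie. The names `RULES`, `pattern_to_regex` and `is_allowed` are illustrative, the rule list is only a subset of the table, and real crawlers may differ in details such as percent-encoding normalisation.

```python
import re

# Subset of the rules listed for the `*` group (not the full table).
RULES = [
    ("disallow", "/wp-"),
    ("disallow", "/*?*"),
    ("disallow", "*/feed"),
    ("disallow", "/author/"),
    ("allow", "*/uploads"),
    ("allow", "/wp-*.png"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex matched from the path start.

    `*` matches any run of characters; a trailing `$` anchors the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the assumed precedence."""
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


for sample in ("/wp-admin/index.php",
               "/wp-content/uploads/2024/img.jpg",
               "/blog/post/feed",
               "/about/"):
    print(sample, is_allowed(sample))
```

Under these assumptions a path under `/wp-content/uploads/` stays crawlable because `*/uploads` is a longer pattern than `/wp-`, while `/wp-admin/` paths remain blocked.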

Other Records

Field Value
sitemap https://jonsguide.org/sitemap_index.xml

Warnings

  • 1 invalid line.
  • `host` is not a known field.
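
Warnings of this kind can be reproduced with a simple line check. The sketch below is not the scanner's actual logic, which this report does not show. It assumes that the known fields are `user-agent`, `allow`, `disallow` and `sitemap`, and that a non-empty line without a `:` separator counts as invalid. `KNOWN_FIELDS`, `check_lines` and the `SAMPLE` text are hypothetical; the sample is not the content of the scanned file.

```python
# Assumed set of recognised fields; a real scanner may accept more.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}

# Hypothetical input, not the actual robots.txt of the scanned site.
SAMPLE = """\
User-agent: *
Disallow: /cgi-bin
Host: example.org
Sitemap: https://example.org/sitemap.xml
"""


def check_lines(text):
    """Return (line number, message) pairs for lines that look problematic."""
    warnings = []
    for number, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        field, sep, _value = line.partition(":")
        name = field.strip().lower()
        if not sep:
            warnings.append((number, "invalid line"))
        elif name not in KNOWN_FIELDS:
            warnings.append((number, f"`{name}` is not a known field"))
    return warnings


for number, message in check_lines(SAMPLE):
    print(f"line {number}: {message}")
```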