dionc.org
robots.txt

Robots Exclusion Standard data for dionc.org

Resource Scan

Scan Details

Site Domain dionc.org
Base Domain dionc.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-12-22T05:52:06+00:00
Next Scan 2026-03-22T05:52:06+00:00

Last Successful Scan

Scanned2025-05-04T00:12:48+00:00
URL https://dionc.org/robots.txt
Domain IPs 104.21.61.29, 172.67.205.105, 2606:4700:3034::ac43:cd69, 2606:4700:3037::6815:3d1d
Response IP 104.21.61.29
Found Yes
Hash af7d48622aa673b9fb2b02a87e7de0374574cdd545aea96d6ba4cd1869a37c15
SimHash e1a0cb592ca1

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /cgi-bin/
Disallow /?s=
Disallow /search
Disallow /cart
Disallow /checkout
Disallow /my-account
Disallow /tag/
Disallow /author/
Disallow /comments/
Disallow /?replytocom
Disallow /?attachment_id=
Disallow /*?*add-to-cart*
Allow /wp-content/uploads/
Allow /wp-admin/admin-ajax.php

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://dionc.org/sitemap_index.xml

Comments

  • Robots.txt for WordPress site SEO Optimization
  • Optimized for fast indexing on Google
  • Allow all bots to crawl everything (except the disallowed parts)
  • Allow important files and sections to be indexed
  • Block certain non-SEO friendly bots (optional, adjust as needed)
  • Sitemap directives (for fast indexing by Google)