sweetcustomwebsites.com
robots.txt

Robots Exclusion Standard data for sweetcustomwebsites.com

Resource Scan

Scan Details

Site Domain sweetcustomwebsites.com
Base Domain sweetcustomwebsites.com
Scan Status Ok
Last Scan 2026-01-23T23:04:21+00:00
Next Scan 2026-02-22T23:04:21+00:00

Last Scan

Scanned 2026-01-23T23:04:21+00:00
URL https://sweetcustomwebsites.com/robots.txt
Domain IPs 2607:f1c0:100f:f000::2f0, 74.208.236.235
Response IP 74.208.236.235
Found Yes
Hash 0f9f2185d17115dc8ac69b57b4324a6293b853a0d3255f67620e1191e4a2b84f
SimHash 6c31fb120973
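
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The following is a minimal sketch of how a fetched copy of the file could be compared against that value, assuming SHA-256 computed over the raw response body. The function names `sha256_hex` and `matches_report` are hypothetical and not part of any scanner API.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "0f9f2185d17115dc8ac69b57b4324a6293b853a0d3255f67620e1191e4a2b84f"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """True if the body hashes to the reported value.

    ASSUMPTION: the report's Hash is SHA-256 of the unmodified response body.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch would not necessarily mean the file changed; it could also mean the assumption about the algorithm or the hashed bytes is wrong.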

Groups

facebookexternalhit

Rule Path
Allow /

*

Rule Path
Allow /
Disallow /cart
Disallow /checkout
Disallow /account
Disallow /login
Disallow /register
Disallow /orders
Disallow /wishlist
Disallow /search
Disallow /wp-content/uploads/wc-logs/
Disallow /wp-content/uploads/woocommerce_transient_files/
Disallow /wp-content/uploads/woocommerce_uploads/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /*?sort=
Disallow /*?page=
Disallow /*?filter=
Disallow /*?utm_
Disallow /*?ref=
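
The effect of the `*` group above depends on how a crawler resolves overlapping Allow and Disallow rules. The sketch below evaluates the listed rules under the longest-match convention described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). It is an illustration under that assumption, not a description of how any particular crawler behaves; `pattern_to_regex` and `is_allowed` are hypothetical helper names.

```python
import re

# Rules of the "*" group, copied from the scan above in their listed order.
RULES = [
    ("allow", "/"),
    ("disallow", "/cart"),
    ("disallow", "/checkout"),
    ("disallow", "/account"),
    ("disallow", "/login"),
    ("disallow", "/register"),
    ("disallow", "/orders"),
    ("disallow", "/wishlist"),
    ("disallow", "/search"),
    ("disallow", "/wp-content/uploads/wc-logs/"),
    ("disallow", "/wp-content/uploads/woocommerce_transient_files/"),
    ("disallow", "/wp-content/uploads/woocommerce_uploads/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/*?sort="),
    ("disallow", "/*?page="),
    ("disallow", "/*?filter="),
    ("disallow", "/*?utm_"),
    ("disallow", "/*?ref="),
]


def pattern_to_regex(pattern):
    """Compile a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Apply longest-match precedence; Allow wins when lengths tie."""
    best_len = -1
    allowed = True  # a path matched by no rule is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


print(is_allowed("/wp-admin/admin-ajax.php"))    # True
print(is_allowed("/wp-admin/options.php"))       # False
print(is_allowed("/shop?sort=price"))            # False
print(is_allowed("/shop?color=red&sort=price"))  # True
```

Under these semantics, the query-parameter rules only match when the parameter comes directly after the `?`, so `/shop?color=red&sort=price` remains crawlable. Prefix rules such as `/cart` would also match longer paths like `/cartoons`, since there is no trailing slash or `$` anchor.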

Other Records

Field Value
sitemap https://sweetcustomwebsites.com/sitemap_index.xml

Comments

  • Allow Facebook crawler full access
  • General directives for all other crawlers
  • Allow all key content
  • Disallow non-SEO-critical pages
  • Disallow WooCommerce and WordPress-specific non-public folders
  • Optional: Block common query parameters to reduce duplicate content
  • Sitemap location