herbdish.com
robots.txt

Robots Exclusion Standard data for herbdish.com

Resource Scan

Scan Details

Site Domain herbdish.com
Base Domain herbdish.com
Scan Status Ok
Last Scan2025-10-21T11:19:32+00:00
Next Scan 2025-10-28T11:19:32+00:00

Last Scan

Scanned2025-10-21T11:19:32+00:00
URL https://herbdish.com/robots.txt
Domain IPs 104.21.1.164, 172.67.129.161, 2606:4700:3031::ac43:81a1, 2606:4700:3035::6815:1a4
Response IP 172.67.129.161
Found Yes
Hash 0c7d5bcc957e4a3bdd06d4440ea49d374ea82382d5f816d63f94f72b984ff338
SimHash 6d301b10e4c8

Groups

*

Rule Path
Disallow /author/
Disallow /search/
Disallow /feed/
Disallow */feed/
Disallow /wprm_print/
Disallow /cgi-bin/
Disallow /xmlrpc.php
Disallow /embed/
Disallow /trackback/
Disallow /comments/
Disallow /tag/
Disallow /*?attachment_id=
Disallow /*?s=
Disallow /*/amp/
Disallow /wp-login.php
Disallow /wp-admin/
Disallow /*?utm_source=
Allow /wp-admin/admin-ajax.php

chatgpt-user

Rule Path
Allow /

oai-searchbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

googlebot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

claudebot

Rule Path
Allow /

gptbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.herbdish.com/sitemap_index.xml

Comments

  • πŸ”’ Block thin/duplicate content
  • ❌ Block tracking URLs
  • βœ… Allow WP Ajax
  • --- Allow AI & Search Bots ---
  • πŸ—ΊοΈ Sitemap