halfbakedharvest.com
robots.txt

Robots Exclusion Standard data for halfbakedharvest.com

Resource Scan

Scan Details

Site Domain halfbakedharvest.com
Base Domain halfbakedharvest.com
Scan Status Ok
Last Scan2024-11-16T17:48:22+00:00
Next Scan 2024-11-23T17:48:22+00:00

Last Scan

Scanned2024-11-16T17:48:22+00:00
URL https://halfbakedharvest.com/robots.txt
Domain IPs 104.21.76.229, 172.67.201.238, 2606:4700:3030::6815:4ce5, 2606:4700:3036::ac43:c9ee
Response IP 104.21.76.229
Found Yes
Hash 85d912f8d4065d4e4ab52f2f654e797dfe463104eb01586f36c2c9b2860245ff
SimHash 7220d9408596

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /wp-admin/
Disallow /wp-login.php?*
Allow /wp-admin/admin-ajax.php

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.halfbakedharvest.com/sitemap_index.xml