healthyduck.org
robots.txt

Robots Exclusion Standard data for healthyduck.org

Resource Scan

Scan Details

Site Domain healthyduck.org
Base Domain healthyduck.org
Scan Status Ok
Last Scan2025-06-29T22:41:43+00:00
Next Scan 2025-07-06T22:41:43+00:00

Last Scan

Scanned2025-06-29T22:41:43+00:00
URL https://healthyduck.org/robots.txt
Domain IPs 104.21.52.96, 172.67.197.217, 2606:4700:3030::ac43:c5d9, 2606:4700:3035::6815:3460
Response IP 104.21.52.96
Found Yes
Hash afb5caede3f25970c2799b3d139e7afd537d2c53c853a9abe901bcc9ba8cc456
SimHash 59bdb9628a22

Groups

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

detectify

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin/
Disallow /?
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow */embed$
Disallow */xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Disallow /feed
Disallow */feed
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-json/
Disallow /xmlrpc.php
Disallow /readme.html
Allow /wp-includes/*.css
Allow /wp-includes/*.js
Allow /wp-content/plugins/*.css
Allow /wp-content/plugins/*.js
Allow /*.css
Allow /*.js

Other Records

Field Value
sitemap https://healthyduck.org/sitemap.xml