arecipebite.com
robots.txt

Robots Exclusion Standard data for arecipebite.com

Resource Scan

Scan Details

Site Domain arecipebite.com
Base Domain arecipebite.com
Scan Status Ok
Last Scan2026-01-29T09:59:09+00:00
Next Scan 2026-02-05T09:59:09+00:00

Last Scan

Scanned2026-01-29T09:59:09+00:00
URL https://arecipebite.com/robots.txt
Domain IPs 104.21.90.29, 172.67.193.196, 2606:4700:3030::ac43:c1c4, 2606:4700:3035::6815:5a1d
Response IP 104.21.90.29
Found Yes
Hash 5c8bbb58f5bd7b5c9eb6dc5f3a557e13edbba39b216c064a4f38fe23d680fc88
SimHash 613049418272

Groups

*

Rule Path
Disallow /*?s=
Disallow /search/
Disallow /feed/
Disallow /wprm*print/
Disallow /cgi-bin/
Disallow /xmlrpc.php
Disallow /embed/
Disallow /trackback/
Disallow /comments/
Disallow /tag/
Disallow /wp-login.php
Disallow /wp-admin/
Disallow */page/
Allow /wp-admin/admin-ajax.php

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.arecipebite.com/sitemap_index.xml

Comments

  • ===========================
  • Custom robots.txt for SEO + Bot Protection
  • ===========================
  • --- General SEO rules ---
  • --- Sitemap reference (Yoast SEO) ---
  • ===========================
  • Bot Restrictions
  • ===========================