mirimary.com
robots.txt

Robots Exclusion Standard data for mirimary.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	mirimary.com
Base Domain	mirimary.com
Scan Status	Ok
Last Scan	2025-11-02T02:25:14+00:00
Next Scan	2025-12-02T02:25:14+00:00

Last Scan

Scanned	2025-11-02T02:25:14+00:00
URL	https://mirimary.com/robots.txt
Domain IPs	104.21.58.153, 172.67.161.58, 2606:4700:3037::6815:3a99, 2606:4700:3037::ac43:a13a
Response IP	172.67.161.58
Found	Yes
Hash	2a014821971006fb81173350c27bc9974b223b4ce72bad64483cc93feca48245
SimHash	4633c240cd92

Groups

*

Rule	Path
Allow	/

Rule

Path

Allow

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

meta-externalagent

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Disallow	/wp-admin/
Allow	/wp-admin/admin-ajax.php

Rule

Path

Disallow

/wp-admin/

Allow

/wp-admin/admin-ajax.php

oai-searchbot

Rule	Path
Allow	/

Rule

Path

Allow

chatgpt-user

Rule	Path
Allow	/

Rule

Path

Allow

chatgpt-user/2.0

Rule	Path
Allow	/

Rule

Path

Allow

gptbot

Rule	Path	Comment
Disallow	/private/	Example: block private folder
Allow	/	Allow rest of site

Rule

Path

Comment

Disallow

/private/

Example: block private folder

Allow

Allow rest of site

anthropic-ai

Rule	Path
Allow	/

Rule

Path

Allow

claudebot

Rule	Path
Allow	/

Rule

Path

Allow

claude-web

Rule	Path
Allow	/

Rule

Path

Allow

perplexitybot

Rule	Path
Allow	/

Rule

Path

Allow

perplexity-user

Rule	Path
Allow	/

Rule

Path

Allow

google-extended

Rule	Path
Allow	/

Rule

Path

Allow

bingbot

Rule	Path
Allow	/

Rule

Path

Allow

amazonbot

Rule	Path
Allow	/

Rule

Path

Allow

applebot

Rule	Path
Allow	/

Rule

Path

Allow

applebot-extended

Rule	Path
Allow	/

Rule

Path

Allow

facebookbot

Rule	Path
Allow	/

Rule

Path

Allow

meta-externalagent

Rule	Path
Allow	/

Rule

Path

Allow

linkedinbot

Rule	Path
Allow	/

Rule

Path

Allow

duckassistbot

Rule	Path
Allow	/

Rule

Path

Allow

grokbot

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
sitemap	https://mirimary.com/wp-sitemap.xml

Field

Value

sitemap

https://mirimary.com/wp-sitemap.xml

Comments

As a condition of accessing this website, you agree to abide by the following
content signals:
(a) If a content-signal = yes, you may collect content for the corresponding
use.
(b) If a content-signal = no, you may not collect content for the
corresponding use.
(c) If the website operator does not include a content signal for a
corresponding use, the website operator neither grants nor restricts
permission via content signal with respect to the corresponding use.
The content signals and their meanings are:
search: building a search index and providing search results (e.g., returning
hyperlinks and short excerpts from your website's contents). Search does not
include providing AI-generated search summaries.
ai-input: inputting content into one or more AI models (e.g., retrieval
augmented generation, grounding, or other real-time taking of content for
generative AI search answers).
ai-train: training or fine-tuning AI models.
ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
BEGIN Cloudflare Managed content
END Cloudflare Managed Content
===============================
robots.txt for LLM Crawlers
===============================
---------- OPENAI ----------
SearchBot - Used for linking webpages in ChatGPT search
ChatGPT user agent – Human browsing via ChatGPT or Custom GPTs
GPTBot – Used for GPT model training (e.g., GPT-4o, GPT-5)
---------- ANTHROPIC (Claude) ----------
Training and crawling
---------- PERPLEXITY ----------
---------- GOOGLE (Gemini/Gemini App) ----------
---------- MICROSOFT (Bing / Copilot) ----------
---------- AMAZON ----------
---------- APPLE ----------
---------- META ----------
---------- LINKEDIN ----------
---------- DUCKDUCKGO ----------
---------- GROK ----------

Warnings

`content-signal` is not a known field.

mirimary.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

amazonbot

applebot-extended

bytespider

ccbot

claudebot

google-extended

gptbot

meta-externalagent

*

oai-searchbot

chatgpt-user

chatgpt-user/2.0

gptbot

anthropic-ai

claudebot

claude-web

perplexitybot

perplexity-user

google-extended

bingbot

amazonbot

applebot

applebot-extended

facebookbot

meta-externalagent

linkedinbot

duckassistbot

grokbot

Other Records

Comments

Warnings

mirimary.com
robots.txt