dailyyummys.com
robots.txt

Robots Exclusion Standard data for dailyyummys.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	dailyyummys.com
Base Domain	dailyyummys.com
Scan Status	Ok
Last Scan	2026-02-07T23:47:11+00:00
Next Scan	2026-02-14T23:47:11+00:00

Last Scan

Scanned	2026-02-07T23:47:11+00:00
URL	https://dailyyummys.com/robots.txt
Domain IPs	104.21.13.254, 172.67.133.147, 2606:4700:3033::6815:dfe, 2606:4700:3035::ac43:8593
Response IP	172.67.133.147
Found	Yes
Hash	28c8535ba906d945b956533d437937286116a763929305d5d19b88a18458da88
SimHash	64334953cd74

Groups

*

Rule	Path
Allow	/

Rule

Path

Allow

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

meta-externalagent

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Disallow	/wprm_print/
Disallow	/cgi-bin/
Disallow	/xmlrpc.php
Disallow	*/embed/
Disallow	/trackback/
Disallow	/comments/
Disallow	/wp-login.php
Disallow	/wp-admin/
Disallow	*/page/
Allow	/wp-admin/admin-ajax.php

Rule

Path

Disallow

/wprm_print/

Disallow

/cgi-bin/

Disallow

/xmlrpc.php

Disallow

*/embed/

Disallow

/trackback/

Disallow

/comments/

Disallow

/wp-login.php

Disallow

/wp-admin/

Disallow

*/page/

Allow

/wp-admin/admin-ajax.php

Rule	Path
Allow	/

Rule

Path

Allow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

oai-searchbot

Rule	Path
Allow	/

Rule

Path

Allow

chatgpt-user

Rule	Path
Allow	/

Rule

Path

Allow

claude-web

Rule	Path
Allow	/

Rule

Path

Allow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

googleother

Rule	Path
Allow	/

Rule

Path

Allow

perplexitybot

Rule	Path
Allow	/

Rule

Path

Allow

deepseekbot

Rule	Path
Allow	/

Rule

Path

Allow

mistralbot

Rule	Path
Allow	/

Rule

Path

Allow

grokbot

Rule	Path
Allow	/

Rule

Path

Allow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

coherebot

Rule	Path
Allow	/

Rule

Path

Allow

meta-externalcrawler

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
sitemap	https://www.dailyyummys.com/sitemap_index.xml

Field

Value

sitemap

https://www.dailyyummys.com/sitemap_index.xml

Comments

As a condition of accessing this website, you agree to abide by the following
content signals:
(a) If a Content-Signal = yes, you may collect content for the corresponding
use.
(b) If a Content-Signal = no, you may not collect content for the
corresponding use.
(c) If the website operator does not include a Content-Signal for a
corresponding use, the website operator neither grants nor restricts
permission via Content-Signal with respect to the corresponding use.
The content signals and their meanings are:
search: building a search index and providing search results (e.g., returning
hyperlinks and short excerpts from your website's contents). Search does not
include providing AI-generated search summaries.
ai-input: inputting content into one or more AI models (e.g., retrieval
augmented generation, grounding, or other real-time taking of content for
generative AI search answers).
ai-train: training or fine-tuning AI models.
ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
BEGIN Cloudflare Managed content
END Cloudflare Managed Content
===== Standard WordPress-Einstellungen =====
=====================================================
--- Pinterest ---
=====================================================
=== AI / LLM CRAWLER SETTINGS – UPDATED OCT 2025 ====
=====================================================
--- OpenAI ---
--- Anthropic (Claude) ---
--- Google Gemini ---
blockt Training
erlaubt Suche/Zitate
--- Perplexity ---
--- DeepSeek ---
--- Mistral / Mixtral ---
--- xAI / Grok ---
--- Common Crawl (meist Trainingsfeed) ---
--- Cohere, Meta / LLaMA Indexing (optional erlauben) ---

Warnings

`content-signal` is not a known field.

/.well-known/

dailyyummys.com
robots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

amazonbot

applebot-extended

bytespider

ccbot

claudebot

google-extended

gptbot

meta-externalagent

*

pinterest

gptbot

oai-searchbot

chatgpt-user

claude-web

google-extended

googleother

perplexitybot

deepseekbot

mistralbot

grokbot

ccbot

coherebot

meta-externalcrawler

Other Records

Comments

Warnings

dailyyummys.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

amazonbot

applebot-extended

bytespider

ccbot

claudebot

google-extended

gptbot

meta-externalagent

*

pinterest

gptbot

oai-searchbot

chatgpt-user

claude-web

google-extended

googleother

perplexitybot

deepseekbot

mistralbot

grokbot

ccbot

coherebot

meta-externalcrawler

Other Records

Comments

Warnings

dailyyummys.com
robots.txt