zelmu.com
robots.txt

Robots Exclusion Standard data for zelmu.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	zelmu.com
Base Domain	zelmu.com
Scan Status	Ok
Last Scan	2025-12-09T12:26:08+00:00
Next Scan	2025-12-16T12:26:08+00:00

Last Scan

Scanned	2025-12-09T12:26:08+00:00
URL	https://zelmu.com/robots.txt
Domain IPs	104.21.87.32, 172.67.140.23, 2606:4700:3035::ac43:8c17, 2606:4700:3036::6815:5720
Response IP	104.21.87.32
Found	Yes
Hash	6543f64d564682e769e02f362699892e41f5ea36f9292a8f586fadb9239c1757
SimHash	c4b6ce52e495

Groups

*

Rule	Path
Allow	/

Rule

Path

Allow

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

bytespider

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

meta-externalagent

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Allow	/
Allow	/page/
Allow	/pages
Allow	/user/
Allow	/profile
Disallow	/admin/
Disallow	/admin/users
Disallow	/admin/pages
Disallow	/admin/audit-logs
Disallow	/login/
Disallow	/register/
Disallow	/account/
Disallow	/checkout/
Disallow	/cart/
Disallow	/api/
Disallow	/internal/
Disallow	/wp-admin/
Disallow	/cgi-bin/
Disallow	/?utm_
Disallow	/?fbclid
Disallow	/?gclid
Disallow	/?session=
Disallow	/?ref=
Disallow	/?sort=
Disallow	/?page=
Disallow	/assets/large-images/
Disallow	/static/videos/
Disallow	/backup/
Disallow	/cache/
Disallow	/tmp/
Disallow	/download/
Disallow	/exports/

Rule

Path

Allow

/page/

Allow

/pages

Allow

/user/

Allow

/profile

Disallow

/admin/

Disallow

/admin/users

Disallow

/admin/pages

Disallow

/admin/audit-logs

Disallow

/login/

Disallow

/register/

Disallow

/account/

Disallow

/checkout/

Disallow

/cart/

Disallow

/api/

Disallow

/internal/

Disallow

/wp-admin/

Disallow

/cgi-bin/

Disallow

/*?*utm_

Disallow

/*?*fbclid

Disallow

/*?*gclid

Disallow

/*?*session=

Disallow

/*?*ref=

Disallow

/*?*sort=

Disallow

/*?*page=

Disallow

/assets/large-images/

Disallow

/static/videos/

Disallow

/backup/

Disallow

/cache/

Disallow

/tmp/

Disallow

/download/

Disallow

/exports/

Other Records

Field	Value
crawl-delay	2

Field

Value

crawl-delay

googlebot

Rule	Path
Allow	/
Disallow	/admin/
Disallow	/checkout/

Rule

Path

Allow

Disallow

/admin/

Disallow

/checkout/

googlebot-image

Rule	Path
Allow	/images/
Disallow	/assets/large-images/

Rule

Path

Allow

/images/

Disallow

/assets/large-images/

google-adstxt

Rule	Path
Allow	/

Rule

Path

Allow

bingbot

Rule	Path
Allow	/
Disallow	/admin/

Rule

Path

Allow

Disallow

/admin/

Other Records

Field	Value
crawl-delay	2

Field

Value

crawl-delay

bingpreview

Rule	Path
Allow	/

Rule

Path

Allow

slurp

Rule	Path
Allow	/

Rule

Path

Allow

duckduckbot

Rule	Path
Allow	/

Rule

Path

Allow

yandex

Rule	Path
Allow	/
Disallow	/admin/

Rule

Path

Allow

Disallow

/admin/

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

baiduspider

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

sogou spider

Rule	Path
Allow	/

Rule

Path

Allow

exabot

Rule	Path
Allow	/

Rule

Path

Allow

facebot

Rule	Path
Allow	/

Rule

Path

Allow

linkedinbot

Rule	Path
Allow	/

Rule

Path

Allow

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

semrushbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

mj12bot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

dotbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

openai
openaibot
anthropic
anthropicbot
perplexity
perplexitybot
perplexityai
poe
huggingface
huggingfacebot
llm-research
aibot

Rule	Path
Allow	/

Rule

Path

Allow

ia_archiver

Rule	Path
Allow	/

Rule

Path

Allow

Other Records

Field	Value
sitemap	https://zelmu.com/sitemap.xml
sitemap	https://zelmu.com/sitemap.xml

Field

Value

sitemap

https://zelmu.com/sitemap.xml

sitemap

https://zelmu.com/sitemap.xml

Comments

As a condition of accessing this website, you agree to abide by the following
content signals:
(a) If a content-signal = yes, you may collect content for the corresponding
use.
(b) If a content-signal = no, you may not collect content for the
corresponding use.
(c) If the website operator does not include a content signal for a
corresponding use, the website operator neither grants nor restricts
permission via content signal with respect to the corresponding use.
The content signals and their meanings are:
search: building a search index and providing search results (e.g., returning
hyperlinks and short excerpts from your website's contents). Search does not
include providing AI-generated search summaries.
ai-input: inputting content into one or more AI models (e.g., retrieval
augmented generation, grounding, or other real-time taking of content for
generative AI search answers).
ai-train: training or fine-tuning AI models.
ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
BEGIN Cloudflare Managed content
END Cloudflare Managed Content
---------------------------
Combined robots.txt for zelmu.com
Goals: SEO-friendly + reduce server load + protect admin/staging
Includes major search engines + popular AI / research bots (allowed)
---------------------------
===== Global default rules (apply to all crawlers unless overridden) =====
Public pages (explicit for clarity)
Protect admin, auth, checkout, API and sensitive endpoints
Avoid crawling infinite/duplicate URLs (common tracking params)
Reduce crawl of very heavy or low-value folders
Optional: block tag/category archives if they cause duplicate content
Uncomment if your site produces thin archive pages
Disallow: /tag/
Disallow: /category/
===== Specific crawler directives (overrides) =====
Google family
Google ignores Crawl-delay; keep server-side rate limiting if needed
Bing / Microsoft
Yahoo / Slurp
DuckDuckGo
Yandex
Baidu
Sogou, Exalead, Exabot, Soso
Social / Preview bots
SEO / crawler tools (rate-limit or disallow if you don't want them crawling)
If you prefer to block auditing bots, uncomment:
Disallow: /
Research / AI / answer-engine bots — allowed (per "AI everything")
These are common user-agent names; some services may use different UA strings.
Generic crawler tools / others
===== Staging / dev note (use separate robots.txt on staging domain) =====
If you host a staging site (staging.zelmu.com), prefer a strict robots.txt there:
User-agent: *
Disallow: /
And protect with HTTP auth / IP allowlist — robots.txt alone isn't secure.
===== Sitemap =====

Warnings

`content-signal` is not a known field.

zelmu.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

amazonbot

applebot-extended

bytespider

ccbot

claudebot

google-extended

gptbot

meta-externalagent

*

Other Records

googlebot

googlebot-image

google-adstxt

bingbot

Other Records

bingpreview

slurp

duckduckbot

yandex

Other Records

baiduspider

Other Records

sogou spider

exabot

facebot

linkedinbot

ahrefsbot

Other Records

semrushbot

Other Records

mj12bot

Other Records

dotbot

Other Records

openaiopenaibotanthropicanthropicbotperplexityperplexitybotperplexityaipoehuggingfacehuggingfacebotllm-researchaibot

ia_archiver

Other Records

Comments

Warnings

zelmu.com
robots.txt

openai
openaibot
anthropic
anthropicbot
perplexity
perplexitybot
perplexityai
poe
huggingface
huggingfacebot
llm-research
aibot