nosugar.today
robots.txt

Robots Exclusion Standard data for nosugar.today

Resource Scan

Scan Details

Site Domain	nosugar.today
Base Domain	nosugar.today
Scan Status	Ok
Last Scan	2025-12-22T09:04:10+00:00
Next Scan	2026-01-21T09:04:10+00:00

Last Scan

Scanned	2025-12-22T09:04:10+00:00
URL	https://nosugar.today/robots.txt
Domain IPs	104.21.18.183, 172.67.183.28, 2606:4700:3030::6815:12b7, 2606:4700:3033::ac43:b71c
Response IP	104.21.18.183
Found	Yes
Hash	2099b6b52c9e033d74f75bae84d9a576c968d825f5324b93f29b85ef77861cf7
SimHash	44254b43c597

Groups

*

Rule	Path
Allow	/

amazonbot

Rule	Path
Disallow	/

applebot-extended

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

gptbot

Rule	Path
Disallow	/

meta-externalagent

Rule	Path
Disallow	/
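
The named groups above only take effect for crawlers whose user-agent matches them; every other crawler falls back to the `*` groups. The following is a minimal sketch of that selection step, not the scanner's or any crawler's actual implementation. It assumes a case-insensitive substring match of the group name against the user-agent string, and the user-agent strings in the usage note are hypothetical.

```python
# Group names copied from the scan above. Every named group listed so far
# carries the single rule "Disallow /"; the first "*" group carries "Allow /".
GROUPS = {
    "amazonbot": [("disallow", "/")],
    "applebot-extended": [("disallow", "/")],
    "bytespider": [("disallow", "/")],
    "ccbot": [("disallow", "/")],
    "claudebot": [("disallow", "/")],
    "google-extended": [("disallow", "/")],
    "gptbot": [("disallow", "/")],
    "meta-externalagent": [("disallow", "/")],
    "*": [("allow", "/")],
}


def select_group(user_agent, groups=GROUPS):
    """Return the name of the group a crawler should obey.

    Assumption: a named group applies when its name occurs in the
    user-agent string (case-insensitive). A named match takes precedence
    over "*"; if several names match, the longest one is chosen.
    """
    ua = user_agent.lower()
    matches = [name for name in groups if name != "*" and name in ua]
    if matches:
        return max(matches, key=len)
    return "*" if "*" in groups else None
```

With these assumptions, a user-agent string containing `GPTBot` selects the `gptbot` group and is therefore denied the whole site, while a string such as `ExampleCrawler/1.0` selects `*`.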

*

Rule	Path
Allow	/sitemap.xml
Allow	/sitemap/
Allow	/

*

Rule	Path
Disallow	/*.htm$

*

Rule	Path
Disallow	/app/
Disallow	/bin/
Disallow	/dev/
Disallow	/lib/
Disallow	/phpserver/
Disallow	/pkginfo/
Disallow	/report/
Disallow	/setup/
Disallow	/update/
Disallow	/var/
Disallow	/vendor/
Disallow	/index.php
Disallow	/cron.php
Disallow	/cron.sh
Disallow	/checkout/
Disallow	/customer/
Disallow	/wishlist/
Disallow	/newsletter/
Disallow	/sendfriend/
Disallow	/review/
Disallow	/catalogsearch/
Disallow	/search/
Disallow	/tag/
Disallow	/*?dir=
Disallow	/*?limit=
Disallow	/*?mode=
Disallow	/*?order=
Disallow	/catalog/product_compare/
Disallow	/sales/guest/
Disallow	/downloadable/
Disallow	/no-route
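
Because the file contains several `*` groups with both Allow and Disallow rules, the outcome for a given path depends on how the rules are combined. The sketch below is one way to evaluate them, assuming the conventions described in RFC 9309: groups for the same user agent are merged, `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. Only a subset of the rules listed above is included, and the function names are illustrative.

```python
import re

# A subset of the rules from the "*" groups above, merged into one list.
RULES = [
    ("allow", "/"),
    ("allow", "/sitemap.xml"),
    ("allow", "/sitemap/"),
    ("disallow", "/*.htm$"),
    ("disallow", "/app/"),
    ("disallow", "/checkout/"),
    ("disallow", "/catalogsearch/"),
    ("disallow", "/*?order="),
    ("disallow", "/no-route"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the match at the end of the
    path; everything else is matched literally from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Apply longest-match precedence; no matching rule means allowed."""
    best_len, verdict = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longer pattern wins; on equal length, "allow" wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, verdict = length, kind == "allow"
    return verdict
```

Under these assumptions, `/checkout/cart` and `/about.htm` are disallowed, while `/about.html` and `/sitemap/sitemap_en.xml` remain allowed, since only `Allow /` and `Allow /sitemap/` match them.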

Other Records

Field	Value
sitemap	https://nosugar.com.iq/sitemap.xml
sitemap	https://nosugar.com.iq/sitemap/sitemap_ar.xml
sitemap	https://nosugar.com.iq/sitemap/sitemap_en.xml
sitemap	https://nosugar.com.iq/sitemap/blog_sitemap_ar.xml
sitemap	https://nosugar.com.iq/sitemap/blog_sitemap_en.xml

Comments

As a condition of accessing this website, you agree to abide by the following
content signals:
(a) If a content-signal = yes, you may collect content for the corresponding
use.
(b) If a content-signal = no, you may not collect content for the
corresponding use.
(c) If the website operator does not include a content signal for a
corresponding use, the website operator neither grants nor restricts
permission via content signal with respect to the corresponding use.
The content signals and their meanings are:
search: building a search index and providing search results (e.g., returning
hyperlinks and short excerpts from your website's contents). Search does not
include providing AI-generated search summaries.
ai-input: inputting content into one or more AI models (e.g., retrieval
augmented generation, grounding, or other real-time taking of content for
generative AI search answers).
ai-train: training or fine-tuning AI models.
ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT
AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
BEGIN Cloudflare Managed content
END Cloudflare Managed Content
==============================================================================
Default robots.txt for Magento 2
==============================================================================
Allow all search engines full access initially.
This rule also applies to AI bots since they are not explicitly blocked.
General rules for all bots
Disallow technical and system folders/files
Disallow customer-specific and checkout paths
Disallow internal search results and filtering/sorting parameters
Disallow other non-content pages
Sitemap location

Warnings

`content-signal` is not a known field.
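
The warning indicates that the scanner does not interpret `content-signal` records, so the actual signal values are not shown in this report. As a rough illustration of the semantics described in the comments, the sketch below parses a line of the form `Content-Signal: key=yes, key=no`. The function name and the example line in the usage note are hypothetical and are not taken from this site's file.

```python
def parse_content_signal(line):
    """Parse a 'Content-Signal: a=yes, b=no' line into a dict of booleans.

    Returns None if the line is not a content-signal record. Signals that
    are not listed are simply absent from the result, which corresponds to
    "neither grants nor restricts permission" in the comments above.
    """
    field, _, value = line.partition(":")
    if field.strip().lower() != "content-signal":
        return None
    signals = {}
    for item in value.split(","):
        key, sep, setting = item.partition("=")
        setting = setting.strip().lower()
        # Ignore malformed items and values other than yes/no.
        if sep and setting in ("yes", "no"):
            signals[key.strip().lower()] = setting == "yes"
    return signals
```

For a hypothetical line `Content-Signal: search=yes, ai-train=no`, the result maps `search` to `True` and `ai-train` to `False`, and contains no entry for `ai-input`.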
