atlanticcyclery.com
robots.txt

Robots Exclusion Standard data for atlanticcyclery.com

Resource Scan

Scan Details

Site Domain	atlanticcyclery.com
Base Domain	atlanticcyclery.com
Scan Status	Ok
Last Scan	2025-12-08T05:11:01+00:00
Next Scan	2026-01-07T05:11:01+00:00

Last Scan

Scanned	2025-12-08T05:11:01+00:00
URL	https://atlanticcyclery.com/robots.txt
Domain IPs	104.21.31.211, 172.67.179.249, 2606:4700:3034::ac43:b3f9, 2606:4700:3037::6815:1fd3
Response IP	104.21.31.211
Found	Yes
Hash	6a671af7256a2c255ee37eb0192516343b39dd2024d471ae1401f9b5b7adeb6f
SimHash	47345a13c5d4

Groups

*

Rule	Path
Allow	/

amazonbot

Rule	Path
Disallow	/

applebot-extended

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

ccbot

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

gptbot

Rule	Path
Disallow	/

meta-externalagent

Rule	Path
Disallow	/
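
The groups above block the listed AI crawlers from the whole site, while the wildcard group allows everything. As a rough illustration, the sketch below feeds a hand-written excerpt of equivalent directives to Python's standard `urllib.robotparser` and checks two user agents. The excerpt text, the `is_allowed` helper, and the `ExampleBot` agent name are assumptions for illustration, not content retrieved by this scan.

```python
from urllib.robotparser import RobotFileParser

# Hand-written excerpt mirroring three of the groups listed above.
# This is an assumed reconstruction, not the archived file itself.
ROBOTS_EXCERPT = """\
User-agent: *
Allow: /

User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /
"""


def is_allowed(agent: str, url: str) -> bool:
    """Return True if `agent` may fetch `url` under the excerpt above."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_EXCERPT.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "ClaudeBot", "ExampleBot"):
        print(agent, is_allowed(agent, "https://atlanticcyclery.com/"))
```

A named group takes precedence over the wildcard group, so the blocked crawlers are refused even though `*` allows `/`.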

*

Rule	Path
Allow	/
Allow	/index.html
Allow	/contact.html
Allow	/login.html
Allow	/*.webp$
Allow	/*.svg$
Allow	/*.png$
Allow	/*.jpg$
Allow	/*.jpeg$
Allow	/manifest.json
Disallow	/admin/
Disallow	/private/
Disallow	/temp/
Disallow	/*.log$
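
Several paths in this group use the `*` wildcard and the `$` end-of-path anchor defined in RFC 9309. The sketch below shows one way to test a URL path against a single such pattern by translating it to a regular expression. The helper names `pattern_to_regex` and `matches` and the sample paths are hypothetical, and the sketch does not implement the longest-match precedence between competing Allow and Disallow rules.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    `*` matches any run of characters; a trailing `$` anchors the end
    of the path. Without `$`, the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape literal pieces and join them with ".*" in place of each "*".
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))


def matches(pattern: str, path: str) -> bool:
    """Return True if `path` is covered by `pattern`."""
    return pattern_to_regex(pattern).match(path) is not None


if __name__ == "__main__":
    print(matches("/*.log$", "/temp/debug.log"))      # pattern ends the path
    print(matches("/*.log$", "/temp/debug.log.txt"))  # extra suffix after .log
    print(matches("/admin/", "/admin/users"))         # plain prefix match
```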

googlebot

Rule	Path
Allow	/

googlebot-image

Rule	Path
Allow	/assets/images/
Allow	/*.webp$
Allow	/*.png$
Allow	/*.jpg$
Allow	/*.jpeg$
Allow	/*.svg$

bingbot

Rule	Path
Allow	/

facebookexternalhit

Rule	Path
Allow	/

twitterbot

Rule	Path
Allow	/

*

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	1

googlebot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	0

bingbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	0

slurp

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	0
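
The `crawl-delay` records above assign a one-second delay to the wildcard group and no delay to the named search crawlers. A minimal sketch of reading such values with Python's standard `urllib.robotparser` follows. The excerpt is an assumed reconstruction of two of these groups, and `delay_for` and `ExampleBot` are hypothetical names.

```python
from typing import Optional
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of two crawl-delay groups from the report above.
ROBOTS_EXCERPT = """\
User-agent: *
Crawl-delay: 1

User-agent: Googlebot
Crawl-delay: 0
"""


def delay_for(agent: str) -> Optional[int]:
    """Return the crawl delay in seconds that applies to `agent`, if any."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_EXCERPT.splitlines())
    return parser.crawl_delay(agent)


if __name__ == "__main__":
    for agent in ("Googlebot", "ExampleBot"):
        print(agent, delay_for(agent))
```

An agent without its own group falls back to the wildcard group's delay.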

Other Records

Field	Value
sitemap	https://atlanticcyclery.com/sitemap.xml

Comments

As a condition of accessing this website, you agree to abide by the following
content signals:
(a) If a content-signal = yes, you may collect content for the corresponding
use.
(b) If a content-signal = no, you may not collect content for the
corresponding use.
(c) If the website operator does not include a content signal for a
corresponding use, the website operator neither grants nor restricts
permission via content signal with respect to the corresponding use.
The content signals and their meanings are:
search: building a search index and providing search results (e.g., returning
hyperlinks and short excerpts from your website's contents). Search does not
include providing AI-generated search summaries.
ai-input: inputting content into one or more AI models (e.g., retrieval
augmented generation, grounding, or other real-time taking of content for
generative AI search answers).
ai-train: training or fine-tuning AI models.
ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF
RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT
AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
BEGIN Cloudflare Managed content
END Cloudflare Managed Content
Robots.txt for cap 999 slots - direct website, not through an agent
https://atlanticcyclery.com/
Allow all search engines to crawl main pages
Disallow sensitive areas (if any exist in future)
Allow Google-specific crawlers
Allow Bing
Allow Facebook crawler
Allow Twitter crawler
Sitemap location
Crawl delay for non-major bots (1 second)
Special rules for major search engines (no delay)

Warnings

`content-signal` is not a known field.
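
The comments above define the `search`, `ai-input`, and `ai-train` content signals with `yes` or `no` values, and the scanner reports `content-signal` as an unknown field. A minimal sketch of reading such a line is given below, assuming a comma-separated `name=value` form. The `parse_content_signal` helper and the sample values are hypothetical; the actual signal values in this file are not shown in the report.

```python
def parse_content_signal(line: str) -> dict[str, bool]:
    """Parse a 'Content-Signal: a=yes, b=no' line into {name: allowed}.

    Entries that are malformed or carry a value other than yes/no are
    skipped, which leaves that use neither granted nor restricted.
    """
    field, _, value = line.partition(":")
    if field.strip().lower() != "content-signal":
        raise ValueError("not a content-signal line")
    signals: dict[str, bool] = {}
    for item in value.split(","):
        name, sep, flag = item.partition("=")
        flag = flag.strip().lower()
        if not sep or flag not in ("yes", "no"):
            continue  # ignore malformed entries
        signals[name.strip().lower()] = flag == "yes"
    return signals


if __name__ == "__main__":
    # Hypothetical sample line, not taken from the scanned file.
    print(parse_content_signal("Content-Signal: search=yes, ai-train=no"))
```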
