bobscycle.com
robots.txt

Robots Exclusion Standard data for bobscycle.com

Resource Scan

Scan Details

Site Domain	bobscycle.com
Base Domain	bobscycle.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-08-29T23:42:34+00:00
Next Scan	2024-11-27T23:42:34+00:00

Last Successful Scan

Scanned	2023-05-01T23:31:23+00:00
URL	https://bobscycle.com/robots.txt
Domain IPs	18.66.218.125, 18.66.218.13, 18.66.218.36, 18.66.218.73
Response IP	18.65.3.107
Found	Yes
Hash	c8f54beb570212edb96db6061376108c5fa59daeeb49ff61b6f486a7fc406481
SimHash	2fb4fc81c7f1

Groups

googlebot-image

Rule	Path
Disallow

bingbot

Rule	Path
Disallow	/404/
Disallow	/app/
Disallow	/cgi-bin/
Disallow	/downloader/
Disallow	/errors/
Disallow	/includes/
Disallow	/lib/
Disallow	/magento/
Disallow	/pkginfo/
Disallow	/report/
Disallow	/scripts/
Disallow	/shell/
Disallow	/skin/
Disallow	/stats/
Disallow	/var/

Other Records

Field	Value
crawl-delay	30
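
The bingbot group above can be checked programmatically. The following is a minimal sketch, assuming Python's standard-library `urllib.robotparser` and a hand-reconstructed subset of the group; it does not fetch the live file, the sample URLs are hypothetical, and a real crawler may interpret the rules differently.

```python
from urllib.robotparser import RobotFileParser

# Subset of the bingbot group, reconstructed from the scan tables above.
# This is an assumption about the file's layout, not a copy of the live file.
ROBOTS_TXT = """\
User-agent: bingbot
Disallow: /404/
Disallow: /app/
Disallow: /skin/
Disallow: /var/
Crawl-delay: 30
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def bingbot_may_fetch(url: str) -> bool:
    """Return True if the reconstructed rules allow bingbot to fetch url."""
    return parser.can_fetch("bingbot", url)


# Hypothetical URLs, used only to exercise the rules.
blocked = bingbot_may_fetch("https://bobscycle.com/app/example")
allowed = bingbot_may_fetch("https://bobscycle.com/example-product.html")
delay = parser.crawl_delay("bingbot")  # seconds between requests, if declared
```

Note that `urllib.robotparser` matches rules as plain path prefixes, which is sufficient for this group because none of its paths use wildcards.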

*

Rule	Path
Disallow	/404/
Disallow	/app/
Disallow	/cgi-bin/
Disallow	/downloader/
Disallow	/errors/
Disallow	/includes/
Disallow	/lib/
Disallow	/magento/
Disallow	/pkginfo/
Disallow	/report/
Disallow	/scripts/
Disallow	/shell/
Disallow	/stats/
Disallow	/var/
Disallow	/index.php/
Disallow	/catalog/product_compare/
Disallow	/catalogsearch/
Disallow	/checkout/
Disallow	/control/
Disallow	/contacts/
Disallow	/customer/
Disallow	/customize/
Disallow	/newsletter/
Disallow	/poll/
Disallow	/review/
Disallow	/sendfriend/
Disallow	/tag/
Disallow	/wishlist/
Disallow	/catalog/product/gallery/
Disallow	/cron.php
Disallow	/cron.sh
Disallow	/error_log
Disallow	/install.php
Disallow	/LICENSE.html
Disallow	/LICENSE.txt
Disallow	/LICENSE_AFL.txt
Disallow	/STATUS.txt
Disallow	/*.php$
Disallow	/*?SID=
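
Most rules in this group are plain path prefixes, but the last two use pattern syntax: under RFC 9309, `*` matches any run of characters and a trailing `$` anchors the pattern to the end of the URL. A minimal sketch of that matching follows, assuming a straightforward translation to a regular expression; `rule_to_regex` and `rule_matches` are hypothetical helper names, the sample paths are made up, and individual crawlers may handle patterns differently.

```python
import re


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces and let each "*" match any run of characters.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    # A trailing "$" in the rule pins the match to the end of the path.
    return re.compile(pattern + (r"\Z" if anchored else ""))


def rule_matches(rule: str, path: str) -> bool:
    """Return True if path (including any query string) matches rule."""
    # re.match anchors at the start, mirroring prefix matching of rules.
    return rule_to_regex(rule).match(path) is not None


# Hypothetical paths, used only to exercise the two pattern rules.
php_exact = rule_matches("/*.php$", "/index.php")
php_query = rule_matches("/*.php$", "/index.php?x=1")
sid_param = rule_matches("/*?SID=", "/catalog/page.html?SID=abc")
```

In this sketch `/*.php$` matches only paths that end in `.php`, so a `.php` URL followed by a query string is not caught by that rule, while `/*?SID=` matches any URL containing `?SID=` after the leading slash.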

Other Records

Field	Value
crawl-delay	30

megaindex.ru

Rule	Path
Disallow	/

megaindex.com

Rule	Path
Disallow	/

Other Records

Field	Value
sitemap	https://www.bobscycle.com/sitemap.xml

Comments

****************************************************************************
robots.txt
: Robots, spiders, and search engines use this file to determine which
content they should *not* crawl while indexing your website.
: This system is called "The Robots Exclusion Standard."
: It is strongly encouraged to use a robots.txt validator to check
for valid syntax before any robots read it!
Examples:
Instruct all robots to stay out of the admin area.
: User-agent: *
: Disallow: /admin/
Restrict Google and MSN from indexing your images.
: User-agent: Googlebot
: Disallow: /images/
: User-agent: MSNBot
: Disallow: /images/
****************************************************************************
Google Image Crawler Setup
Bing
Disallow: /js/
Crawlers Setup
Directories
Disallow: /js/
Disallow: /skin/
Paths (clean URLs)
Disallow: /catalog/category/view/
Disallow: /catalog/product/view/
Files
Paths (no clean URLs)
Disallow: /*.js$
Disallow: /*.css$
https://megaindex.com/crawler
