mylayby.co.nz
robots.txt

Robots Exclusion Standard data for mylayby.co.nz

Resource Scan

Scan Details

Site Domain	mylayby.co.nz
Base Domain	mylayby.co.nz
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-11-02T17:28:15+00:00
Next Scan	2024-11-16T17:28:15+00:00

Last Successful Scan

Scanned	2024-09-25T17:25:04+00:00
URL	https://mylayby.co.nz/robots.txt
Domain IPs	13.33.88.11, 13.33.88.24, 13.33.88.45, 13.33.88.6
Response IP	13.33.88.24
Found	Yes
Hash	3f96815c4fa49ac8d97894dbd86a09de8c824f64791858cfe93223efa63b859d
SimHash	2814d9c2c68c

Groups

bingbot

Rule	Path
Disallow	/wishlist*

*

Rule	Path
Allow	/

siteauditbot

Rule	Path
Disallow	/

semrushbot-ba

Rule	Path
Disallow	/

semrushbot-si

Rule	Path
Disallow	/

semrushbot-swa

Rule	Path
Disallow	/

semrushbot-ocob

Rule	Path
Disallow	/

gptbot

Rule	Path
Disallow	/

chatgpt-user

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

perplexitybot

Rule	Path
Disallow	/

amazonbot

Rule	Path
Disallow	/

claudebot

Rule	Path
Disallow	/

omgilibot

Rule	Path
Disallow	/

facebookbot

Rule	Path
Disallow	/

applebot

Rule	Path
Disallow	/

anthropic-ai

Rule	Path
Disallow	/

bytespider

Rule	Path
Disallow	/

claude-web

Rule	Path
Disallow	/

diffbot

Rule	Path
Disallow	/

imagesiftbot

Rule	Path
Disallow	/

omgilibot

Rule	Path
Disallow	/

omgili

Rule	Path
Disallow	/

youbot

Rule	Path
Disallow	/

scrapy

Rule	Path
Disallow	/

ahrefsbot

Rule	Path
Disallow	/

geedoproductsearch

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	30

facebookcatalog/1.0

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	30

facebookexternalhit/1.1

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	30

facebookexternalhit/1.0 (+http://www.facebook.com/externalhit_uatext.php)

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	30
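
The crawl-delay records above can be honored with a small gate that tracks when the next request is permitted. This is a minimal sketch, not a reference implementation: `CrawlDelayGate` and its methods are hypothetical names, and it assumes the value 30 means seconds between successive requests. Crawl-delay is not part of RFC 9309, and crawlers that support it do not all interpret it the same way.

```python
import time


class CrawlDelayGate:
    """Tracks when the next request to a host is permitted (hypothetical helper)."""

    def __init__(self, delay_seconds: float, clock=time.monotonic):
        self.delay_seconds = delay_seconds
        self._clock = clock          # injectable so the gate can be tested
        self._last_fetch = None      # no request has been made yet

    def seconds_to_wait(self) -> float:
        """Return how long to sleep before the next request is polite."""
        if self._last_fetch is None:
            return 0.0
        elapsed = self._clock() - self._last_fetch
        return max(0.0, self.delay_seconds - elapsed)

    def mark_fetched(self) -> None:
        """Record that a request was just sent."""
        self._last_fetch = self._clock()
```

A crawler loop would call `time.sleep(gate.seconds_to_wait())` before each request and `gate.mark_fetched()` right after sending it.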

*

Rule	Path
Disallow	/*?
Disallow	/index.php/
Disallow	/catalog/product_compare/
Disallow	/catalog/category/view/
Disallow	/catalog/product/view/
Disallow	/wishlist/
Disallow	/admin/
Disallow	/catalogsearch/
Disallow	/review/product/
Disallow	/sendfriend/
Disallow	/enable-cookies/
Disallow	/LICENSE.txt
Disallow	/LICENSE.html
Disallow	/skin/
Disallow	/js/
Disallow	/checkout/
Disallow	/onestepcheckout/
Disallow	/customer/
Disallow	/customer/account/
Disallow	/customer/account/login/
Disallow	/catalogsearch/
Disallow	/catalog/product_compare/
Disallow	/catalog/category/view/
Disallow	/catalog/product/view/
Disallow	/*?dir*
Disallow	/*?dir=desc
Disallow	/*?dir=asc
Disallow	/*?limit=all
Disallow	/*?mode*
Disallow	/*.php$
Disallow	/*?SID=
Disallow	/app/
Disallow	/bin/
Disallow	/dev/
Disallow	/lib/
Disallow	/phpserver/
Disallow	/pub/
Disallow	/secomm-pma-tool/
Disallow	/phpmyadmin/
Disallow	/phpMyAdmin/
Disallow	/pma/
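
Several paths in this group use the `*` wildcard and the `$` end anchor. The sketch below shows one way such rules could be evaluated, assuming the RFC 9309 convention that the longest matching pattern wins and that Allow wins a tie. The names `rule_to_regex`, `is_allowed`, `STAR_RULES`, and `TWICELER_RULES` are illustrative, and the rule lists are small excerpts of the groups in this report, not the full file.

```python
import re


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # '*' matches any run of characters; everything else is literal.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(rules, url_path: str) -> bool:
    """rules is a list of (allow, pattern) pairs; the longest match decides."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for allow, pattern in rules:
        if not pattern:
            continue  # an empty path matches nothing
        if rule_to_regex(pattern).match(url_path):
            length = len(pattern)
            if length > best_len or (length == best_len and allow):
                best_len, best_allow = length, allow
    return best_allow


# Excerpt of the two '*' groups, treated as one merged rule set (an assumption).
STAR_RULES = [
    (True, "/"),
    (False, "/*?"),
    (False, "/checkout/"),
    (False, "/*.php$"),
    (False, "/*?dir*"),
]

# The twiceler group: everything blocked except the sitemap directory.
TWICELER_RULES = [
    (False, "/"),
    (True, "/media/sitemaps/laybylandnz/"),
]

print(is_allowed(STAR_RULES, "/some-product.html"))
print(is_allowed(STAR_RULES, "/shoes?dir=asc"))
print(is_allowed(TWICELER_RULES, "/media/sitemaps/laybylandnz/sitemap.xml"))
```

Under these assumptions the product page is allowed, the sorted listing URL is blocked by the query-string rules, and the twiceler sitemap path is allowed because its Allow pattern is longer than `Disallow /`.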

googlebot

Rule	Path
Disallow

googlebot-image

Rule	Path
Disallow

mail.ru

Rule	Path
Disallow	/

yandex

Rule	Path
Disallow	/

twiceler

Rule	Path
Disallow	/
Allow	/media/sitemaps/laybylandnz/

seekport crawler

Rule	Path
Disallow	/

Other Records

Field	Value
sitemap	https://mylayby.co.nz/sitemap.xml
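
For the simple groups in this report, Python's standard `urllib.robotparser` is enough to query the rules. The sketch below parses a hand-written excerpt: `ROBOTS_EXCERPT` is reconstructed from the tables in this report and is an assumption, not the original file. The standard-library parser does plain prefix matching, so the wildcard paths in the `*` group are left out on purpose.

```python
from urllib.robotparser import RobotFileParser

# Excerpt reconstructed from the report's tables (not the original file).
ROBOTS_EXCERPT = """\
User-agent: GPTBot
Disallow: /

User-agent: GeedoProductSearch
Crawl-delay: 30

Sitemap: https://mylayby.co.nz/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

# GPTBot is blocked from the whole site.
print(parser.can_fetch("GPTBot", "https://mylayby.co.nz/"))

# GeedoProductSearch has no path rules, only a crawl delay.
print(parser.can_fetch("GeedoProductSearch", "https://mylayby.co.nz/"))
print(parser.crawl_delay("GeedoProductSearch"))

# Sitemap records are global and not tied to any group (Python 3.8+).
print(parser.site_maps())
```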

Comments

block SemrushBot
AI bots
GeedoProductSearch
Facebook Crawler Setup
Crawlers Setup
Directories
Stop crawling user account and checkout pages by search engine robot:
Blocking native catalog and search pages:
Paths (no clean URLs)
More reasonable to use canonical tag on these pages.
Blocking CMS directories.
Block URL Tools
Google Crawler Setup
Google Image Crawler Setup
Allow Sitemap
Block Bad Bot
