mylayby.co.nz
robots.txt

Robots Exclusion Standard data for mylayby.co.nz

Resource Scan

Scan Details

Site Domain mylayby.co.nz
Base Domain mylayby.co.nz
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-11-02T17:28:15+00:00
Next Scan 2024-11-16T17:28:15+00:00

Last Successful Scan

Scanned2024-09-25T17:25:04+00:00
URL https://mylayby.co.nz/robots.txt
Domain IPs 13.33.88.11, 13.33.88.24, 13.33.88.45, 13.33.88.6
Response IP 13.33.88.24
Found Yes
Hash 3f96815c4fa49ac8d97894dbd86a09de8c824f64791858cfe93223efa63b859d
SimHash 2814d9c2c68c

Groups

bingbot

Rule Path
Disallow /wishlist*

*

Rule Path
Allow /

siteauditbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ocob

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

youbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

geedoproductsearch

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

facebookcatalog/1.0

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

facebookexternalhit/1.1

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

facebookexternalhit/1.0 (+http://www.facebook.com/externalhit_uatext.php)

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

*

Rule Path
Disallow /*?
Disallow /index.php/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /wishlist/
Disallow /admin/
Disallow /catalogsearch/
Disallow /review/product/
Disallow /sendfriend/
Disallow /enable-cookies/
Disallow /LICENSE.txt
Disallow /LICENSE.html
Disallow /skin/
Disallow /js/
Disallow /checkout/
Disallow /onestepcheckout/
Disallow /customer/
Disallow /customer/account/
Disallow /customer/account/login/
Disallow /catalogsearch/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /*?dir*
Disallow /*?dir=desc
Disallow /*?dir=asc
Disallow /*?limit=all
Disallow /*?mode*
Disallow /*.php$
Disallow /*?SID=
Disallow /app/
Disallow /bin/
Disallow /dev/
Disallow /lib/
Disallow /phpserver/
Disallow /pub/
Disallow /secomm-pma-tool/
Disallow /phpmyadmin/
Disallow /phpMyAdmin/
Disallow /pma/

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

mail.ru

Rule Path
Disallow /

yandex

Rule Path
Disallow /

twiceler

Rule Path
Disallow /
Allow /media/sitemaps/laybylandnz/

seekport crawler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://mylayby.co.nz/sitemap.xml

Comments

  • block SemrushBot
  • AI bots
  • GeedoProductSearch
  • Facebook Crawler Setup
  • Crawlers Setup
  • Directories
  • Stop crawling user account and checkout pages by search engine robot:
  • Blocking native catalog and search pages:
  • Paths (no clean URLs)
  • More reasonable to use canonical tag on these pages.
  • Blocking CMS directories.
  • Block URL Tools
  • Google Crawler Setup
  • Google Image Crawler Setup
  • Allow Sitemap
  • Block Bad Bot