ezvacuum.com
robots.txt

Robots Exclusion Standard data for ezvacuum.com

Resource Scan

Scan Details

Site Domain ezvacuum.com
Base Domain ezvacuum.com
Scan Status Ok
Last Scan2024-08-30T07:36:38+00:00
Next Scan 2024-09-29T07:36:38+00:00

Last Scan

Scanned2024-08-30T07:36:38+00:00
URL https://ezvacuum.com/robots.txt
Domain IPs 104.16.172.24, 104.16.173.24
Response IP 104.16.172.24
Found Yes
Hash e163774e81ceb7f9782262d6a42e023efcc3e4cc6814751b909173625a22114a
SimHash 59259f41e3d3

Groups

*

Rule Path
Disallow /old_store/
Disallow /magento/
Disallow /beta/
Disallow /admin/
Disallow /checkout/
Disallow /review/
Disallow /app/
Disallow /downloader/
Disallow /lib/
Disallow /pkginfo/
Disallow /report/
Disallow /customer/
Disallow /enable-cookies/
Disallow /sendfriend/
Disallow /wishlist/
Disallow /report/
Disallow /private/
Disallow /poll/
Disallow /install/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /catalog/product_compare/
Disallow /ezvacs-answers/tag/
Disallow /top-searches.html
Disallow /reminder-email.php
Disallow /captcha/refresh/
Disallow /productquestion/question/vote/id/
Disallow /captcha/
Disallow /errors/

Other Records

Field Value
crawl-delay 5

amazon cloudfront

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

mediapartners-google

Rule Path
Allow /

googlebot

Rule Path
Allow /
Disallow

adsbot-google

Rule Path
Allow /
Disallow

googlebot-image

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.ezvacuum.com/blog/sitemap.xml
sitemap https://www.ezvacuum.com/sitemap.xml

Comments

  • Disallow: /js/
  • Disallow: /skin/js/
  • Disallow: /skin/css/
  • Disallow: /var/
  • bots consuming MAXCLIENTS requests and then crashing the server
  • ALLOW MEDIA BOT TO CRAWL ANYWHERE
  • ALLOW IMAGE BOT TO CRAWL ANYWHERE