pipette.com
robots.txt

Robots Exclusion Standard data for pipette.com

Resource Scan

Scan Details

Site Domain pipette.com
Base Domain pipette.com
Scan Status Ok
Last Scan2026-02-17T18:58:45+00:00
Next Scan 2026-03-19T18:58:45+00:00

Last Scan

Scanned2026-02-17T18:58:45+00:00
URL https://pipette.com/robots.txt
Domain IPs 104.26.12.229, 104.26.13.229, 172.67.71.114, 2606:4700:20::681a:ce5, 2606:4700:20::681a:de5, 2606:4700:20::ac43:4772
Response IP 104.26.12.229
Found Yes
Hash c683b9683f39319168f700a260e674bd3923b2d0b28cb9c5026437b7e7c8e270
SimHash 230d5fe1def1

Groups

*

Rule Path
Disallow /feed/*
Disallow /*.aspx
Disallow /favorites-add-list.html*
Disallow /favorites-customer-login.html*
Disallow /checkout-customer-information.html*
Disallow /checkout-basket-empty.html*
Disallow /basket-contents.html
Disallow /thank-you-general-inquiry.html*
Disallow /mm5/merchant.mvc*
Disallow /cs/c/?cta_guid=
Disallow /*__hstc%3D
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*
Disallow /ajax-cart.html*
Allow /mm5/graphics/*
Allow /mm5/pdfs/*
Allow /

hanaleibot

Rule Path
Disallow /

*
adsbot-google
adsidxbot
google-inspectiontool
googlebot
googlebot-image
yepbot
twitterbot
ahrefssiteaudit
ahrefsbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://pipette.com/sitemap.xml