shop.boeing.com
robots.txt

Robots Exclusion Standard data for shop.boeing.com

Resource Scan

Scan Details

Site Domain shop.boeing.com
Base Domain boeing.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonRequest timed out.
Last Scan2024-05-31T03:22:40+00:00
Next Scan 2024-08-29T03:22:40+00:00

Last Successful Scan

Scanned2021-09-13T09:33:36+00:00
URL https://shop.boeing.com/robots.txt
Found Yes
Hash 51bed1460db079f358c69a81e7e7826dad6791c281b800c819fd003c0744acbf
SimHash a846171c6fec

Groups

*

Rule Path
Disallow /aviation-supply/cart
Disallow /aviation-supply/checkout
Disallow /aviation-supply/search?text=*
Disallow /aviation-supply/my-account/
Disallow */c/

Other Records

Field Value Comment
crawl-delay 10 10 seconds between page requests

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

voltron

Rule Path
Disallow /

etaospider

Rule Path
Disallow /

Other Records

Field Value
sitemap /aviation-supply/sitemap.xml

Comments

  • robots.txt for shop.boeing.com
  • Do Not delete this file.
  • Global
  • Block access to specific pages
  • Crawl-delay: 5 # 5 seconds between page requests
  • Visit-time: 2100-1100 # only visit between 09:00 PM and 11:00 AM UTC (3:00 PM - 5:00 AM CST)
  • Allow search crawlers to discover the sitemap
  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot
  • Block AHREFS BOT
  • Block Voltron Bot
  • Block Etao Bot

Warnings

  • `request-rate` is not a known field.
  • `visit-time` is not a known field.