[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

タグ

twitterとDremelに関するyassのブックマーク (1)

  • Dremel made simple with Parquet

    Columnar storage is a popular technique to optimize analytical workloads in parallel RDBMs. The performance and compression benefits for storing and processing large amounts of data are well documented in academic literature as well as several commercial analytical databases. The goal is to keep I/O to a minimum by reading from a disk only the data required for the query. Using Parquet at Twitter,

    Dremel made simple with Parquet
    yass
    yass 2013/09/14
    " a technique outlined in the Dremel paper from Google. / We will first describe the general model used to represent nested data structures. Then we will explain how this model can be represented as a flat list of columns. Finally we’ll discuss why this representation is effective. "
  • 1