SBIR Phase I: A Multithreaded Storage Engine using Highly-Concurrent Fractal Trees

Information

  • NSF Award
  • 0945687
Owner
  • Award Id
    0945687
  • Award Effective Date
    1/1/2010
  • Award Expiration Date
    6/30/2010
  • Award Amount
    $ 150,000.00
  • Award Instrument
    Standard Grant

This Small Business Innovation Research Phase I project will investigate techniques for implementing high-performance databases on multi-core computers, focusing on how to support concurrent activity with provably good thread scheduling in "Fractal Tree" databases. Today's databases suffer from resource imbalances among storage bandwidth, disk-seek rate, and CPU core capacity, leading to underperformance, cumbersome workarounds, and energy inefficiency. The company has developed a high-performance storage engine for MySQL that maintains indexes on live data 100 times faster than traditional engines. The approach employs cache-oblivious Fractal-Tree indexes, which scale with storage bandwidth rather than seek rate, thus addressing the imbalance between bandwidth and disk-seek rate. If successful, this research will produce a database implementation that, for each query, either saturates the CPU cores, saturates disk bandwidth, or consumes all of the inherent parallelism in the query.

The target market comprises organizations that have very large databases and a workload dominated by insertions and queries. Many application areas do not employ databases today because database performance is too slow. Orders-of-magnitude speedups for databases can help grow the market. Applications in finance, retail, homeland security, telecommunications, and scientific computing will benefit from high-performance databases. Furthermore, the researchers hope to lead all database implementers into the multi-core realm. The proposed research will further the understanding of how to schedule database queries when data is well laid out on disk. As users' appetite for data continues to outstrip the availability of fast memory, organizing multithreaded queries on disk-based data for performance will only grow in importance.

  • Program Officer
    Errol Arkilic
  • Min Amd Letter Date
    12/17/2009
  • Max Amd Letter Date
    12/17/2009
  • ARRA Amount

Institutions

  • Name
    Tokutek, Inc.
  • City
    Lexington
  • State
    MA
  • Country
    United States
  • Address
    1 Militia Drive, Suite 11
  • Postal Code
    02421-4703
  • Phone Number
    3392230680

Investigators

  • First Name
    Bradley
  • Last Name
    Kuszmaul
  • Email Address
    bradley@mit.edu
  • Start Date
    12/17/2009 12:00:00 AM

FOA Information

  • Name
    Industrial Technology
  • Code
    308000