NeTS: CSR: Large: Collaborative Research: Co-Design of Network, Storage and Computation Fabrics for Disaggregated Datacenters

Information

  • NSF Award
  • 1704941
Owner
  • Award Id
    1704941
  • Award Effective Date
    9/15/2017 - 8 years ago
  • Award Expiration Date
    8/31/2022 - 3 years ago
  • Award Amount
    $ 186,900.00
  • Award Instrument
    Continuing grant

NeTS: CSR: Large: Collaborative Research: Co-Design of Network, Storage and Computation Fabrics for Disaggregated Datacenters

Traditional datacenters are built using servers, each of which tightly integrates a small amount of CPU (central processing unit), memory and storage onto a single motherboard. However, the end of Dennard's scaling and the slowdown of Moore's Law has led to surfacing of several fundamental limitations of such server-centric architectures (e.g., the memory-capacity wall making CPU-memory co-location unsustainable). Consequently, a new computing paradigm is emerging -- a disaggregated architecture, where each resource type is built as a standalone 'blade' and a network fabric interconnects the resource blades within and across racks. The computer architecture community has established a number of benefits of such disaggregated architectures, including the potential to have 10-100x larger resource capacity. While beneficial from the computer architecture perspective, disaggregated architectures alter several assumptions that once guided the design and optimization of existing networks, systems and applications (e.g., CPU-storage colocation, high CPU-memory bandwidth, storage hierarchy, data locality, failure models, etc.). Capitalizing on the benefits of disaggregated architectures will thus require re-architecting legacy systems and networks. This project aims to co-design the network, storage and compute fabrics for disaggregated datacenters.<br/><br/>On the network front, the project will design ultra-low latency intra-rack and inter-rack fabrics including a new network software stack that incorporates efficient congestion control, failure tolerance and scheduling mechanisms. The co-design of network and storage fabrics will lead to new (distributed) memory and storage management stacks for disaggregated storage, and a resource manager that provides essential isolation, sharing and elasticity guarantees across multiple applications sharing disaggregated storage and network fabrics. Finally, the project will build new distributed programming frameworks and re-architect existing applications to efficiently and correctly operate on disaggregated architectures. <br/><br/>This project will provide solutions to some of the most difficult and important technical questions surrounding this emerging computing paradigm and will have broad community impact primarily through educational and outreach activities, and technology transfer. Software artifacts resulting from this project will be publicly released to ensure repeatability and to foster follow up research. The project also has a substantial educational component including new courses and public release of teaching materials. Finally, the project will provide the necessary thrust to build an inter-disciplinary research community via mentoring of graduate students and postdoctoral scholars, yearly workshops and industry retreats to bridge the gap between industrial development and academic research.

  • Program Officer
    Darleen L. Fisher
  • Min Amd Letter Date
    6/9/2017 - 8 years ago
  • Max Amd Letter Date
    6/9/2017 - 8 years ago
  • ARRA Amount

Institutions

  • Name
    International Computer Science Institute
  • City
    Berkeley
  • State
    CA
  • Country
    United States
  • Address
    1947 CENTER ST STE 600
  • Postal Code
    947044115
  • Phone Number
    5106662900

Investigators

  • First Name
    Scott
  • Last Name
    Shenker
  • Email Address
    shenker@berkeley.edu
  • Start Date
    6/9/2017 12:00:00 AM
  • First Name
    Sylvia
  • Last Name
    Ratnasamy
  • Email Address
    sylvia@eecs.berkeley.edu
  • Start Date
    6/9/2017 12:00:00 AM

Program Element

  • Text
    RES IN NETWORKING TECH & SYS
  • Code
    7363

Program Reference

  • Text
    LARGE PROJECT
  • Code
    7925
  • Text
    WOMEN, MINORITY, DISABLED, NEC
  • Code
    9102