CCF-BSF: AF: Small: Convex and Non-Convex Distributed Learning

Information

NSF Award
1718970

Owner

Toyota Jidosha Kabushiki Kaisha

Award Id
1718970
Award Effective Date
1/1/2018 - 7 years ago
Award Expiration Date
12/31/2020 - 4 years ago
Award Amount
$ 249,978.00
Award Instrument
Standard Grant

Information

CCF-BSF: AF: Small: Convex and Non-Convex Distributed Learning

Machine learning is an increasingly important approach in tackling many difficult scientific, engineering and artificial intelligence tasks, ranging from machine translation and speech recognition, through control of self driving cars, to protein structure prediction and drug design. The core idea of machine learning is to use examples and data to automatically train a system to perform some task. Accordingly, the success of machine learning is tied to availability of large amounts of training data and our ability to process it. Much of the recent success of machine learning is fueled by the large amounts of data (text, images, videos, etc) that can now be collected. But all this data also needs to be processed and learned from---indeed this data flood has shifted the bottleneck, to a large extent, from availability of data to our ability to process it. In particular, the amounts of data involved can no longer be stored and handled on single computers. Consequently, distributed machine learning, where data is processed and learned from on many computers that communicate with each other, is a crucial element of modern large scale machine learning.<br/><br/>The goal of this project is to provide a rigorous framework for studying distributed machine learning, and through it develop efficient methods for distributed learning and a theoretical understanding of the benefits of these methods, as well as the inherent limitations of distributed learning. A central component in the PIs' approach is to model distributed learning as a stochastic optimization problem, where different machines receive samples drawn from the same source distribution, thus allowing methods and analysis that specifically leverage the relatedness between data on different machines. This is crucial for studying how availability of multiple computers can aid in reducing the computational cost of learning. Furthermore, the project also encompasses the more challenging case where there are significant differences between the nature of the data on different machines (for instance, when different machines serve different geographical regions, or when each machine is a personal device, collecting data from a single user). In such a situation, the proposed approach to be studied is to integrate distributed learning with personalization or adaptation, which the PIs argue can not only improve learning performance, but also better leverage distributed computation.<br/><br/>This is an international collaboration, made possible through joint funding with the US-Israel Binational Science Foundation (BSF). The project brings together two PIs that have worked together extensively on related topics in machine learning and optimization.

Program Officer
Tracy J. Kimbrel
Min Amd Letter Date
7/10/2017 - 8 years ago
Max Amd Letter Date
7/10/2017 - 8 years ago
ARRA Amount

Institutions

Name
Toyota Technological Institute at Chicago
City
Chicago
State
IL
Country
United States
Address
6045 S. Kenwood Avenue
Postal Code
606372902
Phone Number
7738340409

Investigators

First Name
Nathan
Last Name
Srebro
Email Address
nati@ttic.edu
Start Date
7/10/2017 12:00:00 AM

Program Element

Text
ALGORITHMIC FOUNDATIONS
Code
7796

Program Reference

Text
SMALL PROJECT
Code
7923

Text
ALGORITHMS
Code
7926

Text
PARAL/DISTRIBUTED ALGORITHMS
Code
7934

CCF-BSF: AF: Small: Convex and Non-Convex Distributed Learning

Information

Owner

Award Id

Award Effective Date

Award Expiration Date

Award Amount

Award Instrument

CCF-BSF: AF: Small: Convex and Non-Convex Distributed Learning

Program Officer

Min Amd Letter Date

Max Amd Letter Date

ARRA Amount

Institutions

Name

City

State

Country

Address

Postal Code

Phone Number

Investigators

First Name

Last Name

Email Address

Start Date

Program Element

Text

Code

Program Reference

Text

Code

Text

Code

Text

Code