III: Small: Combining Stochastics and Numerics for Improved Scalable Matrix Computations

Information

NSF Award
1815054

Owner

INTERNATIONAL COMPUTER SCIENCES, INC.

Award Id
1815054
Award Effective Date
9/1/2018 - 7 years ago
Award Expiration Date
8/31/2021 - 4 years ago
Award Amount
$ 500,000.00
Award Instrument
Standard Grant

Information

III: Small: Combining Stochastics and Numerics for Improved Scalable Matrix Computations

Data are often modeled as matrices. As a result, linear algebraic algorithms, and in particular matrix decompositions, have proven extremely successful in the analysis of datasets in the form of matrices. RandNLA (Randomized Numerical Linear Algebra), which integrates the complementary perspectives that theoretical computer science and numerical linear algebra bring to matrix computations, has led to nontrivial theory and high-quality implementations, and it has proven useful in a range of scientific and internet applications. This project will addresses statistical properties of RandNLA algorithms, and how these algorithms are used in downstream convex and non-convex optimization pipelines. This project will facilitate the development of algorithmic methods for the extraction of knowledge from large genetic, medical, internet, financial, astronomical, and other scientific data sets, and it will also focus on broader interdisciplinary educational opportunities, including undergraduate courses on the mathematics of data science. <br/><br/>Examples of technical challenges of interest include that the randomness inside the algorithm can lead to implicit regularization, and that it can also lead to usefulness in downstream applications that is not captured by existing theory. These and other challenges will be addressed in several complementary ways. First, by developing bootstrapping methods for core RandNLA algorithms. Second, by developing improved statistical analysis of core RandNLA algorithms. Third, by developing non-linear leverage scores for more general statistical objectives. Fourth, by developing methods to combine in a principled manner SGD and RandNLA. And fifth, by providing implementations addressing scientific data analysis applications, and also by considering longer-term directions of interdisciplinary interest. In each case, there will be a focus on complementary stochastic and numerical aspects of RandNLA algorithms, as well as on how RandNLA primitives are used in realistic convex and non-convex machine learning pipelines. This will lead to new insights in algorithmic and statistical theory, as well as more useful algorithms in practical implementations and applications.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Program Officer
Aidong Zhang
Min Amd Letter Date
8/17/2018 - 7 years ago
Max Amd Letter Date
8/17/2018 - 7 years ago
ARRA Amount

Institutions

Name
International Computer Science Institute
City
Berkeley
State
CA
Country
United States
Address
1947 CENTER ST STE 600
Postal Code
947044115
Phone Number
5106662900

Investigators

First Name
Michael
Last Name
Mahoney
Email Address
mmahoney@icsi.berkeley.edu
Start Date
8/17/2018 12:00:00 AM

Program Element

Text
INFO INTEGRATION & INFORMATICS
Code
7364

Program Reference

Text
INFO INTEGRATION & INFORMATICS
Code
7364

Text
SMALL PROJECT
Code
7923

III: Small: Combining Stochastics and Numerics for Improved Scalable Matrix Computations

Information

Owner

Award Id

Award Effective Date

Award Expiration Date

Award Amount

Award Instrument

III: Small: Combining Stochastics and Numerics for Improved Scalable Matrix Computations

Program Officer

Min Amd Letter Date

Max Amd Letter Date

ARRA Amount

Institutions

Name

City

State

Country

Address

Postal Code

Phone Number

Investigators

First Name

Last Name

Email Address

Start Date

Program Element

Text

Code

Program Reference

Text

Code

Text

Code