Like taking an opinion poll, downsampling is a key technique for grappling with the ever-growing amounts of data used in large-scale machine learning (ML) tasks. In particular, the objective function optimized when training an ML model is defined over the full data set, and evaluating this objective function repeatedly during training is impractical, whereas evaluating it on a sample of the data scales much better with minimal loss in accuracy. Instead of sampling uniformly, the sampling process can be biased to prefer parts of the data set that have greater influence on the objective function, resulting in faster convergence and smaller errors in the learning process. The classical approach to sampling, known as Markov chain Monte Carlo (MCMC), relies on the Markov assumption: the next sample depends only on the current one, with no explicit dependence on the samples that came before. It remains to be seen whether a non-Markovian sampling process, one that allows an explicit dependence on past samples, can be turned into a better ML training procedure, either practically or theoretically.

The overarching theme of this project is to transcend the current limitations of sampling, optimization, and machine learning algorithms that have predominantly been built upon the Markovian approach, i.e., MCMC, by exploiting the full potential of going beyond traditional Markov chains in the analysis and design of efficient distributed algorithms. Specifically, this project aims to explore the following three inter-related thrusts. The first is to maximally enhance the sampling efficiency of multiple, interacting nonlinear Markov chains in the form of self-repellent random walks (SRRWs) by designing adaptive degrees of spatio-temporal repellency among multiple walkers, as well as with their "collective" history, while providing theoretical performance guarantees. Second, this project will assess the performance implications for distributed algorithms in ML/optimization and decentralized learning, cast as stochastic approximation and its variants, when driven by a set of adaptive, interacting nonlinear Markov chains such as SRRWs instead of traditional MCMC inputs, and obtain usable performance bounds in both finite-time/finite-sample and asymptotic regimes, striking a balance between faster convergence and efficiency. Third, this project seeks to develop an algorithmic framework in which one can, for a given Markovian environment, always speed up the stochastic approximation algorithm by augmenting it into multi-timescale versions with low computational complexity, as well as co-design such algorithms with carefully constructed nonlinear Markovian sampling strategies when the environment is tunable, as in decentralized learning.
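To make the first two thrusts concrete, the following minimal Python sketch illustrates the kind of nonlinear Markov chain the abstract refers to: an SRRW reweights the transition probabilities of a base chain by the walker's own empirical visit distribution, so frequently visited states are repelled, and the empirical distribution itself evolves as a stochastic-approximation iterate. The kernel form K[x](i, j) proportional to P(i, j)(x_j/mu_j)^(-alpha) follows the published SRRW construction; the function names, the example graph, and all parameter choices below are illustrative assumptions, not artifacts of this project.

```python
import numpy as np

def srrw_trajectory(P, mu, alpha, n_steps, seed=0):
    """Minimal sketch of a single self-repellent random walk (SRRW).

    P      : transition matrix of a base Markov chain on n states
    mu     : stationary distribution of P (the sampling target)
    alpha  : repellence strength; alpha = 0 recovers the base chain
    Returns the visited states and the final empirical distribution.
    """
    rng = np.random.default_rng(seed)
    n = P.shape[0]
    x = np.full(n, 1.0 / n)              # running empirical visit distribution
    state = rng.integers(n)
    visits = np.empty(n_steps, dtype=int)
    for t in range(n_steps):
        # Nonlinear kernel: K[x](i, j) ~ P(i, j) * (x_j / mu_j)^(-alpha),
        # so moves toward over-visited states are down-weighted.
        w = P[state] * (x / mu) ** (-alpha)
        w /= w.sum()
        state = rng.choice(n, p=w)
        # Stochastic-approximation update of the empirical distribution.
        e = np.zeros(n); e[state] = 1.0
        x += (e - x) / (t + 2)
        visits[t] = state
    return visits, x

# Example: base chain = lazy random walk on a 5-cycle (uniform mu).
n = 5
P = np.zeros((n, n))
for i in range(n):
    P[i, (i - 1) % n] = P[i, (i + 1) % n] = 0.25
    P[i, i] = 0.5
visits, x = srrw_trajectory(P, np.full(n, 1.0 / n), alpha=2.0, n_steps=10_000)
```

Roughly speaking, the second thrust asks what happens when samples produced this way replace i.i.d. or MCMC inputs X in a generic stochastic-approximation iterate of the form theta <- theta + step * H(theta, X); the sketch above is only meant to convey that coupling, not the project's actual algorithms or guarantees.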
Broadly speaking, this project has potential impact on a broad range of multi-disciplinary applications where standard MCMC methods and Markovian-driven stochastic and iterative algorithms have been dominant and taken for granted, including sampling from high-dimensional state spaces with graphical constraints, Markovian random walks on general graphs and their applications to distributed inference tasks, learning algorithms and stochastic approximation in a Markovian environment, stochastic optimization with Markovian noise, and beyond.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.