Collaborative Research: III: Small: Closing Sim-to-Real Gap in Reinforcement Learning via Randomization, Alignment, and Derivation

Information

NSF Award
2426341

Owner

OREGON HEALTH AND SCIENCE UNIVERSITY

Award Id
2426341
Award Effective Date
9/1/2024 - 9 months ago
Award Expiration Date
8/31/2027 - 2 years from now
Award Amount
$ 99,987.00
Award Instrument
Standard Grant

Information

Collaborative Research: III: Small: Closing Sim-to-Real Gap in Reinforcement Learning via Randomization, Alignment, and Derivation

Features are used to describe the characteristics of entities. For example, "age", "smoking or not", and "years of smoking" are features of a patient, which can be used to describe the patient's physical condition, and furthermore, to predict if she or he is likely to get lung cancer. A combination of features could be more helpful to the prediction, e.g., "age" minus "years of smoking" can be a new feature to indicate how early the patient starts smoking. This kind of feature combination is called feature generation. In the big data era, there exist enormous numbers of features, and it is not realistic to generate features manually by human experts. This project will build new technologies to automatically generate new features based on existing features, to better describe the objects and entities, and to gain better prediction performance. Additionally, this project aims to substantially improve the traceability, affordability, and explainability during the generation process. The developed algorithms and tools are expected to be generalized and applicable to a broad range of scientific and engineering problems, not just in feature generation, but also in other domains such as data pre-processing, social analysis, intelligent transportation systems, healthcare, and the internet of things. <br/><br/>This project identifies three research tasks: (i) A Reinforcement Learning (RL) based approach to realize traceability. Two RL agents are used to select appropriate features, and one RL agent is used to select the appropriate operation. The policy network will be decomposed into two sub-networks, i.e., representation network and value network. Different agents will share the value network to improve training convergence. (ii) A heuristic approach to realize affordability. Information theory-based utility scores will be designed to evaluate features and feature sets, and the heuristic selection strategy will be designed in the generation process. (iii) A Large Language Model (LLM) based approach to realize explainability. The tabular data will be serialized into natural language strings, and comprehensive prompts will be designed incorporating feature generation expertise and domain expertise. The LLM can generate features with explanations by fine-tuning it with prompts. Two strategies will be proposed to compress the prompt. The proposed research will provide novel perspectives and methodologies as to how to generate new features by advancing the understanding and designing new generation strategies. They go beyond conventional generation methodologies that are highly dependent on domain knowledge<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

Program Officer
Raj Acharyaracharya@nsf.gov7032927978
Min Amd Letter Date
8/21/2024 - 9 months ago
Max Amd Letter Date
8/21/2024 - 9 months ago
ARRA Amount

Institutions

Name
Oregon Health & Science University
City
PORTLAND
State
OR
Country
United States
Address
3181 SW SAM JACKSON PARK RD
Postal Code
972393011
Phone Number
5034947784

Investigators

First Name
Craig
Last Name
Newgard
Email Address
newgardc@ohsu.edu
Start Date
8/21/2024 12:00:00 AM

Program Element

Text
Info Integration & Informatics
Code
736400

Program Reference

Text
INFO INTEGRATION & INFORMATICS
Code
7364

Text
SMALL PROJECT
Code
7923

Collaborative Research: III: Small: Closing Sim-to-Real Gap in Reinforcement Learning via Randomization, Alignment, and Derivation

Information

Owner

Award Id

Award Effective Date

Award Expiration Date

Award Amount

Award Instrument

Collaborative Research: III: Small: Closing Sim-to-Real Gap in Reinforcement Learning via Randomization, Alignment, and Derivation

Program Officer

Min Amd Letter Date

Max Amd Letter Date

ARRA Amount

Institutions

Name

City

State

Country

Address

Postal Code

Phone Number

Investigators

First Name

Last Name

Email Address

Start Date

Program Element

Text

Code

Program Reference

Text

Code

Text

Code