Efficient and high-performance control of chemical processes is critical to the safety, sustainability, and economic interest of industrial sectors. Some processes, called nonlinear processes, are more difficult to control because they exhibit more complex behaviors. Additionally, establishing accurate equations completely from physical and chemical principles is unrealistic in these situations. This project aims to develop direct data-driven, namely model-free, methodology for nonlinear control, based on the principle that control-relevant information underlying the nonlinear dynamics can be statistically learned from process data. The focus of this project is on the combination of learning and control theories underpinning (1) the observation of unmeasured (hidden) states, (2) the analysis of unknown dynamics, as well as (3) the synthesis of optimal controllers, in a generic, physically meaningful, and performance-guaranteed framework. The algorithms and methods of this project will be tested with benchmark systems and finally developed into computer codes for practical implementation. Outcomes of this project are expected to facilitate a “big data” transformation of industrial control technology that features time-flexible workflows and increasing workforce diversity. The research outcomes will also be used in the design of a new graduate-level course on machine learning for chemical engineers and incorporated into the existing undergraduate-level process control course, both aiming to improve the “data literacy” of chemical engineering students. In addition, the project will involve outreach activities, including the PI’s lectures at local high schools and NC State’s Engineering Summer Camp, to motivate younger generations to pursue higher studies and careers in STEM. <br/><br/>State-space descriptions of nonlinear dynamics should be used for data-driven control to guarantee physical interpretability and achieve optimal control. The goals of this research program include the following aspects: (1) the development of model-free state observers, i.e., dynamical routines that reconstruct the hidden state trajectories based on the input and output measurements, through machine learning over typical nonlinear observer structures, (2) the analysis and prediction of state-space dynamical behaviors, e.g., bifurcation, chaos, and conservation laws, through a global linearization of nonlinear dynamics as Koopman operators defined on function spaces, and (3) the learning of input-output dissipative properties, where physical constraints are used to enforce conformity to first principles and conic optimization methods are leveraged to synthesize stabilizing and rigorously performance-guaranteed controllers. This data-driven framework is end-to-end (i.e., from data processing to the final control) and thus suitable for implementation in real-world processes. To verify the practicality of the proposed technical approaches, three representative systems – a computational fluid dynamics (CFD) reactor simulator, a Belousov-Zhabotinsky reactive system with video data, and a hydrogel manufacturing device with lab measurements – will be used as benchmarks.<br/><br/>This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.