In recent years, human-in-the-loop control systems have been widely studied. A subset of human-in-the-loop control systems called supervisory control, in which human operators adjust specific setpoints and supervise a predominantly autonomous process, commonly occurs in industry. However, the diversity of personalities, past experiences, and skill levels among operators causes inconsistency in process operation.
Different approaches have been tried to mitigate this inconsistency and improve the plant performance. One approach is to directly model the system and derive the optimal control strategies based on objectives. A second approach is to build a simulator and let operators practice and explore different control strategies. A third approach is to build a simulator and employ machine learning techniques to discover an optimal control policy. What these three approaches have in common is their reliance on some characterization, or model, of the relationship between human input and performance-related output. However, obtaining such a characterization is not trivial for many complex systems of interest.
Physics-based models of industrial process have been successfully derived and used in many cases. But given the complexity of many such processes, including metal casting, using only first-principles can be very difficult to derive a suitable model of the plant dynamics. At the same time, rapid progress in industrial sensing has spurred the development of data-driven methods for modeling. With respect to data-driven methods, especially those involving an industrial process with molten metal, researchers have developed 1) linear models such as autoregressive with extra input (ARX), 2) nonlinear models involving neural networks, and 3) predictions with fuzzy logic. Linear models have simple structures but are less accurate than nonlinear ones, which are more complex and require more computation to identify parameters. Both linear and nonlinear data-driven models have shown some success. However, their prediction accuracy are marginally better than an autoregression model of the form ŷ(k+1)=ay(k). In addition, only some of these models account for human-in-the-loop processes.
A twin roll casting system according to one aspect of the present invention includes a pair of counter-rotating casting rolls, a casting roll controller, a cast strip sensor, and a controller. The pair of counter-rotating casting rolls have a nip between the casting rolls and are capable of delivering cast strip downwardly from the nip. The casting roll controller is configured to adjust at least one process control setpoint for the casting rolls in response to control signals. The cast strip sensor is capable of measuring at least one parameter of the cast strip.
The controller coupled to the cast strip sensor to receive cast strip measurement signals from the cast strip sensor and coupled to the casting roll controller to provide control signals to the casting roll controller, the controller comprising a data-driven model comprising a database of state-input pairs; wherein the controller is configured to execute the following steps at each time step k:
The controller is configured to provide the control signals to the casting roll controller based on the predicted output of the new state observation. In some embodiments, the data-driven model is training-free.
In some embodiments, each sample in the sample database comprises a current state, a next state, a state difference between the next state and the current state, and a plant input.
In some embodiments, the controller is further configured to identify a candidate set of samples from the database whose states and inputs are within a predetermined range of a current state and input; and perform steps (a)-(c) of above on the candidate set of samples. In some embodiments, the candidate set of samples is determined with a fuzzy membership function and a defuzzification process.
In some embodiments, the cast strip sensor comprises a thickness gauge that makes state observations by measuring a thickness of the cast strip in intervals across a width of the cast strip.
In some embodiments, the process control setpoint comprises a setpoint for roll separation force of the casting rolls. In some embodiments, at least one state-input pair comprises chatter and a setpoint for the roll separation force of the casting rolls. In some embodiments, at least one state-input pair comprises a state selected from the group consisting of edge bulge, edge ridge, maximum peak, and high edge flag, and the input comprises a setpoint for the roll separation force of the casting rolls. In some embodiments, at least one state-input pair further states of edge bulge, edge ridge, maximum peak, and high edge flag, and the input comprises a setpoint for the roll separation force of the casting rolls.
For the existing data-driven methods, especially for those including neural networks, how similar a model architecture is to that of the real physical process largely affects the model accuracy. Therefore, modeling the effect of the human's input on the process is more challenging. Considering a general nonlinear system with dynamics x(k+1)=f(x(k),u(k)), What is needed is a data-driven method for predicting the state transition from a given state-input pair by using the state transition of some similar state-input pairs in a given database. That is:
where wi is a similarity weight and D is the set containing all samples similar to the given state-input pair. Since the prediction relies directly on samples in the database, model structure does not need to be specifically designed for a certain system, and the training process is not required.
A training-free data-driven (TFDD) modeling method for simulating the plant dynamics purely from the input-output data is provided. This TFDD method employs the weighted sum of the state transition results of samples in a database to predict the upcoming state of an unseen situation. The ideas of membership function and defuzzification are leveraged to determine the weight of each sample. Due to this property, the disclosed method does not require computational power to determine any model parameter through learning, and its model structure can fit a wide range of complex processes.
In an exemplary embodiment, the TFDD method is applied to a twin-roll steel strip casting process. Verification shows that the TDFF method requires little expertise about the process, and its performance generally exceeds that of a comparable nonlinear autoregressive network with exogenous inputs (NARX) (The MathWorks (2021) model). A principle underlying the proposed TFDD model is to search an existing database of state-input pair samples to identify that which is most similar to a new observed state-input pair. In some embodiments, at each time step k, the following steps are performed:
A full observability of the TFDD system is assumed, and therefore the words state and output may be used interchangeably.
A database is used to construct the TFDD model. The database is composed of samples collected from real physical plant operations. Each sample Si contains four components: the current state xi,1 the next state xi,2, the state difference between the next and the current states Δxi,2, and plant input ui.
S
i
={x
i,1
,x
i,2
Δx
i,2
,u
i} (2)
The components xi,1, ui are used to determine the membership of a given state; the components xi,2, Δxi,2 are then used to predict the next state. The database may comprise any set of structured data, such as a relational database, a flat file database, or other suitable structured data format.
Using all samples in the database for prediction can be inefficient. In some embodiments a candidate set is advantageously employed, and only samples in the candidate set are used to predict the next state. The candidate set contains samples whose states and inputs are within a range of the current state and input that is sufficiently close to yield a reliable prediction. Both the state and input ranges of the candidate set can be determined by analyzing samples in the database (discussed more in detail below). In other embodiments, the ranges are determined based on knowledge of the real plant. For a given state-input pair, (xk, uk), the candidate set can be expressed as;
where P, Q are the dimensions of state and input, respectively.
Referring to
In some embodiments, the triangular membership function is modified with an exponential operation. This modification can mitigate the consequences of having the number of very dissimilar samples outweigh the number of more similar samples. The center average defuzzifier calculates the net similarity by multiplying the similarities of all components and determines the weight by normalizing the net similarity. The step of calculating the net similarity by applying both the modified triangular membership function and the center average defuzzifier can be expressed as
Where cxp, cuq are scanning factors that are used to normalize the difference between the current and next state, and the current and next input, respectively.
A user can set these scaling factors as the maximum state/input difference between two steps among the database or might set these values based on additional understanding of the real system. The predictions of the next state and of the state change are calculated as
The final prediction of the next state is a weighted sum of the predictions of the next state {circumflex over (x)}k+1|k and of the state change Δ{circumflex over (x)}k+1|k.
The weights ap and bp are the normalized sample variances of the state change Δxi,2p and the next state xi,2p in the candidate sample set i∈Dc. They quantify the prediction uncertainty that arises by merely predicting the next state with either Δ{circumflex over (x)}k+1|k or {circumflex over (x)}k+1|k. A larger uncertainty of the state change among candidate samples results in a larger weight being assigned to the prediction of the next state, and vice versa. Hence, by applying these weights, the final prediction of the next state relies more on the prediction with a lower uncertainty.
Twin-roll steel strip casting is a near-net-shape manufacturing process used to improve energy efficiency and reduce operating costs for steel manufacturing. In twin-roll casting, molten steel is poured directly onto the surface of two casting rolls which cool and solidify the steel as two shells, and the two shells formed come together at the nip to form a strip. The process is characterized by rapid thermo-mechanical dynamics which are difficult to control, particularly during the highly transient start-up process. Hence, human operators are tasked with adjusting certain setpoints to achieve desired characteristics of the final product.
A cast sequence start-up process is defined as the period from when the casting process begins to the start of production of the first prime coil. The process is highly nonlinear, and factors such as minor differences in the chemical make-up of the steel, even for the same steel grade, can result in a different initial strip profile. The initial strip head end will not be collected for commercial use, and the operators adjust setpoints to drive the process to a steady-state condition that satisfies final prime product requirements including a desired profile in the shortest possible time to minimize the process start up yield loss.
In this process, operators typically adjust the roll separation force setpoint to mitigate a variety of imperfections in the steel strip profile. Along the strip length, surface defects caused by vibration between 35 and 65 Hz of the casting machine, can be effectively reduced by decreasing the roll separation force.
These vibrations are commonly referred to as “chatter”. Across the strip length, a slight parabolic shape is desired, but if insufficient roll separation force is applied to the strip, undesirable sharp peaks near the edges of the strip, also called “edge spikes”, begin to appear.
The relationship between the roll separation force and the strip profile features is extremely difficult to model. Therefore, the proposed TFDD method is applied to develop such a model using experimental input-output data.
To assist operators in quickly comprehending information and making setpoint adjustments, the aforementioned imperfections of the strip profile are quantified, and their time trajectories are plotted and shown to the operators. Moreover, the edge spikes are characterized by four quantities to provide more information.
These quantities are defined as follows:
All signals are originally sampled with a 1 Hz sampling rate, and a moving-average filter with window size 10 is applied to smooth each signal. After that, the average of each five consecutive samples is taken without sample overlapping and stored in the training database. Doing so results in a sampling rate that aligns with the minimum time interval between two consecutive human commands.
In one example, the dataset contains 123 cast sequences and is divided into training and testing databases. The training database contains 115 sequences (>5500 samples after processing), and the testing database contains 8 sequences.
All data are generated with the same steel grade and width condition using a single casting machine. However, the invention may be applied to various steel grades, widths, and casting machines.
According to one implementation of the above invention, referring to
Twin-roll caster 11 comprises a main machine frame 21 which supports a pair of laterally positioned casting rolls 22 having casting surfaces 22A and forming a nip 27 between them. Molten metal is supplied during a casting campaign from a ladle (not shown) to a tundish 23, through a refractory shroud 24 to a removable tundish 25 (also called distributor vessel or transition piece), and then through a metal delivery nozzle 26 (also called a core nozzle) between the casting rolls 22 above the nip 27. Molten steel is introduced into removable tundish 25 from tundish 23 via an outlet of shroud 24. The tundish 23 is fitted with a slide gate valve (not shown) to selectively open and close the outlet 24 and effectively control the flow of molten metal from the tundish 23 to the caster. The molten metal flows from removable tundish 25 through an outlet and optionally to and through the core nozzle 26.
Molten metal thus delivered to the casting rolls 22 forms a casting pool 30 above nip 27 supported by casting roll surfaces 22A. This casting pool is confined at the ends of the rolls by a pair of side dams or plates 28, which are applied to the ends of the rolls by a pair of thrusters (not shown) comprising hydraulic cylinder units connected to the side dams. The upper surface of the casting pool 30 (generally referred to as the “meniscus” level) may rise above the lower end of the delivery nozzle 26 so that the lower end of the deliver nozzle 26 is immersed within the casting pool.
Casting rolls 22 are internally water cooled by coolant supply (not shown) and driven in counter rotational direction by drives (not shown) so that shells solidify on the moving casting roll surfaces and are brought together at the nip 27 to produce the thin cast strip 12, which is delivered downwardly from the nip between the casting rolls.
Below the twin roll caster 11, the cast steel strip 12 passes within a sealed enclosure 10 to the guide table 13, which guides the strip to a pinch roll, stand 14 through which it exits sealed enclosure 10. The seal of the enclosure 10 may not be complete, but is appropriate to allow control of the atmosphere within the enclosure and access of oxygen to the cast strip within the enclosure. After exiting the sealed enclosure 10, the strip may pass through further sealed enclosures (not shown) after the pinch roll stand 14.
Before the strip enters the hot roll stand, the transverse thickness profile is obtained by thickness gauge 44 and communicated to controller 92. It is in this location that the strip profile is measured. In some embodiments, the controller 92 provides input to the casting roll controller 94 which, for example, may control nip geometry and roll separation force. In some embodiments, controller 92 and controller 94 are combined into a single unit.
To apply the proposed training-free data-driven (TFDD) method, we define the model input as uk={Fk,ak, tk}, where Fk is the roll separation force value at the current step, ak=Fk+1−Fk is the force adjustment from the current to the next step, and tk is the time at the current step with respect to the start of the cast. Note that the action ak (force adjustment) is not equal to ΔFk (change in force). Instead, it represents the force adjustment decision taken by the human operator at step k. Knowledge of absolute force change is not needed because the method does not rely on a physical model of an actual plant. Moreover, the time tk is included in the input vector because the start-up process is transient; therefore, time similarity in the transient period may imply similarity of the model dynamics. Through analyzing the training database D, upper and lower bounds may be based on the change in force between two steps. In one example, the upper bound tup based on the average time length of the start-up process. With these values, the input range rua may be defined. An example of a TFDD model architecture relating roll separation force and time inputs to the strip profile features is illustrated in
In some embodiments, the state vector for the TFDD model contains the five profile features of interest xk=(C, b,eg,fg,np). From the training database, the following may be defined:
|xkp−xi,2p|≤rxp∀i∈D and p∈{C,bg,eg,fg,mp};
in other words, the change of each state component between two steps is upper bounded by its corresponding r. Based on these values, the criteria of the candidate set may be formed as shown in Eqn. 3. The scaling factors cxp or cuq are defined as the range of each state or input variable as obtained from the database.
In some embodiments, to reduce the impact of inaccurate prediction caused by a small candidate set, the model outputs the next state as a weighted sum of the current state and the prediction of the next state if the number of samples in the candidate set is less than 10:
where n is the number of samples in the candidate set.
To benchmark the performance of the TFDD method, it is compared to a nonlinear autoregressive network with exogenous inputs (NARX) model trained in MATLAB to predict the same process. As shown in
The input uk is the same as in the aforementioned TFDD model. To more efficiently train the NARX model, we define a scaled state yk from the original state xk as follows:
The performance of the NARX model is evaluated for different numbers of intermediate variables (5 to 10) by calculating the mean square error of between the actual and the predicted scaled states. Table 1 shows that using six intermediate variables results in the best closed-loop performance among all options.
Both TFDD and NARX models are tested using all 8 test sequences. For each sequence, the actual force trajectory and the initial state vector are given. At any step after the initial, each model predicts the upcoming state based on its previous prediction and the actual force value at the current step.
The TDFF method, as applied to twin roll casting plants, provides several advantages. In one use, the TDFF method may be employed as a simulator to provide feedback and improve human control performance without the expense of operating a casting campaign (and potentially disposing or recycling of extensive scrap materials). In another example, the TDFF method may be employed to generate augmented data for use in training Iterative Learning Controllers, A.I. Controllers, Reinforcement Learning Agent Controllers, etc. In another example, a computer controller may control the casting plant and, based on a current state, generate predicted next states for a plurality of potential inputs, and then select the most optimum input based on the predicted next states.
Number | Date | Country | |
---|---|---|---|
63331013 | Apr 2022 | US |