The present application is a U.S. National Phase filing of International Application No. PCT/AU2020/050979, filed on Sep. 11, 2020, designating the United States of America, which claims priority to Australian Patent Application No. 2019903427, filed on Sep. 13, 2019, and the present application claims priority to and the benefit of the above-identified applications, the disclosures of which are incorporated by reference herein in their entireties.
The present invention concerns methods and apparatus for operating railways in order to adjust train schedules for purposes such as minimizing travel times of trains, minimizing deviations from a given timetable or allocating precedence to trains, whilst ensuring safety and avoiding deadlocks.
Any references to methods, apparatus or documents of the prior art are not to be taken as constituting any evidence or admission that they formed, or form part of the common general knowledge.
Railway systems comprise rail networks that include interconnected blocks of rails and rolling stock such as locomotives and carriages that ride along the rails. For example,
The network 21 includes network devices for controlling the paths of trains over the rails and for causing trains to stop and proceed at and from designated positions or “stations” throughout the rail network. Examples of the network devices include visual display signals 9a, 9b and switches 10a, 10b, for connecting one block of rail with either of two (or more) other blocks of rails, for example to divert train 1a to siding 23. The rail network 21 also includes a data communications system 29 having a data network 31 for transmitting position updates of trains to a central rail network controller 27 and for distributing scheduling data and/or commands for use in controlling the signal indicators 9a, 9b and switches 10a, 10b and thus the timing of trains along the rails and the paths taken by the trains. The data communications system 29 includes suitable radio infrastructure including terrestrial radio stations 14 and satellite stations 16.
Train 1a is shown in
Upon the signal 9 changing state to “proceed” a person operating the locomotive 5 will manipulate control system 11 (
It will be realized that optimizing the scheduling of the journeys of trains along their allocated paths for each journey is important. Optimization is required to minimize the amount of time that a train, such as trains 1a, must wait for another train, such as train 1b, to be able to pass safely and to avoid deadlocks occurring.
Railway traffic, is usually operated on a rail network according to reference schedules. In some cases, these might be fixed cyclical timetables. In other contexts, such as in freight transport, schedules are usually established some time in advance depending on the availability and delivery requirements of the goods to be transported.
Real time operation of a railway network (or “rail network”) as it may be referred to herein) is affected by the presence of disturbances, which manifest themselves as delays or early arrival of trains. These disturbances can originate from a variety of sources including weather conditions, unexpected outages, train driver and passenger behavior, and span a broad range of magnitudes. Consequently, compensating real-time traffic control mechanisms are required to ensure that the railway network is operated correctly and in a manner that minimizes the propagation of these disturbances. The task of making real-time adjustments to the schedule becomes more complicated as railway systems are operated closer to capacity, resulting in complex configurations involving several trains on congested segments of the system that are difficult to resolve optimally manually. At the same time, the propagation of delays is exacerbated in magnitude and extent under the same circumstances. Despite this, a surprising amount of human interaction is still a practical reality for many railway systems [6].
One method that is used for train scheduling is the stringline plot, a prior art example of which is shown in
Trains starting their travel in the opposite direction, i.e. from Stn01 to Stn17, appear on the stringline as a leftward and downward diagonal. Where one train must be sided to await the passage of another, the stringline becomes horizontal as time passes by without movement of the sided train. For example, train 11 was sided at Stn02 for nearly two hours awaiting the passage of the train 99 and train B2. Similarly train 88 was sided 5 twice, once in Stn02 to wait the passage of train F6 and a second time in Stn05 to await the passage of train G7.
As can be seen in the stringline chart of
Clearly it would be advantageous if the timing of the various trains' trips could be altered to achieve different objectives. For example an objective that is often of primary importance is reducing time spent by trains in sidings, which would equate to a reduction in overall length of time needed to take any particular trip thus permitting greater throughput for the railway system and reducing such costs as engine idling, crews, and other time dependent factors.
It is still quite common for humans working in the network controller 27 to construct stringline plots, either entirely manually or with the help of computerized tools for scheduling trains across the railway network. Humans tend to err on the side of caution so that trains may be sided for longer than necessary. Furthermore, constructing a stringline graph for a large railway network with many trains is very demanding and mistakes can occur.
In recent years optimization methods have been used to assist in finding feasible scheduling solutions. However, it has been found that the computational demands for solving the optimization problem for large railway network s with many trains can result in the computer time becoming infeasibly long, even with use of high-speed computer systems.
It is an object of the present invention to provide a method and apparatus for assisting in the scheduling of trains over a rail network that addresses at least one of the problems of the prior art or which is at least a commercially attractive alternative to hitherto known methods and apparatus of the prior art.
According to a first aspect of the present invention there is provided a railway system comprising:
In an embodiment the controls include timings for movements of the trains.
In an embodiment the controls specify positions for the train at the railway network locations.
In an embodiment the controls specify a position comprising a siding at the network location.
In an embodiment the electronic memory contains instructions for the processors to apply control signals based on the controls to traffic controllers of the railway network.
In an embodiment the traffic controllers include signal lights for timing the movement of the trains.
In an embodiment the traffic controllers include switches for directing trains to the positions at the railway network locations.
In an embodiment the electronic memory contains instructions for the processors to transmit a series of train schedules comprising the controls.
In an embodiment the electronic memory contains instructions for the processors to display the train schedules as stringline plots on electronic displays for reference of human operators.
In an embodiment the electronic memory contains instructions for the processors to determine the controls by optimizing an objective function for the trains comprises minimizing total travel time of the trains.
In an embodiment the electronic memory contains instructions for the processors to determine the controls for an optimization horizon, for each train along its path.
In an embodiment the optimization horizon extends to at least one location allowing passing of trains.
In an embodiment the electronic memory contains instructions for the processors to determine said horizon for each train in the system upon determining that the system is in a safe state.
In an embodiment the electronic memory contains instructions for the processors to iteratively extend the optimization horizon for each train in the safe state until it reaches a node of the model, such that the state of the system would be safe if trains transited up to that node from respective current positions thereof.
In an embodiment the electronic memory contains instructions for the processors to extend the optimization horizon for each train until the determined optimization horizon is solvable to obtain a feasible solution for the model.
In an embodiment the electronic memory contains instructions for the processors to determine if the railway network is in a non-deadlocked state.
In an embodiment the electronic memory contains instructions for the processors to apply a time-wise problem decomposition procedure comprising optimizing the objective function by optimizing objective functions for each of a sequence of smaller models for incremental additional portions of time.
In an embodiment the electronic memory contains instructions for the processors to apply a train-wise problem decomposition procedure comprising optimizing the objective function by considering only portions of the number of trains at a time.
In an embodiment the electronic memory contains instructions for the processor to implement an optimization engine for optimizing the objective function for the trains.
In an embodiment the model includes a graph comprised of nodes and edges corresponding to railway network locations and blocks of rails therebetween.
In an embodiment the model defines locations in the railway network allowing passing of trains with nodes including two or more slots for accommodating two or more corresponding trains at the node.
In an embodiment the model further defines locations in the railway network allowing passing of trains with double edges representing double tracks of the railway network.
According to a further aspect of the invention there is provided a method for operating a railway network having a number of trains, the method comprising:
In an embodiment the controls include timings for movements of the trains.
In an embodiment the controls include positions for the train at the railway network locations.
In an embodiment the positions for the train at the railway network locations include a siding.
In an embodiment the method includes applying control signals based on the controls to traffic controllers of the railway network.
In an embodiment the traffic controllers include signal lights for timing the movement of the trains.
In an embodiment the traffic controllers include switches for directing trains to the positions at the railway network locations.
In an embodiment the method includes operating the scheduling machine to transmit a series of train schedules comprising the controls.
In an embodiment the method includes displaying the train schedules as stringline plots on electronic displays for reference of human operators.
In an embodiment the method includes operating the scheduling machine to determine said controls by optimizing an objective function for the trains comprises minimizing total travel time of the trains.
In an embodiment the method includes operating the scheduling machine to determine controls for an optimization horizon, for each train along its path.
In an embodiment the optimization horizon extends to at least one railway network location allowing passing of trains.
In an embodiment the method includes determining the optimization horizon for each train in the system upon determining that the system is in a safe state.
In an embodiment the method includes iteratively extending the optimization horizon for each train in the safe state until it reaches a node of the model, such that the state of the system would be safe if trains transited up to that node from respective current positions thereof.
In an embodiment the method includes further extending the optimization horizon for each train until the determined optimization horizon is solvable to obtain a feasible solution for the model.
In an embodiment the method includes operating the scheduling machine to determine if the system is in a non-deadlocked state.
In an embodiment the method includes applying a time-wise problem decomposition procedure comprising optimizing the objective function by optimizing objective functions for each of a sequence of smaller models for incremental additional portions of time.
In an embodiment the method includes applying a train-wise problem decomposition procedure comprising optimizing the objective function by considering only portions of the number of trains at a time.
In an embodiment the method includes optimizing of the objective function for the trains is with an optimization engine of the scheduling machine.
In an embodiment of the method the model includes a graph comprised of nodes and edges.
In an embodiment the model defines locations in the railway network allowing passing of trains with nodes including two or more slots for accommodating two or more corresponding trains at the node.
In an embodiment the model further defines locations in the railway network allowing passing of trains with double edges representing double tracks of the network.
According to a further aspect there is provided a method for producing controls, such as timings for movement, for trains of a railway network including processing information defining a state of the railway network relative to a model of the network including paths for each of the trains and optimizing an objective function defining a desired outcome for movements of the trains across the network, wherein an optimal solution of the objective function results in values for the controls.
According to another aspect of the invention there is provided a machine configured to perform the method for producing controls for trains.
Preferred features, embodiments and variations of the invention may be discerned from the following Detailed Description which provides sufficient information for those skilled in the art to perform the invention. The Detailed Description is not to be regarded as limiting the scope of the preceding Summary of the Invention in any way. The Detailed Description will make reference to a number of drawings as follows:
1. Overview and Scheduling Machine
Referring now to
The electronic memory contains instructions for the processors 35 to effect a number of tasks as follows:
As previously mentioned, according to a preferred embodiment of the present invention a specially programmed computational device in the form of scheduling machine 33 is provided that is in data communication with the Rail Network Controller 27 via data communication system 29 including data network 31. As will be discussed, scheduling machine 33 accesses a graph 55, comprised of nodes interconnected by edges that models the railway network.
The scheduling machine 33 receives time separated network state data in the form of state data reports xt1, . . . , xtn from the rail network controller 27 via the data communications system 29. Scheduling machine 33 is configured by instructions comprising a software product 40 that it runs to implement a method for processing the network state snapshots to generate time separated schedules S1, . . . , Sm for trains running on the network 21. The rail network controller 27 uses the time separated schedules S1, . . . , Sm to operate traffic controllers such as switches, e.g. switches 10a, 10b and signaling apparatus, e.g. signal lights 9a, 9b of the network in order to dynamically manage rail traffic across the network in accordance with the schedules S1, . . . , Sm.
The motor 18 is electrically coupled to the data network 31 of data communications system 29 and so the switch 10a can be remotely operated by controls in the form of control signals 24 that are ultimately derived from scheduling information generated by scheduling machine 33. Similarly, signal lights such as lights 9a, 9b are also remotely controllable. Consequently, by using traffic controllers of the railway network, such as switches 10a, 10b and signal lights 9a, 9b, and also be sending commands to the trains, train schedules generated by the scheduling machine 33 are able to be implemented in the railway network.
The main board 34 acts as an interface between microprocessors 35 and secondary memory 47. The secondary memory 47 may comprise one or more optical or magnetic, or solid state, drives. The secondary memory 47 stores instructions for an operating system 39. The main board 3 also communicates with random access memory (RAM) 50 and read only memory (ROM) 43. The ROM 43 typically stores instructions for a startup routine, such as a Basic Input Output System (BIOS) which the microprocessor 35 accesses upon start up and which preps the microprocessor 5 for loading of the operating system 39.
The main board 34 also an integrated graphics adapter for driving display 47. The main board 3 will typically include a communications adapter, for example a LAN adaptor or a modem 55, that places the scheduling machine 33 in data communication with data network 29.
An operator 67 of scheduling machine 33 interfaces with it by means of keyboard 49, mouse 21 and display 47.
The operator 67 may operate the operating system 39 to load software product 40. The software product 40 may be provided as tangible, non-transitory, machine readable instructions 59 borne upon a computer readable media such as optical disk 57. Alternatively it might also be downloaded via port 53.
The secondary storage 47, is typically implemented by a magnetic or solid-state data drive and stores the operating system, for example Microsoft Windows Server, and Linux Ubuntu Server are two examples of such an operating system.
The secondary storage 47 also includes a server-side rail traffic scheduling software product 40 according to a preferred embodiment of the present invention which implements a database 42 that is also stored in the secondary storage 47, or at another location accessible to the scheduling machine 33. The database 42 stores the model 55 that is used, in conjunction with the system state data xt1, . . . , xtn by processor 35 under control of software 40 to implement a method for determining optimal rail traffic journeys across the railway network. The database 42 stores the railway network model including data defining edges interconnected by nodes comprising a graph. Scheduling software product 40 includes an optimization engine 41 such as Gurobi Optimizer provided by Gurobi Optimization, LLC of 9450 SW Gemini Dr. #90729, Beaverton, Oregon, 97008-7105, USA; website: www.gurobi.com.
During operation of the scheduling machine 33 the one or more CPUs 35 load the operating system 39 and then load the software 40.
The scheduling machine 33 receives data, for example the network state information xt1, . . . , xtn about the state of the railway network from the data network 29, to which the scheduling machine 33 is connected by means of its data port 53.
In use the scheduling machine 33 is operated by an administrator 67 who is able to log into the scheduling machine interface either directly using mouse 21, keyboard, 49 and display 47, or more usually remotely across network 29. Administrator 67 is able to monitor activity logs and perform various housekeeping functions from time to time in order to keep the scheduling machine 33 operating in an optimal fashion.
It will be realized that scheduling machine 33 is simply one example of a computing environment for executing software 40. Other suitable environments are also possible, for example the software 40 could be executed on a virtual machine in a cloud computing environment.
2. Railway Traffic Optimization Model
2.1. Overview. The Rail Traffic Optimization software 40, stores a model 55 of a railway network, such as network 21, in database 42, or some other datasource that is accessible to scheduling machine 33. Model 55 captures the arrangement of the railway network as a graph.
The smallest building unit of a track in the railway network is called a block. One block of each network 71, 73, 75 is identified by a dashed line loop 71a, 73a, 75a in each of
The network segment in
The movement of trains is modelled to occur in stages. A stage is the movement of a train from a node to the next node. Nodes are connected by edges, which can be single or double. A single edge represents a single line and at any given time only one train can transit over such an edge. A double edge models a double track, which allows the transit of two trains at the same time, as long as they are transiting in opposite directions. Consequently, under normal operating conditions two trains can travel in opposite directions on a double track segment. Nodes in the graph representing the railway network are connected by either single or double edges.
As shown, nodes are also characterized by a number of slots indicating how many trains can be present on the node at the same time. Locations where passing can occur (e.g., sidetracks, stations) can be modelled as nodes with multiple slots and are shown as double (or multiple) circles in the graph. (Models of trains transit based on standard job-shop scheduling essentially assume that all nodes have an infinite number of slots.)
The same railway system can be represented by different graphs, depending on the density of traffic allowed and the desired granularity of the schedules produced.
Nodes in a model of a network represent the completion of processes rather than physical locations. In
It will be realised that the visual representation of nodes and edges interconnected as a graph is primarily for ease of human comprehension. The graph is used to determine the sequence of nodes from the current location of the train to its destination and need not be visually displayed, e.g. on display 47. The rail traffic optimization software 40 only needs to be able to retrieve the ordered list of nodes (and edges) that the train needs to occupy as it progresses along its path, and in what sequence, so that it is able to ensure that e.g. no two trains occupy the same resource (node or edge) at the same time if it is for example a single capacity edge.
To allow for a clean characterization of deadlock in the next section, in the optimization model presented here, which is the model that is stored in software 40, the physical constraints on railway traffic are focussed on. Also, the primary operational requirement in the presently described embodiment is that throughput should be maximized or equivalently that the sum of the trains' arrival times are minimized. These requirements result in a particularly suitable model for cases in which the railway system is used for freight transportation [1, 13]. In other embodiments of the invention the primary operational requirement may be otherwise, for example to adjust train schedules for purposes such as minimizing travel times of trains, minimizing deviations from a given timetable or allocating precedence to trains.
In the presently described embodiment the path that each train will take, e.g. path 89 in
2.2. Model Formulation. In order to model the rail network the first step is for a human operator to make a graph G (E,N) for the model that corresponds to the rail network and which is stored as part of model 55 in database 42 of scheduling machine 33. As previously discussed, a Graph G (E,N) comprises a set of edges E and nodes N.
The model 55 further comprises trains Ti, where i∈I
For each train Ti, where i∈I
ni=(ni[0],ni[1], . . . ,ni[Fi]) (1A)
is the sequence of nodes in the path of train Ti from its current position to its destination node ni[FA], where Fi characterizes the number of stages from train Ti's current position to its destination node ni[Fi]. Similarly, for each train
ei=(ei[0],ei[1], . . . ,ei[Fi−1]) (1B)
is the sequence of edges in the path of train Ti from its current position to its destination node ni[Fi], where Fi is the sequences of nodes and edges, respectively, in the path of train Ti from its current position to its destination node ni[Fi], where Fi characterizes the number of edges to the “terminals”, being the nodes at the end of the train's path. In the following bracket notation [·] is used when edges are being referred to.
If a train is currently transiting an edge, then that edge is ei[0] in ei, and ni[0] is the last node it visited. Accordingly, trains' trajectories for the example shown in
As a further example, in the model 55 at the state illustrated in
The sequence of nodes and edges for the paths of T1 and T2 in
T1—Sequence of Nodes
Let ki[e] be the index of edge e in ei, and ki[n] be the index of node n in ni. Whenever clear from the context, the index i will be dropped and k[e], k[n], n[k], e[k] will be written instead.
Sequentiality of transit. The initial set of constraints represents the required temporal sequentiality of transit over the edges of the network. Let yi[k]∈R+ be the optimization variable modelling the time at which train Ti∈I departs from the k-th node ni[k]. Then,
yi[1]≥yi[0]+τi,e[0]edge·(1−ωi)∀i∈I
yi[k]≥yi[k−1]+τi,e[k−1]edge∀i∈I,k=2 . . . ,Fi−1 (2)
where τi,e[k]edge∈ is the time required by train i to complete travel over the k-th edge ei[k], which for the first stage is reduced by the fraction of the edge already traversed wi. For example, in
The underlining of wi in (2) indicates that this is a measurement of the τi,e[k]edge current state of the system used to initialize the optimization model, an aspect which will be further analyzed in Section 3 where closed loop operation is discussed. Note that can depend on a number of factors, including the current speed of the train, its state, i.e., whether it's empty or loaded with goods, wear and tear conditions, its length and number of locomotives, etc. As long as these characteristics are effectively captured in the edges' travel times they fit into the optimization framework.
Initial conditions. Let Iedge⊆I be the subset of trains currently transiting an edge (i.e., not stopped at a node), and ei[0] be that edge. Then we have
yi[0]=0 ∀i∈Iedge. (3)
Table 1 includes times required by each of trains T1 and T2 to complete travel over each of the edges ei[k],
As an example, the application of Eqn (2) for T1 in
y1[1]≥y1[0]+τ1,e[0]·(1−w1)
Here, y1[1] is the time at which train T1 departs from the first node n1[1] which is n38 and is greater than or equal to the time that it departed from the zeroth node n1[0] (i.e. n37) plus the time it takes for train 1 to travel over the zeroth edge (i.e. ε37 reduced by the fraction of the zeroth edge already traversed.
The initialisation state is y1[0]=0 because T1 is not starting from a node but from 0.5 the way along edge ε37.
Applying Eqn 2 to the data in Table 1 results in:
y1[1]≥0+0.25×(1−0.5)=0.125 hrs.
y1[2]≥0.125+0.6=0.725 hrs.
y1[3]≥0.725+0.5=1.225 hrs.
y1[4]≥1.225+0.9=2.125 hrs.
The application of Eqn (2) for T2 is: y2[1]≥y2[0]+τ2,e[0]·(1−w2) in which y2[1] is the time at which train T2 departs from the first node n1[1] which is n41 and is greater than or equal to the time that it departed from the zeroth node n1[0] (i.e. n42) plus the time it takes for train 1 to travel over the zeroth edge (i.e. ε41 reduced by the fraction of the zeroth edge already traversed (which in this case is zero since T2 starts from n42 and so must traverse all of zeroth edge ε41).
In this case y2[0] is left as an optimization variable that is only required to be equal or greater than 0. Its final value will be determined after having solved the optimization model. Because T2 is at a node, it generally can dwell there for some amount of time before it departs from that node; that is what it would mean for y2[0] to have some strictly positive value, it would be the amount of time T2 dwells on n42 from the point in time the state of the system was determined and used to construct the optimization model.
If a train is currently transiting an edge it cannot be stopped in the middle of that edge which is why y[0]=0 in those cases.
Edge conflicts. The set of edges ε is partitioned into single εs and double tracks, εd so that ε≐εs∪εd. The single edges allow the transit of at most one train at the time, while on the latter two trains can transit as long as they are headed in opposite directions. For each single track edge e∈εs the following set of conflicts hold
Ce={(i,j)∀i,jεI,j>i|e∈ei,e∈ej}, (4)
encoding the fact that if both trains i and j are to transit over edge e within their planned path to destination, then a conflict must be resolved to determine the train transiting first.
The construction for double edges e∈εd is similar, but conflicts are considered only among trains transiting in the same direction.
A binary optimization variable zi,j,eedge will now be introduced zi,j,eedge is set to 1 if train i is scheduled to transit before j over edge e, and is 0 otherwise as follows:
The value of M has to be set to a sufficiently large value, e.g., M≥maxi∈I yi[Fi−1].
Initial conditions. Trains that are currently transiting an edge i∈Iedge automatically get priority over that edge:
zi,j,e
Node conflicts. Similar to edges, the resolution of conflicts over a node involves node deciding which train transits first, and is encoded with the binary variable zi,j,nnode, attaining 1 if train i transits over n before train j. Nodes are characterized by a number of “slots” indicating how many trains can be present over that node at the same time. Before transiting, a train thus also needs to acquire a slot on the nodes along its path. To capture this, the binary variable zi,n,lslot is introduced, which indicates whether train i occupies slot l∈Ln on its transit over node n, where Ln is the set of slots at node n.
Then, for each n∈N, the following set is introduced to capture conflicts over nodes:
Cn≐{(i,j)∀i,j∈I,j>i|n∈ni, and n∈nj},
and require that schedules satisfy the following constraints:
yj[k[n]−1]≥yi[k[n]]−M(1−zi,j,nmode)−M(1−zi,n,jslot)−M(1−zj,n,lslot),
yi[k[n]−1]≥yj[k[n]]−Mzi,j,nnode−M(1−zi,n,lslot)−M(1−j,n,lslot) (7)
for all n∈N, l∈Ln, and (i, j)∈Cn. These constraints can be active only if, for a given node n and slot l, both zi,n,lslot and zj,n,lslot attain a value of 1 in the solution, i.e., both trains, are scheduled to use the same slot during their transit. In such case, the constraint ensures that if train i transits before j on the node, then the start time of train j over the edge leading to node n has to be greater or equal to the start time of i leaving node n. Additionally, each train occupies exactly one slot during transit:
In order to simplify the exposition herein terminal stations are generally modelled as nodes with infinite capacity, i.e., nodes for which constraints (8)-(9) are suppressed. Note also that it is possible to extend the formulation in (8) with the addition of a quantity of time {tilde over (τ)}. Doing so requires that the train giving way must wait an additional amount of time equal to {tilde over (τ)} after the train with precedence has left the conflict node allowing for, e.g., safety headways of long trains. The quantity {tilde over (τ)} may also be negative allowing for earlier departure, a feature that may be useful on long edges.
Initial conditions. As with edges, trains that are currently transiting a node are occupying a slot on that node and hence they automatically get priority over that node and acquire a slot. Let Inode⊆I be the subset of trains currently transiting on a node, ni[0] be that node and li be the slot they are currently occupying.
Then,
zi,j,n
zi,n
where, as before, the underlining of li indicates that this is part of the state that is measured.
Objective function. As a proxy for rail network throughput maximization, the objective in the presently described embodiment is the minimization of the sum of the trains' arrival times,
It may be noted that there is significant flexibility in the type of objectives that could be used so that it is possible to include, e.g., penalties on delays at intermediate steps, which would allow a straightforward extension of the model presented to pursue timetable adherence. To achieve this, for some train i∈I scheduled to depart from stage k at the reference time yiref[k] stemming from, e.g., a timetable, a new optimization variable yidev[k]≥0 can be introduced as:
−yidev[k]≤yi[k]−yiref[k]≤yidev[k]
This variable could then be added to the objective function allowing a straightforward extension of the model presented to pursue timetable adherence.
In summary, the complete model (eqn (11)) is:
3. Closed Loop Operation: Receding Horizon Control
In this section the optimization model P of Section 2 is embedded within a strategy called receding horizon control, in which a shortened optimization horizon fi, where 0≤fi≤Fi for train i∈I, is used rather than Fi (eqn (1A), (1B)), which extends all the way to the train's destination node. Use of the shortened optimization horizon fi enables the scheduling machine 33 to operate with reduced computation times, and also reflects the fact that in practice, the presence of disturbances and imperfect information on transit times means that the final part of schedules stretching far into the future is likely to be of little value, and unnecessarily leads to increased computational demands; note that the size of the model 55, measured in number of constraints and variables, grows as O(|I|2|N|). Within this framework, feedback is introduced by scheduling machine 33 continuously monitoring the current state of the system, since it is arranged to receive state reports xt, via data communications system 29 and using the new state information to recompute adapted schedules.
The state of the system xt=(ni, wi, li)i∈I denotes the complete set of measurements required to initialize the optimization model P, where ni≡ni[0]∈N is the most recent node visited by train i∈I, 0≤wi≤1 is the fraction of the edge ei[0] already traversed and li indicates the slot occupied if the train is currently located at a node.
P(t, xt, f) indicates the instance of P generated at time t for the initial state xt and under the optimization horizon schedule f=(fi)i∈I. In this section the evolution through time of the state of the railway system x, under the control of movement schedules S1, . . . , Sm produced by scheduling machine 33 as it solves P(t, xt, ft) will be discussed.
For simplicity of notation and exposition, it is assumed here that the schedules, e.g. S1, . . . , Sm of
A potential problem with shortening prediction horizons is that the trains interactions in the later stages are not determined, which might lead to deadlocking.
Example 3.1. Consider the State of a Portion of the Rail Network 21 Depicted in
Assume that at that point in time t, the optimization horizons for the individual trains used to construct model P(t, xt, ft) are as depicted in
A feasible solution to P for this situation is shown in the train graph of
Instances affected by a deadlock are reflected as models that do not allow for a finite, feasible set of start times y, i.e., equation (11) cannot be solved to obtain start times for each train that do not result in a node, edge or slot conflict, so that a solution for P(t, xt, F) is infeasible. Given that P(t, xt, F) exclusively entails physical constraints on traffic, rather than operational ones such as deadlines, an infeasible model indicates that there is no sequence of decisions steering trains from their current position to their respective terminals that is compatible with the physical limitations on traffic, i.e., that there is a deadlock. Hence, a state x, is deadlocked if and only if P(t, xt, F) is infeasible.
In the following section the relationship between P(t, xt, F) and P(t, xt, f) as it relates to deadlocking will be further examined.
3.1. Recursive Feasibility. Recursive feasibility is the fundamental notion used to establish the stability of linear, time-invariant systems under receding horizon controllers such as model predictive controllers [4]. Even though the presently described system is neither, due to the presence of binary variables and the fact that the constraints are time-varying, the issue of recursive feasibility remains crucial in ensuring that the system is not driven into a deadlocked state when the prediction horizons are shortened to 0≤f≤F.
In this section, a procedure is presented to compute a dynamic horizon termination schedule f that guarantees recursive feasibility and which may be implemented by scheduling machine 33. The core notion required for the construction of such a procedure is that of a safe state.
Definition 3.2 (Safe state). A safe state is a system state xsafe=(nsafei, 0, lisafe)i∈I in which all trains are at a node, and all nodes n∈N in the graph have an unoccupied slot. The term “non-regressive” denotes, with respect to x, a safe state in which trains occupy nodes that are successors along their paths from a given state x.
Definition 3.3 (Non-regressiveness). We define as non-regressive with respect to x a system state in which trains occupy nodes that are successors along their paths from a given state x.
The inequality sign “≤” is overloaded when applied to horizons to indicate non-regressiveness: fi≤fi means that, for train i, the horizon determined by {tilde over (f)}i terminates at a node that is further along i's path than the node reached by fi. When fi and {tilde over (f)}i refer to two different points in time, the numbers might not satisfy the standard meaning of the inequality, but they still do imply non-repressiveness.
The following result is a characteristic of safe states which will be used to prove recursive feasibility.
Proposition 3.4. There Always Exists a Sequence of Train Movements that Drives the System from any Safe State xasafe into any Other Safe State xbsafe that is Non-Regressive with Respect to xasafe.
Proof. Algorithm 1 constructs one such sequence of movements. Since the initial state is safe, any train can be moved forward to any other node in the network in a first step; the destination node has to have at least two slots (otherwise it can't be part of a safe state). Upon train arrival, the node has now either no empty slots left, or at least one. If it has at least one empty slot, then the current state is also safe, and the procedure can restart by picking any other train that hasn't been moved yet. If the current node has no slots left, there must be another train on the current node that has not been moved yet. By construction, all other nodes have at least one empty slot available for transit, meaning that the train can be moved anywhere in the network. This procedure can be repeated to termination.
Algorithm 2 presents a procedure to compute a dynamic horizon ft based on the notion of safe states, which may be implemented by scheduling machine 33. If the system is in a non-deadlocked state xt, it is guaranteed to successfully compute an optimization horizon ft which ensures recursive feasibility. In the proposed procedure, the optimization horizon for each train fi is iteratively extended until it reaches a node such that the state of the system would be safe if trains transited up to that point from their current position. Prediction horizons are further extended until the computed ft results in a feasible P(t, xt, ft), while retaining the condition on the final state being safe, a condition that is guaranteed to be met if P(t, xt, F) is feasible. We call horizons ft computed according to Algorithm 2 safe optimization horizons.
Remark 3.5. Note that the feasibility of P(t, xt, ft) implies the feasibility of P(t,xt,ft) for any ft≥ft that leads to a safe state. This is true because Algorithm I can always be used to generate a feasible schedule between the corresponding safe states.
Thus, choosing larger initialization horizons (line 1 of the algorithm) reduces the number of models that have to be attempted before a feasible one is found.
The implementation of Algorithm 2 by scheduling machine 33 will be illustrated with reference to
From that initial state Algorithm 1 executes as follows:
Line 4 (T6): For the “while” condition in Line 4 to be triggered the η value (i.e. number of slots) of the node at which the current train's current horizon fi terminates must be less than or equal to 1. For train T6, the current η(n39) value of node 39 is η(n39)=1 (from Line 2) so that the “while” condition is triggered for T6.
Ultimately the horizons appear as shown in
This result assumes that each time scheduling machine 33 proceeds to Line 7 of Algorithm 2 it is possible to find a feasible solution, by using the optimization engine 41 for the model in the current state P(t, xt, f). If a feasible solution can't be found then Line 7 diverts to Line 9.
The scheduling machine 33 uses the optimization engine 41 of the rail traffic optimization software product 40 to search for a feasible solution within a practical time, e.g. five minutes of processing on a scheduling machine with 16 GB of RAM, an Intel i7-6700K CPU clocking at 4.00 GHz running on Linux Ubuntu 16.04.4 LTS and using Gurobi 7.5.2 as the optimization engine.
The correctness of Algorithm 2 will now be proved and a characterization of deadlocks will be provided that is generally computationally cheaper than solving the full-horizon model P(t, xt, F).
Theorem 3.6 (Deadlock Characterization and Recursive Feasibility). Let P(t, xt, f) be the Optimization Program Instance Generated at Time t for the Initial State x, and with any Non-Regressive Horizon Termination Schedule f Produced by Algorithm 2. Then,
Let ({tilde over (y)},{tilde over (z)}) be any feasible solution of P(t, xt, ft), where ft is an horizon termination schedule computed at time t according to Algorithm 2. Let (
For part a), if P(t, xt, f) is feasible, the system is not deadlocked since, as shown in part b), a feasible solution to P(t, xt, F) can always be constructed by applying Algorithm 1.
On the other hand, Algorithm 2 extends f until P(t, xt, f) is feasible, which is guaranteed to succeed if the system is not deadlocked.
Note that with the application of Algorithm 2 safe optimization horizons are determined, merely to use within the optimization model. These are constructed to guarantee that enough interactions are taken into account to prevent deadlocks. Typically a new sequence of controls, i.e. schedules S1, . . . , SM, are computed before any of the trains have arrived at the final node within their respective horizons, in which case the system will generally not traverse that safe state. Also, note that optimization horizons required by Theorem 3.6 are not unique: in Example 3.1, both {T1: n5, T2: n8, T3: n0} as well as {T1: n8, T2: n5, T3: n0} would be valid. Further, the result does not depend on the optimality of the solution recovered, meaning that a solver can be safely interrupted as soon as a feasible solution to P(t, xt, ft) has been found.
The following counterexample illustrates how the result in Theorem 3.6 might fail when the assumption on non-regressiveness is violated.
Example 3.7 (Non-regressiveness). Consider again the example depicted in
terminating at the nodes indicated with black dashed arrows in
Suppose that trains depart from their current location at time t according to this schedule, and at t+Δt the alternative horizons
are selected, as indicated with cross-head dotted arrows in
Finally, note that the notion of safe states introduced in Definition 3.2 does not exclude the existence of more efficient definitions. Definition 3.2 is sufficient to guarantee recursive feasibility and works well for the freight network discussed in the results discussion Section 5. Alternative definitions might be devised for other networks; the only fundamental requirement is that a safe state must be endowed with a (usually trivial) policy that drives all trains from that safe state into a subsequent safe state, and that this policy can be applied recursively, ensuring that the system can be continuously operated for an “infinite” amount of time without deadlocking. This is accomplished by the (inefficient, but valid) policy described in Algorithm 1.
In the following section computational ramifications of recursive feasibility are discussed.
4. Computationally Efficient Optimization Approaches
As mentioned previously, the computation of solutions to P becomes a practical difficulty, in particular when the size of the network and number of trains is large. In this section several approaches to tackle this issue based on the previous section's results will be discussed.
A. Warm Starting
In warm starting solutions computed at t are reused at t+Δt as a system has moved from Xt to Xt+Δt. We first illustrate how warm starting might fail when the conditions required by Theorem 3.6 are violated.
Example 4.1 (Warm-starting). Consider the train graph depicted in
The procedures laid out in Theorem 3.6 for the construction of safe optimization horizons guarantee that warm starting can always be performed. That is, any solution to P(t, xt, ft) when P(t, xt, ft) is feasible and ft is computed according to Algorithm 2, can always be reused by scheduling machine 33 at t+Δt as a partial solution to P(t+Δt, Xt+Δt, ft+Δt). This is shown in the proof of the theorem, when a complete solution to P(t+Δt, xt+Δt, ft+Δt) is derived combining the procedure in Algorithm 1 with a solution to P(t, xt, ft).
It should be noted that these partial solutions can either be enforced in the following optimization model P(t+Δt, xt+Δt, ft+Δt), in which case the size of the model to be solved is reduced, or used only as initialization points for solvers. Both can result in faster computations.
A by-product of this result is that Algorithm 2 can be substituted by the more efficient procedure in Algorithm 3 to compute ft when the preceding ft-Δt is available. In particular, this more effective procedure does not require one to verify the feasibility of P(t, xt, ft) for a candidate ft, as done on line 7 of Algorithm 2, since the generated ft is guaranteed to result in a feasible P(t, xt, ft). This is true because, as discussed in Remark 3.5, having established that P(t−Δt, xt−Δt, ft−Δt) is feasible automatically ensures the feasibility of P(t, xt, ft) for ft≥ft−Δt. Strictly speaking, Remark 3.5 ensures the feasibility of P(t−Δt, xt−Δt, ft) which, in turn, ensures the required feasibility.
B. Anytime Approaches
A direct consequence of the results in Section 3 and 4-A is that feasible solutions for arbitrary horizons lengths at subsequent iterations can be obtained without performing optimizations. Namely, once an initial feasible solution to a safe state is found, cf. line 7 of Algorithm 2, that solution remains valid at t+Δt according to the discussion in Section 4-A. It can then easily be extended into a solution to any arbitrarily long optimization horizon which satisfies the condition of being safe by application of Algorithm 1. This guarantees that at t+Δt a complete feasible solution to P(t+Δt, xt+Δt, ft+Δt) is available before any optimization is performed. Solvers can thus always be seeded with an initial feasible solution, and since all results herein do not rely on optimality, the solution progress can be interrupted at any time returning valid schedules.
The quality of the solutions recovered with this approach depends on the quality of the heuristic utilized to move trains from safe state to safe state. The policy in Algorithm 1 is evidently suboptimal. It could be improved, for instance, by moving all trains that do not interact which each other at the same time. Generally, designing efficient movement schedules between safe states appears to be simpler than working with generic initial and terminal states.
C. Time-Wise Problem Decomposition
One important ramification of Theorem 3.6 is that scheduling machine 33 can be configured to calculate a feasible solution to P(t, xt, ft), for any arbitrarily long safe optimization horizon ft, by solving a sequence of smaller optimization models, each of which incrementally considers an additional portion of time. More precisely, if (
This is true because scheduling machine 33 can always construct a feasible solution to P(t, xt, {tilde over (f)}t) by extending a solution to P(t, xt, ft) to any non-regressive safe horizon {tilde over (f)}t≥ft by application of the trivial policy in Algorithm 1, thus guaranteeing feasibility.
Since all variables (
Note that the chances of time-wise decomposition working on extensive networks with large fleets are exceedingly low when not enforcing safe horizons. As traffic density increases, the likelihood of at least one train terminating at a node that impedes the transit of other trains in subsequent steps also increases (e.g., dwelling on a single slot node).
D. Train-Wise Problem Decomposition
The Inventors have found that a consequence of the results in Section 3-A is that, under certain provisions, it is possible to solve P by considering only portions of the train fleet, e.g. trains 1a, . . . , 1n of
Two specific procedures enabled by this result are as follows:
In both cases, at each iteration the size of the problems to be solved are smaller than the full-scale model P.
An example illustrating that, as expected, this is generally not possible will now be provided. However, a way in which adjustments may be made to boundary conditions to resolve the underlying issue will also be discussed.
Example 4.2. Consider again the example depicted in
To see how the desired result might be possible more generally, we first observe that movements of individual trains are almost entirely independent of each other in Algorithm 1. Note that the algorithm assumes that initial and final states are safe. The only coupling between trains in the policy occurs when a train i is moved to its destination node n, resulting in all slots in n to be occupied. In this case the policy, as presented, does enforce a specific transit order for scheduling trains by requiring j, another train at node n that hasn't been moved yet, to be moved next. Note, however, that it would be possible to rectify this by delaying the departure of all—or any subset—of the trains already processed (I\Iopen) and move j first. The node at which j arrives can itself then be fully occupied, but as before, a train that hasn't moved yet must exist at this node and hence the same procedure can be re-applied. These recursive iterations must terminate because the number of trains that haven't been moved yet is finite.
This demonstrates that the policy can be adapted to return a train transit schedule by processing trains in any sequence and/or independently of each other, provided boundary conditions adhere to safe state requirements. It thus follows that precedences in instances of P, in which initial and final states are safe, can be solved by considering conflicts of subsets of trains in any order and, hence, both decomposition schemes mentioned above can be applied. The modified procedure does, however, also require the ability to modify y through iterations, which is why the analysis in this section is exclusively valid for z.
Example 4.2 violated the assumption on boundary conditions, both for the initial as well the final states. Adjusting the terminal conditions was sufficient to recover feasibility. It is generally possible to make this adjustment whenever optimization horizons can be stretched far enough to reach a safe state, which is always possible under the assumption of infinite capacity at the terminals.
To guarantee that the procedure succeeds in all cases, however, we need to address initial conditions as well.
One approach is to run Algorithm 2, which outputs a minimal safe horizon ft together with a feasible solution to P(t, xt, ft) and only consider the output ft. Solve problem P(t, xt, {tilde over (f)}t) for any safe {tilde over (f)}t≥ft using either procedure i. or ii. but, at each iteration, only freeze optimization variables indexed from ft onwards; all other variables, which concern the schedule from the trains' current position to the horizon ft, need to be left open as optimization variables. They can, however, be seeded with the values obtained in previous iterations, which, as noted above, will often be a valid initialization point. This is guaranteed to work because running Algorithm 2 ensures that a feasible schedule exists from the trains' current position to ft. As long as the existence of at least one solution is guaranteed, the model can be extended from that point using either procedure i) or ii) into a feasible solution to P(t, xt, {tilde over (f)}t) for any arbitrary {tilde over (f)}t≥ft that is safe.
An alternative approach is to first construct a feasible schedule from the trains' current state into a safe state. One way to obtain this is to run Algorithm 2 and consider both the horizon ft as well as the feasible solution to P(t, xt, ft). We can then solve the problem P(t, xt, {tilde over (f)}t) to any arbitrary {tilde over (f)}t≥ft by following procedures i. or ii. Note that this approach, however, forces the system to pass through the safe state determined by solving P(t, xt, ft).
The Inventors have found that this approach works independently of the extent of the network and the complexity of its topology. It is also independent of the train fleet size. The only relevant factors are the initial and terminal conditions, the approach works for any arbitrary complexity degree of traffic patterns between those boundary conditions.
The Inventors have found that that the quality of the schedules obtained with this model decomposition depends on the size and sequence of the subsets of I used in the iterations.
At decision box 126, control diverts to box 128 where counter variable i is incremented so that box 124 determines an optimization horizon for the next train. Once all trains have been processed to determine their associated optimization horizons for the current state the procedure proceeds to box 130.
At box 130 the scheduling machine 33 implements the optimization engine 41 to solve the model P for the current state using the optimization horizons that have been determined at box 124. The optimization engine finds controls in the form of timing yi[k] for the train, e.g. a time for the train to commence movement from its current position, and also zedge, zslot and znode controls which dictate which edge node and slot on the node the train should proceed to.
At box 132 scheduling machine 33 compiles a schedule based on the control values that have been determined at box 132 for all of the trains for the current state. The schedule, e.g. S1 of
How the control values yi[k] and zedge, zslot and znode are used depends the deployment of the network 21.
For example, the schedules S1, . . . , Sm may be displayed on monitors of computers in the rail network controller 27 to train controllers (people that sit in front of screens and operate on computers in the rail network controller to effect changes in signals 9 and switches 10 (e.g. switch 10a of
In this context, for those binary variables: the stringlines that are produced, e.g. as shown in
In other embodiments the scheduling machine 33 may control the railway network 21 in an autonomous fashion in which point z-slot information can be used and mapped to a control, e.g. switches 10 (such as switch 10 of
5. Testing and Results
Scheduling machine 33 was tested in different configurations on two networks. The first network was modelled with a graph comprising 27 nodes, displayed in
Scheduling machine 33 was tested whilst varying the number of trains present in the network to assess the sensitivity of computations to traffic levels. For the network with 27 nodes, 10 (moderate traffic), 20 (high traffic) and 30 trains (very high traffic-more trains than nodes), were considered. For the 69-node network, 30 and 50 trains on the network were tested. For each network and train number combination, 500 random initial positions of trains were created. For each random initial condition, P (eqn (11)) was solved using the processing methods presented in the previous section:
Time-wise decomposition. In time-wise decompositions, the results in Section 4-C were utilized. Three iterations of the time-wise decomposition solution approach that were implemented by scheduling machine 33 are illustrated in the stringlines generated in
Train-wise decomposition. In train-wise decompositions, the procedures from Section 4-D were utilized to configure the scheduling machine 33. Three iterations of the time-wise decomposition solution approach by the scheduling machine 33 are illustrated in the stringlines generated in
The “incremental” version refers to variant i., while “partitions” corresponds to variant ii. Experiments were run with varying sizes of the train subsets considered at each step. To make comparisons fair, since the “partitions” strategy only recovers a partial solution to P, a last step was performed by scheduling machine 33 in which that partial solution is enforced into the full model P to retrieve a complete solution. The trains selected to be within the next subset at each iteration were chosen randomly for this test.
Monolithic. In the monolithic version, P is solved as a single optimization model until the incumbent solution has a guaranteed optimality gap of less than 0.1% or 120 seconds have elapsed, whichever occurs first.
The results of these experiments are presented in
For the first network and moderate traffic case (10 trains), all methods quickly (≤0.1 sec) solve the model to optimality in the vast majority of cases. For cases with higher traffic, it is possible to distinguish a trade-off between computation time and solution quality. Approaches with higher compute times are generally producing higher quality solutions and vice-versa. The monolithic variant tends to produce solutions with lowest optimality gaps, while median compute times for incremental variants of train-wise decompositions are two orders of magnitude faster, while retaining (in median) optimality gaps of 5% or less.
Computational constraints for the network with 69 nodes make this a more challenging set of instances, especially the experiments involving 50 trains. For this case, the monolithic approach caps at the maximum allowed compute time of 120 seconds in the majority of instances, and presents several outliers with high optimality gaps. Incremental train-wise decompositions with 1 and 5 trains per subset significantly outperform this approach in terms of worst-case optimality gap while being more than two orders of magnitude faster in terms of median computation times.
The results indicate that problems' hardness is very strongly related to the traffic density in the network: significant increases in compute times can be observed for all algorithms on both networks as the number of trains is increased. In particular, median compute times for the monolithic variant grow in excess of an order of magnitude at each higher level of traffic density for both networks. Note also that compute times for the synthetic network with 30 trains are approximately an order of magnitude slower than for the 69 nodes network with the same number of trains. This is to be expected as the number of conflicts (and hence binary variables) grows with increased interactions between the trains.
Comparing time-wise decompositions, we note that performing relaxations drastically improves solutions' quality, while retaining median compute times that are approximately one order of magnitude faster than the monolithic approach for the cases with the highest traffic. Improved quality is likely due to the fact that solutions are not forced through a safe state at the end of each solution step.
Train-wise decomposition with partitions tend to require more computations than the incremental variant and this is mainly due to the last step, in which a full solution is computed from a partial one. All prior steps, involving separate and independent partitions, compute very quickly.
The following articles are each incorporated herein in their entireties by reference.
In compliance with the statute, the invention has been described in language more or less specific to structural or methodical features. The term “comprises” and its variations, such as “comprising” and “comprised of” is used throughout in an inclusive sense and not to the exclusion of any additional features. It is to be understood that the invention is not limited to specific features shown or described since the means herein described herein comprises preferred forms of putting the invention into effect. The invention is, therefore, claimed in any of its forms or modifications within the proper scope of the appended claims appropriately interpreted by those skilled in the art.
Throughout the specification and claims (if present), unless the context requires otherwise, the term “substantially” or “about” will be understood to not be limited to the value for the range qualified by the terms. The Abstract of the invention as lodged with this specification is hereby incorporated herein by reference.
Features, integers, characteristics, moieties or groups described in conjunction with a particular aspect, embodiment or example of the invention are to be understood to be applicable to any other aspect, embodiment or example described herein unless incompatible therewith.
Any embodiment of the invention is meant to be illustrative only and is not meant to be limiting to the invention. Therefore, it should be appreciated that various other changes and modifications can be made to any embodiment described without departing from the scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
2019903427 | Sep 2019 | AU | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/AU2020/050979 | 9/11/2020 | WO |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2021/046619 | 3/18/2021 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6135396 | Whitfield et al. | Oct 2000 | A |
6546371 | Doner | Apr 2003 | B1 |
7092894 | Crone | Aug 2006 | B1 |
7340328 | Matheson et al. | Mar 2008 | B2 |
8612071 | Chun | Dec 2013 | B2 |
9073562 | Bollapragada | Jul 2015 | B2 |
20070014532 | Parisi | Jan 2007 | A1 |
20130284859 | Polivka | Oct 2013 | A1 |
20200120510 | Gresset | Apr 2020 | A1 |
Number | Date | Country |
---|---|---|
3141451 | Mar 2017 | EP |
20150035303 | Apr 2015 | KR |
20180049673 | May 2018 | KR |
Entry |
---|
Dec. 13, 2022—(EP) Search Report—App 20863028.5. |
Jan. 5, 2021—(WO) International Search Report and Written Opinion—App PCT/AU2020/050979. |
Number | Date | Country | |
---|---|---|---|
20220348242 A1 | Nov 2022 | US |