Dividing A Travel Query Into Sub-Queries

Description

BACKGROUND

This invention relates to travel scheduling and pricing, and more particularly to processing queries for air travel planning systems.

In travel planning such as for air travel scheduling, pricing and low-fare-search queries are posed by users from travel agent systems, airline reservation agent systems, travel web sites, and airline-specific web sites. Low-fare-search (LFS) queries typically include origin and destination information, time constraints and additional information including passenger profile and travel preferences. Travel planning computer systems respond to these LFS queries and typically return a list of possible tickets, each having flight and price information. Some systems to return answers in a compact form such as through a pricing graph.

Travel planning systems expend considerable computational resources responding to LFS queries. It is not uncommon for a travel planning system to spend more than 30 seconds responding to an LFS query, even for a relatively straightforward round-trip query leaving and returning from specific airports on specific dates. Typically, a single computer will be devoted to answering such a query, though the computer may range from a small personal computer or workstation class machine to a mainframe computer.

Because travel planning systems spend considerable computational resources on each LFS query, and because many such queries are answered every second, it is typical for travel planning computer programs to be run on large “farms” of computers, including tens, hundreds or even thousands of computer processors. In current practice, each query is answered by a single computer with different computers in a farm concurrently working on corresponding different queries.

SUMMARY

However, there are many situations in which it is advantageous for multiple computers to work on the same query concurrently. One reason for doing so is that the response time (“latency”) can be reduced. For example, where one computer might expend 1 minute answering a query, it may be possible for 4 computers acting in concert to each expend 15 seconds answering the same query. The total number of CPU-seconds is the same, but the query latency is reduced from 1 minute to 15 seconds, a considerable improvement from the user's standpoint.

Also, in many cases the peak load on the farm, which may only be reached for short periods, dictates the size of a computer farm. For example, it is common for load on travel planning systems to be high in the early work hours but much lower late at night and on weekends and holidays (when travelers are less likely to access the internet and travel agencies are closed). It may be that a travel planning system requires 1000 computers to support its query load during peak periods, but only 250 during off-peak hours. Since the incremental cost of using an otherwise idle computer is negligible, during off-peak hours it may be economically practical to devote 4 times the computing resources to answering a query as at peak hours. The extra resources may enable more complicated queries, or be used to improve the search accuracy. However, it may be preferred to use these resources in parallel to maintain low query latency, rather than having each computer spend four times longer on each query.

According to an aspect of the present invention, a method includes dividing a travel query into sub-queries for execution by a travel planning system to return answers that satisfy the travel query.

According to an additional aspect of the present invention a method includes dividing a travel query into sub-queries according to a determined optimal division of the query for execution by a travel planning system to return answers that satisfy the travel query.

Depending on the travel planning system, there may be different ways to divide up a low-fare-search query amongst several computers. For example, some travel planning systems solve low-fare-search problems by first enumerating a list of from 1 to several thousand possible flight combinations that satisfy the airport and time specifications. Such systems then iterate over each flight combination finding prices for each, and return a small set of flight combinations that have low prices. Because the process of finding prices is typically much more computationally expensive than finding flight combinations, for a travel planning system with such a design a practical way to divide the work amongst several computers would be to have one computer generate the list of flight combinations and to divide the list of flight combinations into smaller lists to be priced concurrently by multiple computers.

However, again depending on the design of the travel planning system, this strategy may be less efficient than other strategies. For example, a travel planning system that achieves computational advantages by sharing work across the pricing of multiple flight combinations can divide queries in certain ways amongst the computers in order to retain those efficiencies resulting from sharing work. Such ways include having each computer price flight combinations for a different airline or by dividing up queries by time range. For such a system it is less efficient in terms of total resources expended to price many flight combinations separately on different computers than to price many flight combinations as part of a single computational process.

When dividing a low-fare-search query amongst multiple computers it is advantageous to have each computer perform roughly equal amounts of work, since typically the slowest computer determines the response time of the entire query. It is desirable that any technique of dividing a query into sub-queries be sophisticated enough to base its decisions in part on the expected work necessary to solve each sub-query.

Because of resource or program limitations, a travel planning system may be incapable of answering queries beyond a certain level of difficulty. For example, a system may be limited to solving problems involving no more than one-day departure windows, or a single origin or destination. For such a system, queries that exceed the limits of the system may need to be divided into smaller “sub-queries.” Techniques for dividing a query into smaller sub-queries executed concurrently with the goal of reducing query latency can be used to extend the capabilities of those travel planning systems that have difficulties handling more complex travel queries.

The details of one or more embodiments of the invention are set forth in the accompanying drawings and the description below. Other features, objects, and advantages of the invention will be apparent from the description and drawings, and from the claims.

DESCRIPTION OF DRAWINGS

FIG. 1 is a block diagram of a travel planning system that divides search queries into sub-queries to be executed concurrently.

FIG. 2 is a flow chart of a query dividing process that is executed in a centralized manner.

FIG. 3 is a flow chart of a query dividing process that is executed in a distributed manner.

FIGS. 4-7 are flow charts depicting details of algorithms for dividing queries according to a specified criterion.

FIGS. 8-10 are flow charts depicting details of query division that takes into consideration loading on travel planning system.

DETAILED DESCRIPTION

Referring to FIG. 1, an arrangement 10 for travel planning includes a process 12 to divide low-fare-search queries into sub-queries to be executed concurrently. A user such as a traveler, travel agent or airline reservation agent enters trip information typically including time and airport (i.e. origin and destination) information from a client system 14 into a travel application 16. The travel application 16 is typically accessed via the client system 14 which can be a travel agent terminal, an Internet web browser connected to a travel web site, and so forth. The travel application 16 composes this information into an appropriately formatted query, e.g., a low-fare-search query 18 that is fed via a network 15 to a travel planning system 20. Network 15 can be any type of network such as a public network such as the Internet or telephone system or a private network such as a local area network (LAN), wide area network (WAN), virtual private network (VPN), and so forth. The travel planning system 20 includes a query distributor 22 that alters the query 18 to produce sub-queries 18a-18i that are distributed to various travel planning computers 20a-20n, where n does not necessary have to be equal to i. The travel planning computers 20a-20n execute the sub-queries 18a-18i concurrently to produce answers 24a-24i. The answers 24a-24i to these sub-queries 18a-18i are sent back to the user. In one embodiment, the answers 24a-24i are sent to an answer collator 25, which merges the answers 24a-24i into a composite answer 26. Several merging techniques can be employed, such as returning all answers or selecting the cheapest answers from all the answers and so forth.

The answers for each sub-query may be collected and organized by the answer collator 25. If the form of the sub-query results is a simple list of travel options, the collation process used by the answer collator 25 may simply involve concatenating the answers from each sub-query. However more complex collations schemes are possible, such as selecting a subset of answers from each sub-query (possibly based on cheapest travel options from amongst all of the answers and so forth). Alternatively, if the query division process 12 produces sub-queries that overlap, the collation process 25 could remove duplicate answers. In the case where the travel planning computers produce answers in other forms, such as the pricing graph representation, other methods of collation may be used. For example, multiple pricing graphs can be merged into one by joining them with an OR node. It may also be that no collation process is used, so that answers for the different sub-queries are returned to the travel application as soon as they are available, rather than waiting for all sub-queries to complete.

Referring to FIG. 2 a process 40 for dividing queries is shown. The process 40 receives 42 a query, e.g., a low fare search query. A low-fare-search query typically includes a sequence of specifications of origins, destinations, and travel time periods for each part of a trip. For example, a two-part round trip query might be described as:

Part#
Origin
Destination
Departure Dates

1
BOS
SFO or SJC
August 17th - August 18th

2
SFO or SJC
BOS
August 23rd - August 30^th

The process 40 divides 44 the query into sub-queries based on a criterion. There are many ways such a query could be divided into sub-queries. To reduce unnecessary work, it is typically advantageous to divide a query into sub-queries that do not overlap. For example, if dividing into at most 4 sub-queries, the following divisions of the query according to different criterion as set out in the examples below are all possibilities:

1. By destination airport (2 sub-queries)

- Sub-query 1:

Part#
Origin
Destination
Departure Dates

1
BOS
SFO
August 17th - August 18th

2
SFO
BOS
August 23rd - August 30th

- Sub-query 2:

Part#
Origin
Destination
Departure Dates

1
BOS
SJC
August 17th - August 18th

2
SJC
BOS
August 23rd - August 30th

2. By outbound departure time (4 sub-queries)

- Sub-query 1:

Part#
Origin
Destination
Departure Dates

1
BOS
SFO or SJC
August 17th (0:00 to 13:59)

2
SFO
BOS
August 23rd - August 30th

- Sub-query 2:

Part#
Origin
Destination
Departure Dates

1
BOS
SFO or SJC
August 17th (14:00 to 23:59)

2
SFO
BOS
August 23rd - August 30th

- Sub-query 3:

Part#
Origin
Destination
Departure Dates

1
BOS
SFO or SJC
August 18th (0:00 to 13:59)

2
SFO
BOS
August 23rd - August 30th

- Sub-query 4:

Part#
Origin
Destination
Departure Dates

1
BOS
SFO or SJC
August 18th (14:00 to 23:59)

2
SFO
BOS
August 23rd - August 30th

3. By outbound and return departure times (4 sub-queries)

- Sub-query 1:

Part#
Origin
Destination
Departure Dates

1
BOS
SFO or SJC
August 17th

2
SFO
BOS
August 23rd - August 26th

- Sub-query 2:

Part#
Origin
Destination
Departure Dates

1
BOS
SFO or SJC
August 17th

2
SFO
BOS
August 27th - August 30th

- Sub-query 3:

Part#
Origin
Destination
Departure Dates

1
BOS
SFO or SJC
August 18th

2
SFO
BOS
August 23rd - August 26th

- Sub-query 4:

Part#
Origin
Destination
Departure Dates

1
BOS
SFO or SJC
August 18th

2
SFO
BOS
August 27th - August 30th

4. By airline (4 sub-queries)

- Sub-query 1: