BACKGROUND
The present invention describes a framework for organizing, selecting and launching media items. Part of that framework involves the design and operation of graphical user interfaces with the basic building blocks of point, click, scroll, hover and zoom and, more particularly, to graphical user interfaces associated with media items which can be used with a free-space pointing remote.
Technologies associated with the communication of information have evolved rapidly over the last several decades. Television, cellular telephony, the Internet and optical communication techniques (to name just a few things) combine to inundate consumers with available information and entertainment options. Taking television as an example, the last three decades have seen the introduction of cable television service, satellite television service, pay-per-view movies and video-on-demand. Whereas television viewers of the 1960s could typically receive perhaps four or five over-the-air TV channels on their television sets, today's TV watchers have the opportunity to select from hundreds and potentially thousands of channels of shows and information. Video-on-demand technology, currently used primarily in hotels and the like, provides the potential for in-home entertainment selection from among thousands of movie titles. Digital video recording (DVR) equipment such as offered by TiVo, Inc., 2160 Gold Street, Alviso, Calif. 95002, further expand the available choices.
The technological ability to provide so much information and content to end users provides both opportunities and challenges to system designers and service providers. One challenge is that while end users typically prefer having more choices rather than fewer, this preference is counterweighted by their desire that the selection process be both fast and simple. Unfortunately, the development of the systems and interfaces by which end users access media items has resulted in selection processes which are neither fast nor simple. Consider again the example of television programs. When television was in its infancy, determining which program to watch was a relatively simple process primarily due to the small number of choices. One would consult a printed guide which was formatted, for example, as series of columns and rows which showed the correspondence between (1) nearby television channels, (2) programs being transmitted on those channels and (3) date and time. The television was tuned to the desired channel by adjusting a tuner knob and the viewer watched the selected program. Later, remote control devices were introduced that permitted viewers to tune the television from a distance. This addition to the user-television interface created the phenomenon known as “channel surfing” whereby a viewer could rapidly view short segments being broadcast on a number of channels to quickly learn what programs were available at any given time.
Despite the fact that the number of channels and amount of viewable content has dramatically increased, the generally available user interface and control device options and framework for televisions has not changed much over the last 30 years. Printed guides are still the most prevalent mechanism for conveying programming information. The multiple button remote control with simple up and down arrows is still the most prevalent channel/content selection mechanism. The reaction of those who design and implement the TV user interface to the increase in available media content has been a straightforward extension of the existing selection procedures and interface objects. Thus, the number of rows and columns in the printed guides has been increased to accommodate more channels. The number of buttons on the remote control devices has been increased to support additional functionality and content handling, e.g., as shown in FIG. 1. However, this approach has significantly increased both the time required for a viewer to review the available information and the complexity of actions required to implement a selection. Arguably, the cumbersome nature of the existing interface has hampered commercial implementation of some services, e.g., video-on-demand, since consumers are resistant to new services that will add complexity to an interface that they view as already too slow and complex.
In addition to increases in bandwidth and content, the user interface bottleneck problem is being exacerbated by the aggregation of technologies. Consumers are reacting positively to having the option of buying integrated systems rather than a number of segregable components. A good example of this trend is the combination television/VCR/DVD in which three previously independent components are frequently sold today as an integrated unit. This trend is likely to continue, potentially with an end result that most if not all of the communication devices currently found in the household being packaged as an integrated unit, e.g., a television/VCR/DVD/internet access/radio/stereo unit. Even those who buy separate components desire seamless control of and interworking between them. With this increased aggregation comes the potential for more complexity in the user interface. For example, when so-called “universal” remote units were introduced, e.g., to combine the functionality of TV remote units and VCR remote units, the number of buttons on these universal remote units was typically more than the number of buttons on either the TV remote unit or VCR remote unit individually. This added number of buttons and functionality makes it very difficult to control anything but the simplest aspects of a TV or VCR without hunting for exactly the right button on the remote. Many times, these universal remotes do not provide enough buttons to access many levels of control or features unique to certain TVs. In these cases, the original device remote unit is still needed, and the original hassle of handling multiple remotes remains due to user interface issues arising from the complexity of aggregation. Some remote units have addressed this problem by adding “soft” buttons that can be programmed with the expert commands. These soft buttons sometimes have accompanying LCD displays to indicate their action. These too have the flaw that they are difficult to use without looking away from the TV to the remote control. Yet another flaw in these remote units is the use of modes in an attempt to reduce the number of buttons. In these “moded” universal remote units, a special button exists to select whether the remote should communicate with the TV, DVD player, cable set-top box, VCR, etc. This causes many usability issues including sending commands to the wrong device, forcing the user to look at the remote to make sure that it is in the right mode, and it does not provide any simplification to the integration of multiple devices. The most advanced of these universal remote units provide some integration by allowing the user to program sequences of commands to multiple devices into the remote. This is such a difficult task that many users hire professional installers to program their universal remote units.
Some attempts have also been made to modernize the screen interface between end users and media systems. Electronic program guides (EPGs) have been developed and implemented to replace the afore-described media guides. Early EPGs provided what was essentially an electronic replica of the printed media guides. For example, cable service operators have provided analog EPGs wherein a dedicated channel displays a slowly scrolling grid of the channels and their associated programs over a certain time horizon, e.g., the next two hours. Scrolling through even one hundred channels in this way can be tedious and is not feasibly scalable to include significant additional content deployment, e.g., video-on-demand. More sophisticated digital EPGs have also been developed. In digital EPGs, program schedule information, and optionally applications/system software, is transmitted to dedicated EPG equipment, e.g., a digital set-top box (STB). Digital EPGs provide more flexibility in designing the user interface for media systems due to their ability to provide local interactivity and to interpose one or more interface layers between the user and the selection of the media items to be viewed. An example of such an interface can be found in U.S. Pat. No. 6,421,067 to Kamen et al., the disclosure of which is incorporated here by reference. FIG. 2 depicts a GUI described in the '067 patent. Therein, according to the Kamen et al. patent, a first column 190 lists program channels, a second column 191 depicts programs currently playing, a column 192 depicts programs playing in the next half-hour, and a fourth column 193 depicts programs playing in the half hour after that. The baseball bat icon 121 spans columns 191 and 192, thereby indicating that the baseball game is expected to continue into the time slot corresponding to column 192. However, text block 111 does not extend through into column 192. This indicates that the football game is not expected to extend into the time slot corresponding to column 192. As can be seen, a pictogram 194 indicates that after the football game, ABC will be showing a horse race. The icons shown in FIG. 2 can be actuated using a cursor, not shown, to implement various features, e.g., to download information associated with the selected programming. Other digital EPGs and related interfaces are described, for example, in U.S. Pat. Nos. 6,314,575, 6,412,110, and 6,577,350, the disclosures of which are also incorporated here by reference.
However, the interfaces described above suffer from, among other drawbacks, an inability to easily scale between large collections of media items and small collections of media items. For example, interfaces which rely on lists of items may work well for small collections of media items, but are tedious to browse for large collections of media items. Interfaces which rely on hierarchical navigation (e.g., tree structures) may be more speedy to traverse than list interfaces for large collections of media items, but are not readily adaptable to small collections of media items. Additionally, users tend to lose interest in selection processes wherein the user has to move through three or more layers in a tree structure. For all of these cases, current remote units make this selection processor even more tedious by forcing the user to repeatedly depress the up and down buttons to navigate the list or hierarchies. When selection skipping controls are available such as page up and page down, the user usually has to look at the remote to find these special buttons or be trained to know that they even exist.
Organizing frameworks, techniques and systems which simplify the control and screen interface between users and media systems as well as accelerate the selection process have been described in U.S. patent application Ser. No. 10/768,432, filed on Jan. 30, 2004, entitled “A Control Framework with a Zoomable Graphical User Interface for Organizing, Selecting and Launching Media Items”, the disclosure of which is incorporated here by reference and which is hereafter referred to as the “'432 application”. Such frameworks permit service providers to take advantage of the increases in available bandwidth to end user equipment by facilitating the supply of a large number of media items and new services to the user.
Typically software development associated with user interface and application building associated with, for example, set-top box and TV systems involves a choice between two extremes. One approach is to develop all of the software as one unified application. This approach has the advantage that the interaction between the user and the user interface is fully encapsulated and the performance is fully controlled. The disadvantage of this approach is that the development of new features for the user interface is slow because the whole application is affected whenever something is changed. At the other end of the spectrum, there is the approach of designing the user interface much like a web browser. Using this approach, a small machine is built that interprets HTML code to build up the user interface screens. One advantage of this second approach is that development of applications is very quick. A disadvantage of this approach is that interactions are not fully encapsulated and bandwidth performance issues are not fully controlled. Since consistent user interactions are important for a good user interface design, particularly in TV user interface design, the former problem may be significant. Moreover, since set-top boxes, for example, frequently have to cope with severe bandwidth limitations, this latter problem may also be troubling.
Accordingly, it would be desirable to provide user interfaces, methods and software design constructions which overcome these difficulties.
SUMMARY
Systems and methods according to the present invention address these needs and others by providing a user interface displayed on a screen with a plurality of control elements, at least some of the plurality of control elements having at least one alphanumeric character displayed thereon. A text box for displaying alphanumeric characters entered using the plurality of control elements and a plurality of groups of displayed items. The layout of the plurality of groups on the user interface is based on a first number of groups which are displayed, and wherein a layout of the displayed items within a group is based on a second number of items displayed within that group.
According to an exemplary embodiment of the present invention, a method for distributed software construction associated with a metadata handling system includes the steps of providing a plurality of a first type of system-wide software constructs, each of which define user interactions with a respective, high level, metadata category, and providing at least one second type of lower level system-wide software constructs, wherein each of the plurality of first type of system-wide software constructs are composed of one or more of the second type of lower level system-wide software constructs.
According to another exemplary embodiment of the present invention, a metadata handling system having a distributed software construction includes a metadata supply source for supplying various types of metadata to the metadata handling system, a plurality of a first type of system-wide software constructs, each of which define user interactions with a respective, high level, metadata category, and at least one second type of lower level system-wide software constructs, wherein each of the plurality of first type of system-wide software constructs are composed of one or more of the second type of lower level system-wide software constructs.
BRIEF DESCRIPTION OF THE DRAWINGS
The accompanying drawings illustrate exemplary embodiments of the present invention, wherein:
FIG. 1 depicts a conventional remote control unit for an entertainment system;
FIG. 2 depicts a conventional graphical user interface for an entertainment system;
FIG. 3 depicts an exemplary media system in which exemplary embodiments of the present invention (both display and remote control) can be implemented;
FIG. 4 shows a system controller of FIG. 3 in more detail;
FIGS. 5-8 depict a graphical user interface for a media system according to an exemplary embodiment of the present invention;
FIG. 9 illustrates an exemplary data structure according to an exemplary embodiment of the present invention;
FIGS. 10(a) and 10(b) illustrate a zoomed out and a zoomed in version of a portion of an exemplary GUI created using the data structure of FIG. 9 according to an exemplary embodiment of the present invention;
FIG. 11 depicts a doubly linked, ordered list used to generated GUI displays according to an exemplary embodiment of the present invention;
FIGS. 12(a) and 12(b) show a zoomed out and a zoomed in version of a portion of another exemplary GUI used to illustrate operation of a node watching algorithm according to an exemplary embodiment of the present invention;
FIGS. 13(a) and 13(b) depict exemplary data structures used to illustrate operation of the node watching algorithm as it the GUI transitions from the view of FIG. 12(a) to the view of FIG. 12(b) according to an exemplary embodiment of the present invention;
FIG. 14 depicts a data structure according to another exemplary embodiment of the present invention including a virtual camera for use in resolution consistent zooming;
FIGS. 15(a) and 15(b) show a zoomed out and zoomed in version of a portion of an exemplary GUI which depict semantic zooming according to an exemplary embodiment of the present invention;
FIGS. 16-20 depict a zoomable graphical user interface according to another exemplary embodiment of the present invention;
FIG. 21 illustrates an exemplary set of overlay controls which can be provided according to exemplary embodiments of and the present invention;
FIG. 22 illustrates an exemplary framework for implementing zoomable graphical user interfaces according to the present invention;
FIG. 23 shows a data flow associated with generating a zoomable graphical user interface according to an exemplary embodiment of the present invention;
FIG. 24 illustrates a GUI screen drawn using a brick according to exemplary embodiments of the present invention;
FIG. 25 illustrates a second GUI screen drawn using a brick according to exemplary embodiments of the present invention;
FIG. 26 illustrates a toolkit screen usable to create bricks according to exemplary embodiments of the present invention;
FIG. 27 illustrates a system in which system bricks are employed as system building blocks which facilitate distributed software design according to an exemplary embodiment of the present invention; and
FIG. 28 depicts a hierarchy of different types of bricks according to an exemplary embodiment of the present invention.
DETAILED DESCRIPTION
The following detailed description of the invention refers to the accompanying drawings. The same reference numbers in different drawings identify the same or similar elements. Also, the following detailed description does not limit the invention. Instead, the scope of the invention is defined by the appended claims.
In order to provide some context for this discussion, an exemplary aggregated media system 200 which the present invention can be used to implement will first be described with respect to FIGS. 3-22. Those skilled in the art will appreciate, however, that the present invention is not restricted to implementation in this type of media system and that more or fewer components can be included therein. Therein, an input/output (I/O) bus 210 connects the system components in the media system 200 together. The I/O bus 210 represents any of a number of different of mechanisms and techniques for routing signals between the media system components. For example, the I/O bus 210 may include an appropriate number of independent audio “patch” cables that route audio signals, coaxial cables that route video signals, two-wire serial lines or infrared or radio frequency transceivers that route control signals, optical fiber or any other routing mechanisms that route other types of signals.
In this exemplary embodiment, the media system 200 includes a television/monitor 212, a video cassette recorder (VCR) 214, digital video disk (DVD) recorder/playback device 216, audio/video tuner 218 and compact disk player 220 coupled to the I/O bus 210. The VCR 214, DVD 216 and compact disk player 220 may be single disk or single cassette devices, or alternatively may be multiple disk or multiple cassette devices. They may be independent units or integrated together. In addition, the media system 200 includes a microphone/speaker system 222, video camera 224 and a wireless I/O control device 226. According to exemplary embodiments of the present invention, the wireless I/O control device 226 is a media system remote control unit that supports free space pointing, has a minimal number of buttons to support navigation, and communicates with the entertainment system 200 through RF signals. For example, wireless I/O control device 226 can be a free-space pointing device which uses a gyroscope or other mechanism to define both a screen position and a motion vector to determine the particular command desired. A set of buttons can also be included on the wireless I/O device 226 to initiate the “click” primitive described below as well as a “back” button. In another exemplary embodiment, wireless I/O control device 226 is a media system remote control unit, which communicates with the components of the entertainment system 200 through IR signals. In yet another embodiment, wireless I/O control device 134 may be an IR remote control device similar in appearance to a typical entertainment system remote control with the added feature of a track-ball or other navigational mechanisms which allows a user to position a cursor on a display of the entertainment system 100.
The entertainment system 200 also includes a system controller 228. According to one exemplary embodiment of the present invention, the system controller 228 operates to store and display entertainment system data available from a plurality of entertainment system data sources and to control a wide variety of features associated with each of the system components. As shown in FIG. 3, system controller 228 is coupled, either directly or indirectly, to each of the system components, as necessary, through I/O bus 210. In one exemplary embodiment, in addition to or in place of I/O bus 210, system controller 228 is configured with a wireless communication transmitter (or transceiver), which is capable of communicating with the system components via IR signals or RF signals. Regardless of the control medium, the system controller 228 is configured to control the media components of the media system 200 via a graphical user interface described below.
As further illustrated in FIG. 3, media system 200 may be configured to receive media items from various media sources and service providers. In this exemplary embodiment, media system 200 receives media input from and, optionally, sends information to, any or all of the following sources: cable broadcast 230, satellite broadcast 232 (e.g., via a satellite dish), very high frequency (VHF) or ultra high frequency (UHF) radio frequency communication of the broadcast television networks 234 (e.g., via an aerial antenna), telephone network 236 and cable modem 238 (or another source of Internet content). Those skilled in the art will appreciate that the media components and media sources illustrated and described with respect to FIG. 3 are purely exemplary and that media system 200 may include more or fewer of both. For example, other types of inputs to the system include AM/FM radio and satellite radio.
FIG. 4 is a block diagram illustrating an embodiment of an exemplary system controller 228 according to the present invention. System controller 228 can, for example, be implemented as a set-top box and includes, for example, a processor 300, memory 302, a display controller 304, other device controllers (e.g., associated with the other components of system 200), one or more data storage devices 308 and an I/O interface 310. These components communicate with the processor 300 via bus 312. Those skilled in the art will appreciate that processor 300 can be implemented using one or more processing units. Memory device(s) 302 may include, for example, DRAM or SRAM, ROM, some of which may be designated as cache memory, which store software to be run by processor 300 and/or data usable by such programs, including software and/or data associated with the graphical user interfaces described below. Display controller 304 is operable by processor 300 to control the display of monitor 212 to, among other things, display GUI screens and objects as described below. Zoomable GUIs according to exemplary embodiments of the present invention provide resolution independent zooming, so that monitor 212 can provide displays at any resolution. Device controllers 306 provide an interface between the other components of the media system 200 and the processor 300. Data storage 308 may include one or more of a hard disk drive, a floppy disk drive, a CD-ROM device, or other mass storage device. Input/output interface 310 may include one or more of a plurality of interfaces including, for example, a keyboard interface, an RF interface, an IR interface and a microphone/speech interface. According to one exemplary embodiment of the present invention, I/O interface 310 will include an interface for receiving location information associated with movement of a wireless pointing device.
Generation and control of a graphical user interface according to exemplary embodiments of the present invention to display media item selection information is performed by the system controller 228 in response to the processor 300 executing sequences of instructions contained in the memory 302. Such instructions may be read into the memory 302 from other computer-readable mediums such as data storage device(s) 308 or from a computer connected externally to the media system 200. Execution of the sequences of instructions contained in the memory 302 causes the processor to generate graphical user interface objects and controls, among other things, on monitor 212. In alternative embodiments, hard-wire circuitry may be used in place of or in combination with software instructions to implement the present invention. As mentioned in the Background section, conventional interface frameworks associated with the television industry are severely limited in their ability to provide users with a simple and yet comprehensive selection experience. Accordingly, control frameworks described herein overcome these limitations and are, therefore, intended for use with televisions, albeit not exclusively. It is also anticipated that the revolutionary control frameworks, graphical user interfaces and/or various algorithms described herein will find applicability to interfaces which may be used with computers and other non-television devices. In order to distinguish these various applications of exemplary embodiments of the present invention, the terms “television” and “TV” are used in this specification to refer to a subset of display devices, whereas the terms “GUI”, “GUI screen”, “display” and “display screen” are intended to be generic and refer to television displays, computer displays and any other display device. More specifically, the terms “television” and “TV” are intended to refer to the subset of display devices which are able to display television signals (e.g., NTSC signals, PAL signals or SECAM signals) without using an adapter to translate television signals into another format (e.g., computer video formats). In addition, the terms “television” and “TV” refer to a subset of display devices that are generally viewed from a distance of several feet or more (e.g., sofa to a family room TV) whereas computer displays are generally viewed close-up (e.g., chair to a desktop monitor).
Having described an exemplary media system which can be used to implement control frameworks including zoomable graphical interfaces according to the present invention, several examples of such interfaces will now be described. According to exemplary embodiments of the present invention, a user interface displays selectable items which can be grouped by category. A user points a remote unit at the category or categories of interest and depresses the selection button to zoom in or the “back” button to zoom back. Each zoom in, or zoom back, action by a user results in a change in the magnification level and/or context of the selectable items rendered by the user interface on the screen. According to exemplary embodiments, each change in magnification level can be consistent, i.e., the changes in magnification level are provided in predetermined steps. Exemplary embodiments of the present invention also provide for user interfaces which incorporate several visual techniques to achieve scaling to the very large. These techniques involve a combination of building blocks and techniques that achieve both scalability and ease-of-use, in particular techniques which adapt the user interface to enhance a user's visual memory for rapid re-visiting of user interface objects.
The user interface is largely a visual experience. In such an environment exemplary embodiments of the present invention make use of the capability of the user to remember the location of objects within the visual environment. This is achieved by providing a stable, dependable location for user interface selection items. Each object has a location in the zoomable layout. Once the user has found an object of interest it is natural to remember which direction was taken to locate the object. If that object is of particular interest it is likely that the user will re-visit the item more than once, which will reinforce the user's memory of the path to the object. User interfaces according to exemplary embodiments of the present invention provide visual mnemonics that help the user remember the location of items of interest. Such visual mnemonics include pan and zoom animations, transition effects which generate a geographic sense of movement across the user interface's virtual surface and consistent zooming functionality, among other things which will become more apparent based on the examples described below.
Organizing mechanisms are provided to enable the user to select from extremely large sets of items while being shielded from the details associated with large selection sets. Various types of organizing mechanisms can be used in accordance with the present invention and examples are provided below.
Referring first to FIGS. 5-8, an exemplary control framework including a zoomable graphical user interface according to an exemplary embodiment of the present invention is described for use in displaying and selecting musical media items. FIG. 5 portrays the zoomable GUI at its most zoomed out state. Therein, the interface displays a set of shapes 500. Displayed within each shape 500 are text 502 and/or a picture 504 that describe the group of media item selections accessible via that portion of the GUI. As shown in FIG. 5, the shapes 500 are rectangles, and text 502 and/or picture 504 describe the genre of the media. However, those skilled in the art will appreciate that this first viewed GUI grouping could represent other aspects of the media selections available to the user e.g., artist, year produced, area of residence for the artist, length of the item, or any other characteristic of the selection. Also, the shapes used to outline the various groupings in the GUI need not be rectangles. Shrunk down versions of album covers and other icons could be used to provide further navigational hints to the user in lieu of or in addition to text 502 and/or picture 504 within the shape groupings 500. A background portion of the GUI 506 can be displayed as a solid color or be a part of a picture such as a map to aid the user in remembering the spatial location of genres so as to make future uses of the interface require less reading. The selection pointer (cursor) 508 follows the movements of an input device and indicates the location to zoom in on when the user presses the button on the device (not shown in FIG. 5).
According to one exemplary embodiment of the present invention, the input device can be a free space pointing device, e.g., the free space pointing device described in U.S. patent application Ser. No. 11/119,683, filed on May 2, 2005, entitled “Free Space Pointing Devices and Methods”, the disclosure of which is incorporated here by reference and which is hereafter referred to as the “'683 application”, coupled with a graphical user interface that supports the point, click, scroll, hover and zoom building blocks which are described in more detail below. One feature of this exemplary input device that is beneficial for use in conjunction with the present invention is that it can be implemented with only two buttons and a scroll wheel, i.e., three input actuation objects. One of the buttons can be configured as a ZOOM IN (select) button and one can be configured as a ZOOM OUT (back) button. Compared with the conventional remote control units, e.g., that shown in FIG. 1, the present invention simplifies this aspect of the GUI by greatly reducing the number of buttons, etc., that a user is confronted with in making his or her media item selection. An additional preferred, but not required, feature of input devices according to exemplary embodiments of the present invention is that they provide “free space pointing” capability for the user. The phrase “free space pointing” is used in this specification to refer to the ability of a user to freely move the input device in three (or more) dimensions in the air in front of the display screen and the corresponding ability of the user interface to translate those motions directly into movement of a cursor on the screen. Thus “free space pointing” differs from conventional computer mouse pointing techniques which use a surface other than the display screen, e.g., a desk surface or mousepad, as a proxy surface from which relative movement of the mouse is translated into cursor movement on the computer display screen. Use of free space pointing in control frameworks according to exemplary embodiments of the present invention further simplifies the user's selection experience, while at the same time providing an opportunity to introduce gestures as distinguishable inputs to the interface. A gesture can be considered as a recognizable pattern of movement over time which pattern can be translated into a GUI command, e.g., a function of movement in the x, y, z, yaw, pitch and roll dimensions or any subcombination thereof. Those skilled in the art will appreciate, however that any suitable input device can be used in conjunction with zoomable GUIs according to the present invention. Other examples of suitable input devices include, but are not limited to, trackballs, touchpads, conventional TV remote control devices, speech input, any devices which can communicate/translate a user's gestures into GUI commands, or any combination thereof. It is intended that each aspect of the GUI functionality described herein can be actuated in frameworks according to the present invention using at least one of a gesture and a speech command. Alternate implementations include using cursor and/or other remote control keys or even speech input to identify items for selection.
FIG. 6 shows a zoomed in view of Genre 3 that would be displayed if the user selects Genre 3 from FIG. 5, e.g., by moving the cursor 508 over the area encompassed by the rectangle surrounding Genre 3 on display 212 and depressing a button on the input device. The interface can animate the zoom from FIG. 5 to FIG. 6 so that it is clear to the user that a zoom occurred. An example of such an animated zoom/transition effect is described below. Once the shape 516 that contains Genre 3 occupies most of the screen on display 212, the interface reveals the artists that have albums in the genre. In this example, seven different artists and/or their works are displayed. The unselected genres 515 that were adjacent to Genre 3 in the zoomed out view of FIG. 5 are still adjacent to Genre 3 in the zoomed in view, but are clipped by the edge of the display 212. These unselected genres can be quickly navigated to by selection of them with selection pointer 508. It will be appreciated, however, that other exemplary embodiments of the present invention can omit clipping neighboring objects and, instead, present only the unclipped selections. Each of the artist groups, e.g., group 512, can contain images of shrunk album covers, a picture of the artist or customizable artwork by the user in the case that the category contains playlists created by the user.
A user may then select one of the artist groups for further review and/or selection. FIG. 7 shows a further zoomed in view in response to a user selection of Artist 3 via positioning of cursor 508 and actuation of the input device, in which images of album covers 520 come into view. As with the transition from the GUI screen of FIG. 5 and FIG. 6, the unselected, adjacent artists (artists #2, 6 and 7 in this example) are shown towards the side of the zoomed in display, and the user can click on these with selection pointer 508 to pan to these artist views. In this portion of the interface, in addition to the images 520 of album covers, artist information 524 can be displayed as an item in the artist group. This information may contain, for example, the artist's picture, biography, trivia, discography, influences, links to web sites and other pertinent data. Each of the album images 520 can contain a picture of the album cover and, optionally, textual data. In the case that the album image 520 includes a user created playlist, the graphical user interface can display a picture which is selected automatically by the interface or preselected by the user.
Finally, when the user selects an album cover image 520 from within the group 521, the interface zooms into the album cover as shown in FIG. 8. As the zoom progresses, the album cover can fade or morph into a view that contains items such as the artist and title of the album 530, a list of tracks 532, further information about the album 536, a smaller version of the album cover 528, and controls 534 to play back the content, modify the categorization, link to the artists web page, or find any other information about the selection. Neighboring albums 538 are shown that can be selected using selection pointer 508 to cause the interface to bring them into view. As mentioned above, alternative embodiments of the present invention can, for example, zoom in to only display the selected object, e.g., album 5, and omit the clipped portions of the unselected objects, e.g., albums 4 and 6. This final zoom provides an example of semantic zooming, wherein certain GUI elements are revealed that were not previously visible at the previous zoom level. Various techniques for performing semantic zooming according to exemplary embodiments of the present invention are provided below.
As illustrated in the FIGS. 5-8 and the description, this exemplary embodiment of a graphical user interface provides for navigation of a music collection. Interfaces according to the present invention can also be used for video collections such as for DVDs, VHS tapes, other recorded media, video-on-demand, video segments and home movies. Other audio uses include navigation of radio shows, instructional tapes, historical archives, and sound clip collections. Print or text media such as news stories and electronic books can also be organized and accessed using this invention.
As will be apparent to those skilled in the art from the foregoing description, zoomable graphical user interfaces according to the present invention provide users with the capability to browse a large (or small) number of media items rapidly and easily. This capability is attributable to many characteristics of interfaces according to exemplary embodiments of the present invention including, but not limited to: (1) the use of images as all or part of the selection information for a particular media item, (2) the use of zooming to rapidly provide as much or as little information as a user needs to make a selection and (3) the use of several GUI techniques which combine to give the user the sense that the entire interface resides on a single plane, such that navigation of the GUI can be accomplished, and remembered, by way of the user's sense of direction. This latter aspect of GUIs according to the present invention can be accomplished by, among other things, linking the various GUI screens together “geographically” by maintaining as much GUI object continuity from one GUI screen to the next, e.g., by displaying edges of neighboring, unselected objects around the border of the current GUI screen. Alternatively, if a cleaner view is desired, and other GUI techniques provide sufficient geographic feedback, then the clipped objects can be omitted. As used in this text, the phrase “GUI screen” refers to a set of GUI objects rendered on one or more display units at the same time. A GUI screen may be rendered on the same display which outputs media items, or it may be rendered on a different display. The display can be a TV display, computer monitor or any other suitable GUI output device.
Another GUI effect which enhances the user's sense of GUI screen connectivity is the panning animation effect which is invoked when a zoom is performed or when the user selects an adjacent object at the same zoom level as the currently selected object. Returning to the example of FIG. 5, as the user is initially viewing this GUI screen, his or her point-of-view is centered about point 550. However, when he or she selects Genre 3 for zooming in, his or her point-of-view will shift to point 552. According to exemplary embodiments of the present invention, the zoom in process is animated to convey the shifting the POV center from point 550 to 552. This panning animation can be provided for every GUI change, e.g., from a change in zoom level or a change from one object to another object on the same GUI zoom level. Thus if, for example, a user situated in the GUI screen of FIG. 6 selected the leftmost unselected genre 515 (Genre 2), a panning animation would occur which would give the user the visual impression of “moving” left or west. Exemplary embodiments of the present invention employ such techniques to provide a consistent sense of directional movement between GUI screens enables users to more rapidly navigate the GUI, both between zoom levels and between media items at the same zoom level.
Various data structures and algorithms can be used to implement zoomable GUIs according to the present invention. For example, data structures and algorithms for panning and zooming in an image browser which displays photographs have been described, for example, in the article entitled “Quantum Treemaps and Bubblemaps for a Zoomable Image Browser” by Benjamin B. Bederson, UIST 2001, ACM Symposium on User Interface Software and Technology, CHI Letters, 3(2), pp. 71-80, the disclosure of which is incorporated here by reference. However, in order to provide a GUI for media selection which can, at a high level, switch between numerous applications and, at a lower level, provide user controls associated with selected images to perform various media selection functions, additional data structures and algorithms are needed.
Zoomable GUIs can be conceptualized as supporting panning and zooming around a scene of user interface components in the view port of a display device. To accomplish this effect, zoomable GUIs according to exemplary embodiments of the present invention can be implemented using scene graph data structures. Each node in the scene graph represents some part of a user interface component, such as a button or a text label or a group of interface components. Children of a node represent graphical elements (lines, text, images, etc.) internal to that node. For example, an application can be represented in a scene graph as a node with children for the various graphical elements in its interface. Two special types of nodes are referred to herein as cameras and layers. Cameras are nodes that provide a view port into another part of the scene graph by looking at layer nodes. Under these layer nodes user interface elements can be found. Control logic for a zoomable interface programmatically adjusts a cameras view transform to provide the effect of panning and zooming.
FIG. 9 shows a scene graph that contains basic zoomable interface elements which can be used to implement exemplary embodiments of the present invention, specifically it contains one camera node 900 and one layer node 902. The dotted line between the camera node 900 and layer node 902 indicates that the camera node 900 has been configured to render the children of the layer node 902 in the camera's view port. The attached display device 904 lets the user see the camera's view port. The layer node 902 has three children nodes 904 that draw a circle and a pair of ovals. The scene graph further specifies that a rectangle is drawn within the circle and three triangles within the rectangle by way of nodes 912-918. The scene graph is tied into other scene graphs in the data structure by root node 920. Each node 906-918 has the capability of scaling and positioning itself relative to its parent by using a local coordinate transformation matrix. FIGS. 10(a) and 10(b) illustrate what the scene graph appears like when rendered through the camera at a first, zoomed out level of magnification and a second, zoomed in level of magnification, respectively.
Rendering the scene graph can be accomplished as follows. Whenever the display 904 needs to be updated, e.g., when the user triggers a zoom-in from the view of FIG. 10(a) to the view of FIG. 10(b), a repaint event calls the camera node 900 attached to the display 904 to render itself. This, in turn, causes the camera node 900 to notify the layer node 902 to render the area within the camera's view port. The layer node 902 renders itself by notifying its children to render themselves, and so on. The current transformation matrix and a bounding rectangle for the region to update is passed at each step and optionally modified to inform each node of the proper scale and offset that they should use for rendering. Since the scene graphs of applications operating within zoomable GUIs according to the present invention may contain thousands of nodes, each node can check the transformation matrix and the area to be updated to ensure that their drawing operations will indeed be seen by the user. Although the foregoing example, describes a scene graph including one camera node and one layer node, it will be appreciated that exemplary embodiments of the present invention can embed multiple cameras and layers. These embedded cameras can provide user interface elements such as small zoomed out maps that indicate the user's current view location in the whole zoomable interface, and also allow user interface components to be independently zoomable and pannable.
When using a zoomable interface to coordinate the operation of multiple applications, e.g., like the exemplary movie browser described below with respect to FIGS. 14-18, the memory and resource requirements for each application may exceed the total memory available in the media system. This suggests that applications unload some or all of their code and data when the user is no longer viewing them. However, in zoomable GUIs according to the present invention it can be desirable to provide the appearance that some or all of the applications appear active to the user at all times. To satisfy these two competing objectives, the applications which are “off-screen” from the user's point of view can be put into a temporarily suspended state. To achieve this behavior in zoomable GUIs according to exemplary embodiments of the present invention, events are sent to applications to indicate when they enter and exit a view port. One way to implement such events is to add logic to the code that renders a component so that it detects when the user enters a view port. However, this implies that the notification logic gets invoked at every rendering event and, more importantly, that it cannot easily detect when the user has navigated the view port away from the component. Another method for sending events to applications is to incorporate the notification logic into the GUI navigation elements (such as hyperlinks and buttons), so that they send notifications to the component when they change the view port of a camera to include the component of interest. However, this requires the programmer to vigilantly add notification code to all possible navigation UI elements.
According to one exemplary embodiment, a computationally efficient node watcher algorithm can be used to notify applications regarding when GUI components and/or applications enter and exit the view of a camera. At a high level, the node watcher algorithm has three main processing stages: (1) initialization, (2) view port change assessment and (3) scene graph change assessment. The initialization stage computes node quantities used by the view port change assessment stage and initializes appropriate data structures. The view port change assessment stage gets invoked when the view port changes and notifies all watched nodes that entered or exited the view port. Finally, the scene graph change assessment stage updates computations made at the initialization stage that have become invalid due to changes in the scene graph. For example, if an ancestor node of the watched node changes location in the scene graph, computations made at initialization may need to be recomputed.
Of these stages, view port change assessment drives the rest of the node watcher algorithm. To delineate when a node enters and exits a view port, the initialization step determines the bounding rectangle of the desired node and transforms it from its local coordinate system to the local coordinate system of the view port. In this way, checking node entrance does not require a sequence of coordinate transformations at each view port change. Since the parents of the node may have transform matrices, this initialization step requires traversing the scene graph from the node up to the camera. As described below, if embedded cameras are used in the scene graph data structure, then multiple bounding rectangles may be needed to accommodate the node appearing in multiple places.
Once the bounding rectangle for each watched node has been computed in the view port coordinate system, the initialization stage adds the bounding rectangle to the view port change assessment data structures. The node watcher algorithm uses a basic building block for each dimension in the scene. In zoomable interfaces according to some exemplary embodiments, this includes an x dimension, a y dimension, and a scale dimension. As described below, however, other exemplary implementations may have additional or different dimensions. The scale dimension describes the magnification level of the node in the view port and is described by the following equation:
Where s is the scale, d is the distance from one point of the node to another in the node's local coordinates and d′ is the distance from that point to the other in the view port.
FIG. 11 shows an exemplary building block for detecting scene entrance and exit in one dimension. The following describes handling in the x dimension, but those skilled in the art will appreciate that the other dimensions can be handled in a similar manner. The Region Block 1100 contains references to the transformed bounding rectangle coordinates. This includes the left and right (top and bottom or minimum and maximum scale) offsets of the rectangle. The left and right offsets are stored in Transition Blocks 1102 and 1104, respectively, that are themselves placed in an ordered doubly linked list, such that lower numbered offsets are towards the beginning. The current view port bounds are stored in the View Bounds block 1106. Block 1106 has pointers to the Transition Blocks just beyond the left and right side of the view, e.g., the Transition Block immediately to the right of the one pointed to by View Left Side is in the view unless that latter block is pointed to by View Right Side.
When the view port changes, the following processing occurs for each dimension. First, the View Left Side and View Right Side pointers are checked to see if they need to be moved to include or exclude a Transition Block. Next, if one or both of the pointers need to be moved, they are slid over the Transition Block list to their new locations. Then, for each Transition Block passed by the View Left Side and View Right Side pointers, the node watcher algorithm executes the Transition Block notification code described below. This notification code determines if it is possible that its respective node may have entered or exited the view port. If so, that node is added to a post processing list. Finally, at the end of this processing for each dimension, each node on the post processing list is checked that its view port status actually did change (as opposed to changing and then changing back). If a change did occur, then the algorithm sends an event to the component. Note that if the view port jumps quickly to a new area of the zoomable interface that the algorithm may detect more spurious entrance and exit events.
The Transition Block notification code can be implemented as a table lookup that determines whether the node moved into or out of the view port for the dimension being checked. An exemplary table is shown below.
TABLE 1
|
|
Transition Notification Table
NodeViewView MovePartial IntersectionFull Intersection
sidesideDirectionNotificationNotification
|
LeftLeftLeftNoneEnter
LeftLeftRightNoneExit
RightLeftLeftEnterNone
RightLeftRightExitNone
LeftRightLeftExitNone
LeftRightRightEnterNone
RightRightLeftNoneExit
RightRightRightNoneEnter
|
Columns 1, 2 and 3 are the inputs to the Transition Notification Table. Specifically, the node watcher algorithm addresses the table using a combination of the node side, view side and view move direction to determine whether the node being evaluated was entered, exited or not impacted. Column 1 refers to the side of the node represented by the Transition Block that was passed by the view port pointers. Column 2 refers to the side of the view port and column 3 refers to the direction that that side of the view port was moving when it passed the node's Transition Block. Either output column 4 or 5 is selected depending upon whether the node should be notified when it is partially or fully in view. For example, in some implementations it may be desirable to notify an application such as a streaming video window only after it is fully in view since loading a partially-in-view video window into the zoomable GUI may be visually disruptive.
When the output of the table indicates enter or exit, the node watcher algorithm adds the node to the post processing list. The output columns of Table 1 are populated based on the following rules. If the node intersects in all dimensions then an enter notification will be sent in the post processing step. If the node was in the view and now one or more dimensions have stopped intersecting, then an exit notification will be sent. To reduce the number of nodes in the post processing list, the Transition Block notification code checks for intersection with other dimensions before adding the node to the list. This eliminates the post processing step when only one or two dimensions out of the total number of dimensions, e.g., three or more, intersect. When a user interface object (e.g., an application) wants to be notified of its view port status in the GUI, it registers a function with the node watcher algorithm. When the application goes into or out of the view, the node watcher algorithm calls that application's registered function with a parameter that indicates the event which occurred. Alternatively, notification can be performed using message passing. In this case, each application has an event queue. The application tells the node watcher algorithm how to communicate with its event queue. For example, it could specify the queue's address. Then, when the node watcher detects a transition, it creates a data structure that contains the cause of the notification and places it in the application's queue.
In addition to using node watcher notifications for application memory management, this algorithm can also be used for other functions in zoomable GUIs according to the present invention. For example, the node watcher algorithm can be used to change application behavior based on the user's view focus, e.g., by switching the audio output focus to the currently viewed application. Another application for the node watcher algorithm is to load and unload higher resolution and composite images when the magnification level changes. This reduces the computational load on the graphics renderer by having it render fewer objects whose resolution more closely matches the display. In addition to having the node watcher algorithm watch a camera's view port, it is also useful to have it watch the navigation code that tells the view port where it will end up after an animation. This provides earlier notification of components that are going to come into view and also enables zoomable GUIs according to exemplary embodiments of the present invention to avoid sending notifications to nodes that are flown over due to panning animations.
To better understand operation of the node watcher algorithm, an example will now be described with reference to FIGS. 12(a), 12(b), 13(a) and 13(b). FIGS. 12(a) and 12(b) depict a portion of a zoomable GUI at two different magnification levels. At the lower magnification level of FIG. 12(a), three nodes are visible: a circle, a triangle and an ellipse. In FIG. 12(b), the view has been zoomed in so much that the ellipse and circle are only partially visible, and the triangle is entirely outside of the view. These nodes may, for example, represent applications or user interface components that depend on efficient event notification and, therefore, are tracked by the node watcher algorithm according to exemplary embodiments of the present invention. In this example, the bounding rectangles for each node are explicitly illustrated in FIGS. 12(a) and 12(b) although those skilled in the art will appreciate that the bounding rectangles would not typically be displayed on the GUI. Each side of each of the bounding rectangles has been labeled in FIGS. 12(a) and 12(b), and these labels will be used to show the correspondence between the bounding rectangle sides and the transition block data structure which were described above.
FIG. 13(a) shows exemplary node watcher data structures for the horizontal dimension for the zoomed out view of FIG. 12(a). Therein, each side of a node's bounding rectangle is represented using a transition block. The horizontal transition blocks are shown in FIG. 13(a) in the order that they appear on the GUI screen from left to right. For example, the left side of the circle, CLeft, comes first and then the left side of the triangle, TLeft, and so on until the right side of the ellipse, ERight. Both ends of the list are marked with empty sentinel transition blocks. Also shown in FIG. 13(a) are the region blocks for each node and their corresponding pointers to their bounding rectangle's horizontal transition blocks. At the bottom of FIG. 13(a) is the view bounds data structure that contains pointers to the transition blocks that are just outside of the current view. For the zoomed out view, all nodes are completely visible, and therefore all of their transition blocks are between the transition blocks pointed to by the view bounds data structure.
FIG. 13(b) shows the node watcher data structures for the zoomed in view of FIG. 12(b). Therein, it can be seen that the view bounds part of the data structure has changed so that it now points to the transition blocks for the right side of the triangle, TRight, and the right side of the ellipse, ERight, since these two bounding rectangle sides are just outside of the current (zoomed in) view.
Given these exemplary data structures and GUI scenes, the associated processing within the node watcher algorithm while the zoom transition occurs can be described as follows. Starting with the left side of the view, the node watcher algorithm moves the view left side pointer to the right until the transition block that is just outside of the view on the left side is reached. As shown in FIG. 13(b), the view left side pointer first passes the CLeft transition block. For this example, assume that the circle node represents an application or other user interface object associated with the zoomable GUI that requires a notification when it is not fully visible in the view. Given these inputs to the node watcher algorithm, Table 1 indicates that the circle node should receive an exit notification for the horizontal dimension. Of course, the node watcher algorithm will typically aggregate notifications from all dimensions before notifying the node to avoid sending redundant exit notifications. Next, the view left side pointer passes the left side of the triangle, TLeft. If the triangle node has requested notifications for when it completely leaves the view, then the node watcher algorithm indicates per Table 1 that no notification is necessary. However, when the view pointer passes TRight, Table 1 indicates that the triangle has exited the view entirely and should be notified. The view pointer stops here since the right side of the circle's bounding rectangle, CRight, is still visible in the view.
From the right side, the node watcher algorithm's processing is similar. The view right side pointer moves left to the ellipse's right side ERight. Depending on whether the ellipse has requested full or partial notifications, the node watcher algorithm will or will not send a notification to the ellipse pursuant to Table 1. The vertical dimension can be processed in a similar manner using similar data structures and the top and bottom boundary rectangle values. Those skilled in the arts will also appreciate that a plurality of boundary rectangles can be used to approximate non-rectangular nodes when more precise notification is required. Additionally, the present invention contemplates that movement through other dimensions can be tracked and processed by the node watcher algorithm, e.g., a third geometrical (depth or scale) dimension, as well as non-geometrical dimensions such as time, content rating (adult, PG-13, etc.) and content type (drama, comedy, etc). Depending on the number of dimensions in use, the algorithm, more accurately, detects intersections of boundary segments, rectangles, and n-dimensional hypercubes.
In addition to the node watcher algorithm described above, exemplary embodiments of the present invention provide resolution consistent semantic zooming algorithms which can be used in zoomable GUIs according to exemplary embodiments of the present invention. Semantic zooming refers to adding, removing or changing details of a component in a zoomable GUI depending on the magnification level of that component. For example, in the movie browser interface described below, when the user zooms close enough to the image of the movie, it changes to show movie metadata and playback controls. The calculation of the magnification level is based on the number of pixels that the component uses on the display device. The zoomable GUI can store a threshold magnification level which indicates when the switch should occur, e.g., from a view without the movie metadata and playback controls to a view with the movie metadata and playback controls.
Television and computer displays have widely varying display resolutions. Some monitors have such a high resolution that graphics and text that is readable on a low resolution display is so small to become completely unreadable. This also creates a problem for applications that use semantic zooming, especially on high resolution displays such as HDTVs. In this environment, semantic zooming code that renders based on the number of pixels displayed will change the image before the more detailed view is readable. Programmatically modifying the threshold at which semantic zooming changes component views can only work for one resolution.
The desirable result is that semantic zooming occurs consistently across all monitor resolutions. One solution is to use lower resolution display modes on high resolution monitors, so that the resolution is identical on all displays. However, the user of a high resolution monitor would prefer that graphics would be rendered at their best resolution if semantic zooming would still work as expected. Accordingly, exemplary embodiments of the present invention provide a semantic zooming technique which supports displays of all different solutions without the previously stated semantic viewing issues. This can be accomplished by, for example, creating a virtual display inside of the scene graph. This is shown in FIG. 14 by using an embedded virtual camera node 1200 and adding logic to compensate for the display resolution. The virtual camera node 1200 defines a view port whose size maps to the user's view distance and monitor size. For example, a large virtual camera view port indicates that a user is either sitting close enough to the monitor or has a large enough monitor to resolve many details. Alternately, a small view port indicates that the user is farther away from the monitor and requires larger fonts and image. The zoomable GUI code can base the semantic zooming transitions on the magnification level of components seen on this virtual camera and using the user's preferred viewing conditions.
The main camera node 1202 that is attached to the display device 1204 has its view port configured so that it displays everything that the virtual camera 1200 is showing. Since graphics images and text are not mapped to pixels until this main camera 1202, no loss of quality occurs from the virtual camera. The result of this is that high definition monitors display higher quality images and do not trigger semantic zooming changes that would make the display harder to read.
According to one exemplary embodiment of the present invention, the process works as follows. Each camera and node in the scene graph has an associated transform matrix (T1 to Tn). These matrices transform that node's local coordinate system to that of the next node towards the display. In the figure, T1 transforms coordinates from its view port to display coordinates. Likewise, T2 transforms its local coordinate system to the camera's view port. If the leaf node 1206 needs to render something on the display, it computes the following transform matrix:
A=T1T2 . . . Tn
This calculation can be performed while traversing the scene graph. Since the component changes to support semantic zooming are based on the virtual camera 1200, the following calculation is performed:
B=T4T5 . . . Tn
Typically, T1 to T3 can be determined ahead of time by querying the resolution of the monitor and inspecting the scene graph. Determining B from A is, therefore, accomplished by inverting these matrices and multiplying as follows:
B=(T1T2T3)−1A
For the case when calculating T1 to T3 ahead of time is problematic, e.g., if a graphics API hides additional transformations, logic can be added to the virtual camera to intercept the transformation matrix that it would have used to render to the display. This intercepted transformation is then inverted and multiplied as above to compute the semantic zooming threshold.
One strength of zoomable interfaces according to exemplary embodiments of the present invention is the ability to maintain context while navigating the interface. All of the interface components appear to exist in the zoomable world, and the user just needs to pan and zoom to reach any of them. The semantic zooming technique described above changes the appearance of a component depending on the zoom or magnification level. FIGS. 15(a) and 15(b) provide an example of semantic zooming for a component where the zoomed out version of the component (FIG. 15(a)) is a picture and the zoomed in version (FIG. 15(b)) includes the same picture as well as some controls and details. Some more detailed examples of this are provided below. One challenge associated with semantic zooming is that changes between views can occur abruptly, and transition techniques such as alpha blending do not provide visually pleasing results when transitioning between two such views.
Accordingly, exemplary embodiments of the present invention provide for some common image or text in all views of a component to provide a focal point for a transition effect when a semantic zoom is performed. For example, in FIGS. 15(a) and 15(b), the common element is the picture. The transition effect between the zoomed out version and the zoomed in version can be triggered using, for example, the above-described node watcher algorithm as follows. First, a registration with the node watcher can be performed to receive an event when the main camera's view port transitions from the magnification level of the zoomed out version of the component to the zoomed in version. Then, when the event occurs, an animation can be displayed which shows the common element(s) shrinking and translating from their location in the zoomed out version to their location in the zoomed in version. Meanwhile, the camera's view port continues to zoom into the component.
These capabilities of graphical user interfaces according to the present invention will become even more apparent upon review of another exemplary embodiment described below with respect to FIGS. 16-20. Therein, a startup GUI screen 1400 displays a plurality of organizing objects which operate as media group representations. The purely exemplary media group representations of home video, movies, TV, sports, radio, music and news could, of course include different, more or fewer media group representations. Upon actuation of one of these icons by a user, the GUI according to this exemplary embodiment will then display a plurality of images each grouped into a particular category or genre. For example, if the “movie” icon in FIG. 16 was actuated by a user, the GUI screen of FIG. 17 can then be displayed. Therein, a large number, e.g., 120 or more, selection objects are displayed. These selection objects can be categorized into particular group(s), e.g., action, classics, comedy, drama, family and new releases. Those skilled in the art will appreciate that more or fewer categories could be provided. In this exemplary embodiment, the media item images can be cover art associated with each movie selection. Although the size of the blocks in FIG. 17 is too small to permit detailed illustration of this relatively large group of selection item images, in implementation, the level of magnification of the images is such that the identity of the movie can be discerned by its associated image, even if some or all of the text may be too small to be easily read.
The cursor (not shown in FIG. 17) can then be disposed over a group of the movie images and the input device actuated to provide a selection indication for one of the groups. In this example the user selects the drama group and the graphical user interface then displays a zoomed version of the drama group of images as seen in FIG. 18. As with the previous embodiment, a transition effect can also be displayed as the GUI shifts from the GUI screen of FIG. 17 to the GUI screen of FIG. 18, e.g., the GUI may pan the view from the center of the GUI screen of FIG. 17 to the center of the drama group of images during or prior to the zoom. Note that although the zoomed version of the drama group of FIG. 18 only displays a subset of the total number of images in the drama group, that this zoomed version can alternatively contain all of the images in the selected group. The choice of whether or not to display all of the images in a selected group in any given zoomed in version of a GUI screen can be made based upon, for example, the number of media items in a group and a minimum desirable magnification level for a media item for a particular zoom level. This latter characteristic of GUIs according to the present invention can be predetermined by the system designer/service provider or can be user customizable via software settings in the GUI. For example, the number of media items in a group and the minimum and/or maximum magnification levels can be configurable by either or both of the service provider or the end user. Such features enable those users with, for example, poor eyesight, to increase the magnification level of media items being displayed. Conversely, users with especially keen eyesight may decrease the level of magnification, thereby increasing the number of media items displayed on a GUI screen at any one time and decrease browsing time.
One exemplary transition effect which can be employed in graphical user interfaces according to the present invention is referred to herein as the “shoe-to-detail” view effect. When actuated, this transition effect takes a zoomed out image and simultaneously shrinks and translates the zoomed out image into a smaller view, i.e., the next higher level of magnification. The transition from the magnification level used in the GUI screen of FIG. 17 to the greater magnification level used in the GUI screen of FIG. 18 results in additional details being revealed by the GUI for the images which are displayed in the zoomed in version of FIG. 18. The GUI selectively reveals or hides details at each zoom level based upon whether or not those details would display well at the currently selected zoom level. Unlike a camera zoom, which attempts to resolve details regardless of their visibility to the unaided eye, exemplary embodiments of the present invention provide for a configurable zoom level parameter that specifies a transition point between when to show the full image and when to show a version of the image with details that are withheld. The transition point can be based upon an internal resolution independent depiction of the image rather the resolution of TV/Monitor 212. In this way, GUIs according to the present invention are consistent regardless of the resolution of the display device being used in the media system.
In this exemplary embodiment, an additional amount of magnification for a particular image can be provided by passing the cursor over a particular image. This feature can be seen in FIG. 19, wherein the cursor has rolled over the image for the movie “Apollo 13”. Although not depicted in FIG. 19, such additional magnification could, for example, make more legible the quote “Houston, we have a problem” which appears on the cover art of the associated media item as compared to the corresponding image in the GUI screen of FIG. 18 which is at a lower level of magnification. User selection of this image, e.g., by depressing a button on the input device, can result in a further zoom to display the details shown in FIG. 20. This provides yet another example of semantic zooming as it was previously described since various information and control elements are present in the GUI screen of FIG. 20 that were not available in the GUI screen of FIG. 19. For example, information about the movie “Apollo 13” including, among other things, the movie's runtime, price and actor information is shown. Those skilled in the art will appreciate that other types of information could be provided here. Additionally, this GUI screen includes GUI control objects including, for example, button control objects for buying the movie, watching a trailer or returning to the previous GUI screen (which could also be accomplished by depressing the ZOOM OUT button on the input device). Hyperlinks can also be used to allow the user to jump to, for example, GUI screens associated with the related movies identified in the lower right hand corner of the GUI screen of FIG. 20 or information associated with the actors in this movie. In this example, some or all of the film titles under the heading “Filmography” can be implemented as hyperlinks which, when actuated by the user via the input device, will cause the GUI to display a GUI screen corresponding to that of FIG. 20 for the indicated movie.
A transition effect can also be employed when a user actuates a hyperlink. Since the hyperlinks may be generated at very high magnification levels, simply jumping to the linked media item may cause the user to lose track of where he or she is in the media item selection “map”. Accordingly, exemplary embodiments of the present invention provide a transition effect to aid in maintaining the user's sense of geographic position when a hyperlink is actuated. One exemplary transition effect which can be employed for this purpose is a hop transition. In an initial phase of the transition effect, the GUI zooms out and pans in the direction of the item pointed to by the hyperlink. Zooming out and panning continues until both the destination image and the origination image are viewable by the user. Using the example of FIG. 20 once again, if the user selects the hyperlink for “Saving Private Ryan”, then the first phase of the hyperlink hop effect would include zooming out and panning toward the image of “Saving Private Ryan” until both the image for “Saving Private Ryan” and “Apollo 13” were visible to the user. At this point, the transition effect has provided the user with the visual impression of being moved upwardly in an arc toward the destination image. Once the destination image is in view, the second phase of the transition effect gives the user the visual impression of zooming in and panning to, e.g., on the other half of the arc, the destination image. The hop time, i.e., the amount of time both phases one and two of this transition effect are displayed, can be fixed as between any two hyperlinked image items. Alternatively, the hop time may vary, e.g., based on the distance traveled over the GUI. For example, the hop time can be parameterized as HopTime=A log(zoomed-in scale level/hop apex scale level)+B(distance between hyperlinked media items)+C, where A, B and C are suitably selected constant values.
The node watcher algorithm described above with respect to FIGS. 9-13(b) can also be used to aid in the transition between the zoom level depicted in the exemplary GUI screen of FIG. 19 and the exemplary GUI screen of FIG. 20. The rendering of GUI screens containing text and/or control elements which are not visible in other zoom level versions of the selected image may be more computationally and/or memory intensive than the images at lower magnification levels. Accordingly, the node watcher algorithm can be used in exemplary embodiments of the present invention to aid in pre-loading of GUI screens such as that shown in FIG. 20 by watching the navigation code of the GUI to more rapidly identify the particular media item being zoomed in on.
Included in exemplary implementations of the present invention are screen-location and semantically-based navigation controls. These control regions appear when the user positions the cursor near or in a region associated with those controls on a screen where those controls are appropriate as shown in FIG. 21. For example, when playing a movie, the so-called trick functions of Fast Forward, Rewind, Pause, Stop and so on are semantically appropriate. In this exemplary embodiment, the screen region assigned to those functions is the lower right corner and when the cursor is positioned near or in that region, the set of icons for those trick functions appear. These icons then disappear when the function engaged is clearly completed or when the cursor is repositioned elsewhere on the screen. The same techniques can also be used to cover other navigational features like text search and home screen selection. In this exemplary implementation, these controls are semantically relevant on all screens and the region assigned to them is the upper right corner. When the cursor is positioned near or in that region, the set of icons for those navigational controls appear. These icons then disappear when the function is activated or the cursor is repositioned elsewhere on the screen. Note that for user training purposes, the relevant control icons may initially optionally appear briefly (e.g., 5 seconds) on some or all of the relevant screens in order to alert the inexperienced user to their presence.
Having provided some examples of zoomable graphical user interfaces according to the present invention, exemplary frameworks and infrastructures for using such interfaces will now be described. FIG. 22 provides a framework diagram wherein zoomable interfaces associated with various high level applications 1900, e.g., movies, television, music, radio and sports, are supported by primitives 1902 (referred to in the Figure as “atoms”). In this exemplary embodiment, primitives 1902 include POINT, CLICK, ZOOM, HOVER and SCROLL, although those skilled in the art will appreciate that other primitives may be included in this group as well, e.g., PAN and DRAG. As described above the POINT and CLICK primitives operate to determine cursor location and trigger an event when, for example, a user actuates the ZOOM IN or ZOOM OUT button on the handheld input device. These primitives simplify navigation and remove the need for repeated up-down-right-left button actions. As illustrated above, the ZOOM primitive provides an overview of possible selections and gives the user context when narrowing his or her choices. This concept enables the interface to scale to large numbers of media selections and arbitrary display sizes. The SCROLL primitive handles input from the scroll wheel input device on the exemplary handheld input device and can be used to, for example, accelerates linear menu navigation. The HOVER primitive dynamically enlarges the selections underneath the pointer (and/or changes the content of the selection) to enable the user to browse potential choices without committing. Each of the aforedescribed primitive operations can be actuated in GUIs according to the present invention in a number of different ways. For example, each of POINT, CLICK, HOVER, SCROLL and ZOOM can be associated with a different gesture which can be performed by a user. This gesture can be communicated to the system via the input device, whether it be a free space pointer, trackball, touchpad, etc. and translated into an actuation of the appropriate primitive. Likewise, each of the primitives can be associated with a respective voice command.
Between the lower level primitives 1902 and the upper level applications 1900 reside various software and hardware infrastructures 1904 which are involved in generating the images associated with zoomable GUIs according to the present invention. As seen in FIG. 22, such infrastructures 1904 can include a handheld input device/pointer, application program interfaces (APIs), zoomable GUI screens, developers' tools, etc.
The foregoing exemplary embodiments are purely illustrative in nature. The number of zoom levels, as well as the particular information and controls provided to the user at each level may be varied. Those skilled in the art will appreciate that the present invention provides revolutionary techniques for presenting large and small sets of media items using a zoomable interface such that a user can easily search through, browse, organize and play back media items such as movies and music. Graphical user interfaces according to the present invention organize media item selections on a virtual surface such that similar selections are grouped together. Initially, the interface presents a zoomed out view of the surface, and in most cases, the actual selections will not be visible at this level, but rather only their group names. As the user zooms progressively inward, more details are revealed concerning the media item groups or selections. At each zoom level, different controls are available so that the user can play groups of selections, individual selections, or go to another part of the virtual surface to browse other related media items. Zooming graphical user interfaces according to exemplary embodiments of the present invention can contain categories of images nested to an arbitrary depth as well as categories of categories. The media items can include content which is stored locally, broadcast by a broadcast provider, received via a direct connection from a content provider or on a peering basis. The media items can be provided in a scheduling format wherein date/time information is provided at some level of the GUI. Additionally, frameworks and GUIs according to exemplary embodiments of the present invention can also be applied to television commerce wherein the items for selection are being sold to the user.
Distributed Software Construction
There are a number of different ways to develop software usable to generate the GUI screens described above, as well as the other user interface features associated with such systems. Exemplary embodiments of the present invention provide an environment for rendering rich zoomable user interfaces (ZUIs) with reduced complexity associated with implementing and maintaining the ZUI. The terms “scene” and “brick” are used below in discussing ZUI construction according to exemplary embodiments of the present invention. A scene describes the collective set of ZUI components available to the user between navigation changes. A brick describes packaged ZUI components, e.g., software packages as simple as those used to display (and handle the functionality associated with) a button or an image or more complex, such as software packages used to generate a scene or set of scenes.
FIG. 23 illustrates an exemplary dataflow from the design of a scene or a brick to the rendering or compilation of that scene. Therein, the UI Design tool 2000 provides a visual programming environment for constructing bricks or scenes, an example of which is provided below. Typically, an artist or application developer uses the UI Design tool 2000 and saves either bricks 2002 or scenes 2004. The bricks 2002 and scenes 2004 may reference commonly used UI components stored in a brick library 2006 or multimedia resources 2008 such as bitmapped artwork, e.g., the movie covers described above as selectable media items displayed within a user interface. Within this exemplary framework, the scene loader 2010 (or toolkit back end) reads a scene file or byte stream and dynamically links in any referenced bricks 2002 or multimedia resources 2008. This results in the construction of a scene graph for either the ZSD compiler 2012 or local scene renderer 2014 to process in generating the user interface on, for example, a TV screen.
According to exemplary embodiments of the present invention, bricks and scenes can be generated using a programming language known as Scalable Vector Graphics (SVG). SVG is a language which is designed for use in describing two-dimensional graphics in Extensible Markup Language (XML). SVG is specified in the “Scalable Vector Graphics (SVG) 1.1 Specification”, promulgated by the W3C Recommendation 14 Jan. 2003, which can be found at http://www.w3.org/TR/2003/REC-SVG11-20030114/, the disclosure of which is incorporated here by reference. Among other things, SVG provides for three types of graphic objects: vector graphic shapes (e.g., paths consisting of straight lines and curves), images and text. Graphical objects can be grouped, styled, transformed and composed into previously rendered objects. The feature set includes nested transformations, clipping paths, alpha masks, filter effects and template objects. Many of the features available in SVG can be used to generate bricks and scenes for creating user interfaces, such as those described above. However, extensions to the SVG language have been developed according to exemplary embodiments of the present invention in order to provide some ZUI functionality, including the bricks constructs.
More specifically, the scene and brick descriptions support scripting using the ECMAScript language (the standardized version of JavaScript). Scripting adds, among other capabilities, scene-to-scene navigation, animation, database queries and media control to scene and brick functionality. One component of scripting support is the applications programming interface (API) used to achieve this functionality. This API is referred to herein as the ZUI Object Model (ZOM) and aspects of the ZOM are described below. One aspect of implementing a ZOM according to exemplary embodiments of the present invention involves extending the SVG programming language to include extensions to both the elements and attributes provided for in the SVG language, some examples of which are provided below for functionality associated with bricks and scenes. Therein, element names or attributes names denoted in the form “zui:name” identify element or attribute extensions to SVG.
The zui:brick tag inserts another ZML/SVG file into the scene at the specified location. A new variable context is created for the brick and the user is permitted to pass variables into the scene using child zui: variable tags. This feature of modified SVG according to an exemplary embodiment of the present invention provides a flexible programming element for use in zoomable interfaces characterized by its parameterized graphic nature which is reusable (cascades) across multiple scenes in the zoomable user interface. A detailed example of a brick implementation is provided below with reference to FIGS. 24-26 and corresponding software code.
TABLE 2
|
|
<zui:brick> Tag Attributes
Possible
AttributeValuesDescription
|
idAlpha-numericDefault assigned by toolkit; User-defined
stringvalues can make JavaScript clearer
widthIntegerThe width of the node in pixels.
heightIntegerThe height of the node in pixels.
transformStringUse “scripts/Transform.js” to read and
representation ofwrite transform attribute. The value stored
the transformby the node is the “matrix” attribute in
matrixTransform.js; the “array” is the in-memory
array that you'll want to manipulate. For
example, this attribute can define how the
brick is placed relative to the parent
(offset, scale, rotation).
pointer-none | zui:allUse none to disable mouse events; Use
eventszui:all to enable; Hidden/invisible
elements will receive pointer events
unless this attribute is set to none
visibilityhidden | visibleUse hidden to hide an element; Use visible
to make it visible; Visibility has no
effect on user input
xlink:hrefURLThe URL of the ZML/SVG file to load as
a brick
|
This extension to SVG is used to specify that the system should place a scene as a child of the current scene.
TABLE 3
|
|
<zui:scene> Tag Attributes
Possible
AttributeValuesDescription
|
idAlpha-numericDefault assigned by toolkit; User-defined
stringvalues can make JavaScript clearer
xIntegerHorizontal position of the upper left
corner of the scene placement bounds
relative to the upper left corner of
the screen.
yIntegerVertical position of the upper left corner
of the scene placement bounds relative
to the upper left corner of the screen.
widthIntegerThe width of the placement in pixels.
heightIntegerThe height of the placement in pixels.
xlink:hrefURLThe URL of the scene to load.
|
This extension to SVG sets up scene swap transition effects for scene transitions.
TABLE 4
|
|
<zui:scene-swap> Tag Attributes
Possible
AttributeValuesDescription
|
idAlpha-Default assigned by toolkit; User-defined
numericvalues can make JavaScript clearer
string
coverAlpha-The id of the cover to use for the scene
numericswap effect. This should be an id of an
stringelement which is in the same svg.
startIntegerThe start time when the swap should start
in terms of the percent of the transition
duration. 0 is an instant swap, and 100
is a swap at the very end of the transition.
endIntegerThe end time when the swap should finish
in terms of the percent of the transition
duration. Should always be greater than or
equal to start.
inheritsAlpha-The element id that this camera transition
numericshould inherit behavior from. If null, use
stringdefault values, if set, use the values set
in the other element as the defaults for
this transition.
|
This extension to SVG sets the specified variable in the current scope to the specified value. Variable scopes are automatically created by zui:scene and zui:brick tags.
TABLE 5
|
|
<zui:variable> Tag Attributes
Possible
AttributeValuesDescription
|
idAlpha-numericDefault assigned by toolkit; User-defined
stringvalues can make JavaScript clearer
nameStringThe name of the variable to set.
valueVariableThe value of the variable to set.
|
The use of the afore-described extensions to SVG to provide programming constructs which are particularly useful in generating zoomable user interfaces, e.g., for televisions, will be better understood by considering a purely illustrative example provided below with respect to FIGS. 24-26. FIG. 24 depicts a first zoomable display level of an exemplary user interface associated with music selections. Therein, a GUI screen displays six groups (music shelves) each of which contains 25 selectable music items grouped by category (5×5 music cover art images). Each group is implemented as a brick which includes a title hover effect, e.g., as shown in FIG. 24 the user's cursor (not shown) is positioned over the group entitled “Rock & Pop” such that the title of that group and the elements of that group are slightly magnified relative to the other five groups shown on this GUI screen. To generate this GUI screen, the software code associated with this brick is passed a variable named “music” which is a query to the user's music collection with the genre of Rock sorted by title, as illustrated by the highlighted portion of the exemplary software code below.
|
|
<?xml version=“1.0” encoding=“UTF-8” standalone=“no” ?>
<!DOCTYPE svg PUBLIC “-//W3C//DTD SVG 1.1//EN”
“http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd”>
<svg height=“720” id=“svg” onload=“music_shelf_system_onload(evt)” width=“1280”
xmlns=“http://www.w3.org/2000/svg” xmlns:xlink=“http://www.w3.org/1999/xlink”
xmlns:zi=“http://ns.hcrest.com/ZUIIllustratorExtensions/1.0”
xmlns:zui=“http://ns.hcrest.com/ZUIExtensions/1.0” zui:top=“true”>
<script language=“javascript” xlink:href=“./music_shelf.js”/>
<g id=“bkgd”>
<image height=“720” id=“musicbkgd” preserveAspectRatio=“xMidYMid meet”
transform=“matrix(1.000, 0.000, 0.000, 1.000, 1, 0)” width=“1280”
xlink:href=“../background/hdtv/music_hdtv.png” zui:layer=“background”/>
<text fill=“#ffffff” font-family=“HelveticaNeue LT 87 Heavy Condensed” font-
size=“38” id=“glob_121” transform=“matrix(0.984, 0.000, 0.000, 1.000, 16, 0)” x=“1020”
y=“103”>
<![CDATA[All Music]]>
</text>
<zui:brick height=“306” id=“svg_123” transform=“matrix(0.660, 0.000, 0.000, 0.669,
245, 129)” width=“262” xlink:href=“./brick_shelf.svg” zi:cursorControl=“true”>
<zui:variable id=“var_0” name=“music” value=“com.hcrest.music.mds:albums(genres
contains ‘Rock & Pop’, @sort=‘title’)”/>
</zui:brick>
<zui:brick height=“306” id=“glob_124” transform=“matrix(0.660, 0.000, 0.000,
0.669, 522, 129)” width=“262” xlink:href=“./brick_shelf.svg”>
<zui:variable id=“var_26” name=“music” value=“com.hcrest.music.mds:albums(genres
contains ‘Jazz Vocal’, @sort=‘title’)”/>
</zui:brick>
<zui:brick height=“306” id=“glob_170” transform=“matrix(0.660, 0.000, 0.000,
0.669, 245, 391)” width=“262” xlink:href=“./brick_shelf.svg”>
<zui:variable id=“var_78” name=“music” value=“com.hcrest.music.mds:albums(genres
contains ‘International’, @sort=‘title’)”/>
</zui:brick>
<zui:brick height=“306” id=“glob_169” transform=“matrix(0.660, 0.000, 0.000,
0.669, 522, 391)” width=“262” xlink:href=“./brick_shelf.svg”>
<zui:variable id=“var_104” name=“music”
value=“com.hcrest.music.mds:albums(genres contains ‘Blues’, @sort=‘title’)”/>
</zui:brick>
<zui:brick height=“306” id=“glob_168” transform=“matrix(0.660, 0.000, 0.000,
0.669, 799, 391)” width=“262” xlink:href=“./brick_shelf.svg”>
<zui:variable id=“var_130” name=“music”
value=“com.hcrest.music.mds:albums(genres contains ‘Country’, @sort=‘title’)”/>
</zui:brick>
<zui:brick height=“365” id=“svg_0” transform=“matrix(0.660, 0.000, 0.000, 0.660,
799, 127)” width=“350” xlink:href=“./brick_shelf_soundtrackv2.svg”>
<zui:variable id=“var_51” name=“music” value=“com.hcrest.music.mds:albums(genres
contains ‘Soundtracks’, @sort=‘title’)”/>
</zui:brick>
</g>
<g id=“Layer_3”>
<zui:brick height=“720” id=“playlistBrick” transform=“matrix(1.000, 0.000, 0.000,
1.000, 0, −56)” width=“1280” xlink:href=“../playlistBrick/playlist_brick.svg”
zui:layer=“playlistOverlay”>
<zui:variable id=“var_156” name=“playlistGroup” value=“‘music’”/>
<zui:variable id=“var_157” name=“playlistType” value=“‘music’”/>
<zui:variable id=“var_158” name=“cover_art_field” value=“‘album.image.uri’”/>
<zui:variable id=“var_159” name=“title_field” value=“‘title’”/>
<zui:variable id=“var_160” name=“watch_uri_field” value=“‘uri’”/>
</zui:brick>
<g id=“screw_you_button_6_state_andjoe”>
<g id=“new_slideshow”>
<image height=“67” id=“new_slideshow_on” preserveAspectRatio=“xMidYMid meet”
transform=“matrix(0.342, 0.000, 0.000, 1.221, 1071, 376)” width=“257”
xlink:href=“../playlistBrick/images/create_playlist_normal_over.png”/>
<image height=“65” id=“new_slideshow_off” preserveAspectRatio=“xMidYMid meet”
transform=“matrix(0.342, 0.000, 0.000, 1.221, 1071, 377)” width=“257”
xlink:href=“../playlistBrick/images/create_playlist_normal.png”/>
</g>
</g>
<g id=“createplaylist” zi:p6Base=“createplaylist-off” zi:p6Down=“createplaylist-
down” zi:p6Label=“true” zi:p6Over=“createplaylist-over” zi:p6Sel=“createplaylist-sel”
zi:p6SelDown=“createplaylist-sel_down” zi:p6SelOver=“createplaylist-sel_over”>
<image height=“226” id=“createplaylist-sel_down” preserveAspectRatio=“xMidYMid
meet” transform=“matrix(0.734, 0.000, 0.000, 0.734, 1081, 463)” visibility=“hidden”
width=“124” xlink:href=“./images/createplaylist-over.png”/>
<image height=“226” id=“createplaylist-sel_over” preserveAspectRatio=“xMidYMid
meet” transform=“matrix(0.734, 0.000, 0.000, 0.734, 1081, 463)” visibility=“hidden”
width=“124” xlink:href=“./images/createplaylist-over.png”/>
<image height=“226” id=“createplaylist-sel” preserveAspectRatio=“xMidYMid meet”
transform=“matrix(0.734, 0.000, 0.000, 0.734, 1081, 463)” visibility=“hidden”
width=“124” xlink:href=“./images/createplaylist-off.png”/>
<image height=“226” id=“createplaylist-down” preserveAspectRatio=“xMidYMid meet”
transform=“matrix(0.734, 0.000, 0.000, 0.734, 1081, 463)” visibility=“hidden”
width=“124” xlink:href=“./images/createplaylist-over.png”/>
<image height=“226” id=“createplaylist-over” preserveAspectRatio=“xMidYMid meet”
transform=“matrix(0.734, 0.000, 0.000, 0.734, 1081, 463)” visibility=“hidden”
width=“124” xlink:href=“./images/createplaylist-over.png”/>
<image height=“226” id=“createplaylist-off” preserveAspectRatio=“xMidYMid meet”
transform=“matrix(0.734, 0.000, 0.000, 0.734, 1081, 463)” width=“124”
xlink:href=“./images/createplaylist-off.png”/>
</g>
</g>
</svg>
|
Each element (cover art image) within each group is also coded as a brick according to exemplary embodiments of the present invention. Thus, as shown in FIG. 25, when a user pauses a cursor over one of the 25 elements within the “Rock&Pop” group, this causes that element (in this example an image of an album cover “Parachutes”) to be magnified. Exemplary brick code for implementing this GUI screen is provided below.
|
|
<?xml version=“1.0” encoding=“UTF-8” standalone=“no” ?>
<!DOCTYPE svg PUBLIC “-//W3C//DTD SVG 1.1//EN” “http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd”>
<svg height=“365” onload=“brick_shelf_system_onload(evt)” width=“350” xmlns=“http://www.w3.org/2000/svg”
xmlns:xlink=“http://www.w3.org/1999/xlink” xmlns:zi=“http://ns.hcrest.com/ZUIIllustratorExtensions/1.0”
xmlns:zui=“http://ns.hcrest.com/ZUIExtensions/1.0”>
<script language=“javascript” xlink:href=“./brick_shelf.js”/>
<g id=“Layer_1”>
<zui:brick height=“46” id=“svg24” transform=“matrix(1.305, 0.000, 0.000, 1.239, 277, 290)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_0” name=“this” value=“music[24]”/>
</zui:brick>
<zui:brick height=“46” id=“svg23” transform=“matrix(1.305, 0.000, 0.000, 1.239, 210, 290)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_1” name=“this” value=“music[23]”/>
</zui:brick>
<zui:brick height=“46” id=“svg22” transform=“matrix(1.305, 0.000, 0.000, 1.239, 144, 290)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_2” name=“this” value=“music[22]”/>
</zui:brick>
<zui:brick height=“46” id=“svg21” transform=“matrix(1.305, 0.000, 0.000, 1.239, 77, 290)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_3” name=“this” value=“music[21]”/>
</zui:brick>
<zui:brick height=“46” id=“svg20” transform=“matrix(1.305, 0.000, 0.000, 1.239, 11, 290)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_4” name=“this” value=“music[20]”/>
</zui:brick>
<zui:brick height=“46” id=“svg19” transform=“matrix(1.305, 0.000, 0.000, 1.239, 278, 228)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_5” name=“this” value=“music[19]”/>
</zui:brick>
<zui:brick height=“46” id=“svg18” transform=“matrix(1.305, 0.000, 0.000, 1.239, 210, 228)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_6” name=“this” value=“music[18]”/>
</zui:brick>
<zui:brick height=“46.” id=“svg17” transform=“matrix(1.305, 0.000, 0.000, 1.239, 144, 228)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_7” name=“this” value=“music[17]”/>
</zui:brick>
<zui:brick height=“46” id=“svg16” transform=“matrix(1.305, 0.000, 0.000, 1.239, 77, 228)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_8” name=“this” value=“music[16]”/>
</zui:brick>
<zui:brick height=“46” id=“svg15” transform=“matrix(1.305, 0.000, 0.000, 1.239, 11, 228)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_9” name=“this” value=“music[15]”/>
</zui:brick>
<zui:brick height=“46” id=“svg14” transform=“matrix(1.305, 0.000, 0.000, 1.239, 278, 165)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_10” name=“this” value=“music[14]”/>
</zui:brick>
<zui:brick height=“46” id=“svg13” transform=“matrix(1.305, 0.000, 0.000, 1.239, 210, 165)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_11” name=“this” value=“music[13]”/>
</zui:brick>
<zui:brick height=“46” id=“svg12” transform=“matrix(1.305, 0.000, 0.000, 1.239, 144, 165)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_12” name=“this” value=“music[12]”/>
</zui:brick>
<zui:brick height=“46” id=“svg11” transform=“matrix(1.305, 0.000, 0.000, 1.239, 77, 165)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_13” name=“this” value=“music[11]”/>
</zui:brick>
<zui:brick height=“46” id=“svg10” transform=“matrix(1.305, 0.000, 0.000, 1.239, 11, 165)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_14” name=“this” value=“music[10]”/>
</zui:brick>
<zui:brick height=“46” id=“svg9” transform=“matrix(1.305, 0.000, 0.000, 1.239, 278, 101)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_15” name=“this” value=“music[9]”/>
</zui:brick>
<zui:brick height=“46” id=“svg8” transform=“matrix(1.305, 0.000, 0.000, 1.239, 210, 101)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_16” name=“this” value=“music[8]”/>
</zui:brick>
<zui:brick height=“46” id=“svg7” transform=“matrix(1.305, 0.000, 0.000, 1.239, 144, 101)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_17” name=“this” value=“music[7]”/>
</zui:brick>
<zui:brick height=“46” id=“svg6” transform=“matrix(1.305, 0.000, 0.000, 1.239, 77, 101)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_18” name=“this” value=“music[6]”/>
</zui:brick>
<zui:brick height=“46” id=“svg5” transform=“matrix(1.305, 0.000, 0.000, 1.239, 11, 101)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_19” name=“this” value=“music[5]”/>
</zui:brick>
<zui:brick height=“46” id=“svg4” transform=“matrix(1.305, 0.000, 0.000, 1.239, 278, 36)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_20” name=“this” value=“music[4]”/>
</zui:brick>
<zui:brick height=“46” id=“svg3” transform=“matrix(1.305, 0.000, 0.000, 1.239, 210, 36)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_21” name=“this” value=“music[3]”/>
</zui:brick>
<zui:brick height=“46” id=“svg2” transform=“matrix(1.305, 0.000, 0.000, 1.239, 144, 36)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_22” name=“this” value=“music[2]”/>
</zui:brick>
<zui:brick height=“46” id=“svg1” transform=“matrix(1.305, 0.000, 0.000, 1.239, 77, 36)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_23” name=“this” value=“music[1]”/>
</zui:brick>
<zui:brick height=“46” id=“svg0” transform=“matrix(1.305, 0.000, 0.000, 1.239, 11, 36)” width=“47”
xlink:href=“./albumCoverEffect.svg”>
<zui:variable id=“var_24” name=“this” value=“music[0]”/>
</zui:brick>
<g id=“more” visibility=“hidden” zi:p6Base=“more-off” zi:p6Down=“more-down” zi:p6Label=“true” zi:p6Over=“more-over”
zi:p6Sel=“more-sel” zi:p6SelDown=“more-sel_down” zi:p6SelOver=“more-sel_over”>
<image height=“84” id=“more-sel_down” preserveAspectRatio=“xMidYMid meet” transform=“matrix(0.274, 0.000, 0.000,
0.274, 281, 9)” visibility=“hidden” width=“213” xlink:href=“../movielink/images/homescreen/more-over.png”/>
<image height=“84” id=“more-sel_over” preserveAspectRatio=“xMidYMid meet” transform=“matrix(0.274, 0.000, 0.000,
0.274, 281, 9)” visibility=“hidden” width=“213” xlink:href=“../movielink/images/homescreen/more-over.png”/>
<image height=“84” id=“more-sel” preserveAspectRatio=“xMidYMid meet” transform=“matrix(0.274, 0.000, 0.000, 0.274,
281, 9)” visibility=“hidden” width=“213” xlink:href=“../movielink/images/homescreen/more-off.png”/>
<image height=“84” id=“more-down” preserveAspectRatio=“xMidYMid meet” transform=“matrix(0.274, 0.000, 0.000,
0.274, 281, 9)” visibility=“hidden” width=“213” xlink:href=“../movielink/images/homescreen/more-over.png”/>
<image height=“84” id=“more-over” preserveAspectRatio=“xMidYMid meet” transform=“matrix(0.274, 0.000, 0.000, 0.274,
281, 9)” visibility=“hidden” width=“213” xlink:href=“../movielink/images/homescreen/more-over.png”/>
<image height=“84” id=“more-off” preserveAspectRatio=“xMidYMid meet” transform=“matrix(0.274, 0.000, 0.000, 0.274,
281, 9)” width=“213” xlink:href=“../movielink/images/homescreen/more-off.png”/>
</g>
<zui:text-rect fill=“#ffffff” font-family=“HelveticaNeue LT 67 Medium Condensed” font-size=“24” height=“23” id=“genre”
pointer-events=“none” width=“235” x=“10” y=“9” zui:metadata=“music[0].genres[0]” zui:text-allcaps=“original” zui:text-
justification=“left”>
<![CDATA[Genre]]>
</zui:text-rect>
<view id=“top” viewBox=“(−71, −30, 493, 302)” zui:transition=“hcrest_view”/>
<a id=”top_bounds” xlink:href=”#top”>
<rect height=“302” id=“top_rect_1” width=“493” x=“−71” y=“−30”/>
</a>
<view id=“bottom” viewBox=“(−71, 97, 493, 302)” zui:transition=“hcrest_view”/>
<a id=“bottom_bounds” xlink:href=“#bottom”>
<rect height=“302” id=“bottom_rect_1” width=“493” x=“−71” y=“97”/>
</a>
<rect height=“188” id=“autopan_up” stroke=“#ff0000” visibility=“hidden” width=“399” x=“−24” y=“−23”/>
<rect height=“167” id=“autopan_down” stroke=“#00ff00” visibility=“hidden” width=“399” x=“−24” y=“222”/>
</g>
<zui:scene height=“48” id=“trans_xx_25” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_26” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_27” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_28” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_29” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_30” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_31” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_32” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_33” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_34” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_35” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_36” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_37” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_38” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
<zui:scene height=“48” id=“trans_xx_39” width=“47” x=“8” xlink:href=“music_detail.svg” y=“37”/>
</svg>
|
Note that the bolded code in the example above refers to the 25th element of the variable music which was set up in the parent SVG brick (music_shelf.svg). The prior music query returns up to 25 elements. Then the music element (in this example an album) is passed into the child brick called albumCoverEffect.svg using a variable named “this”. The two code snippets above, and corresponding GUI screens (scenes) of FIGS. 24 and 25, serve to illustrate two beneficial characteristics associated with the reusable extensions to SVG according to exemplary embodiments of the present invention, described herein for use in generating zoomable graphical user interfaces. First, SVG bricks provide a programming construct which provides code that is reusable from GUI screen to GUI screen (scene to scene). In this context, the brick code used to generate the GUI screen of FIG. 24 is reused to generate the GUI screen of FIG. 25. Additionally, the bricks are parameterized in the sense that at least some of the graphical display content which they generate is drawn from metadata, which may change over time. This means that the same program code can be used to generate user interfaces to select, e.g., on demand movies, as those movies change over time and that the content of the user interface portrayed on any given zoom level of an interface according to the present invention may also change accordingly over time.
The brick code itself can be generated using, for example, a visual programming interface, an example of which is illustrated in FIG. 26, wherein a music element 2600 (album cover image brick) is being coded. Some exemplary code associated with this toolkit function is provided below.
|
|
<?xml version=“1.0” encoding=“UTF-8” standalone=“no” ?>
<!DOCTYPE svg PUBLIC “-//W3C//DTD SVG 1.1//EN”
“http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd”>
<svg height=“46” onload=“albumCoverEffect_system_onload(evt)” width=“47”
xmlns=“http://www.w3.org/2000/svg” xmlns:xlink=“http://www.w3.org/1999/xlink”
xmlns:zi=“http://ns.hcrest.com/ZUIIllustratorExtensions/1.0”
xmlns:zui=“http://ns.hcrest.com/ZUIExtensions/1.0”>
<script language=“javascript” xlink:href=“./albumCoverEffect.js”/>
<g id=“layer”>
<a id=“anchor_0” xlink:href=“zuichild:trans_0”>
<g id=“cover”>
<image height=“150.00” id=“image” preserveAspectRatio=“xMidYMid meet”
transform=“matrix(0.313, 0.000, 0.000, 0.307, 0.000, −0.050)” width=“150.00”
xlink:href=“../placeholders/cdcover.png” zui:metadata=“this.image.uri”/>
<g id=“title”>
<rect fill=“#000000” height=“15” id=“rect_0” width=“47” x=“0” y=“31”/>
<zui:text-rect fill=“#ffffff” font-family=“HelveticaNeue LT 67 Medium
Condensed” font-size=“6” height=“14” id=“textrect_0” width=“45” x=“1” y=“32”
zui:metadata=“this.title” zui:text-allcaps=“original” zui:text-justification=“left”>
<![CDATA[album title
line two]]>
</zui:text-rect>
</g>
</g>
</a>
</g>
<zui:scene height=“46” id=“trans_0” transition=“trans_0_transition” width=“47” x=“0”
xlink:href=“music_detail.svg” y=“0”>
<zui:variable name=“this” value=“this” usage=“musicDetail” />
</zui:scene>
<zui:transition id=“trans_0_transition” inherits=“hcrest_placement_swap_effect”>
<zui:scene-swap cover=“cover”/>
</zui:transition>
</svg>
Also see albumCoverAffect.js This file is a companion file to the SVG. The javascript is what actually creates the title hover
effect.
document.include(“../scripts/Hoverzoom.js”);
document.include(“../scripts/Cursor.js”);
function albumCoverEffect_user_onload_pre(evt) {
createCursorController(document.getElementById(“cover”));
createHoverzoomTitleEffect(document.getElementById(“cover”),
0.400000,
250.000000,
document.getElementById(“title”));
}
// @Toolkit-begin (pseudo-tag for Toolkit-generated code) //
////////////////////////////////////////////////////////////
!!! The prior function albumCoverEffect_user_onload_pre is what actually creates the
title hover effect.
/**
* AUTO GENERATED CODE : DO NOT EDIT
*/
function albumCoverEffect_system_onload(evt) {
if (“albumCoverEffect_user_onload_pre” in this) {
albumCoverEffect_user_onload_pre(evt) ;
}
if (“albumCoverEffect_user_onload_post” in this) {
albumCoverEffect_user_onload_post(evt) ;
}
}
// @Toolkit-end (pseudo-tag for Toolkit-generated code) //
//////////////////////////////////////////////////////////
|
In the bolded portion of the above software code example, there is an element called “cover”. The cover element is the image metadata associated with the album cover to be portrayed by this brick at a particular location on the GUI screen. Also note therein the program line that says “zui:metadata=‘this.image.uri’”. This was setup in the first code example (parent SVG) which is the album of interest, i.e., the album is passed into this brick and the associated cover art is referenced by this variable.
While the foregoing exemplary embodiment describes bricks in the context of their usage as a user interface building block based on an extension of the SVG programming language, bricks can be employed more generically as system building blocks which facilitate distributed software design. Consider, for example, the system illustrated in FIG. 27. Therein, a software system 2700 provides a complete content delivery framework for control and interaction between metadata 2702 (e.g., data associated with movies, shopping, music, etc.) and end-user devices such as a television 2704 and a remote control device 2706. More generally, metadata is information about a particular data set which may describe, for example, one or more of how, when, and by whom other data was received, created, accessed, and/or modified and how the other data is formatted, the content, quality, condition, history, and other characteristics of the other data. Bricks are created by brick engines based on pre-defined brick models as reusable software constructs which, in the exemplary system of FIG. 27, embody all of the relevant logic above the framework level which applies to a particular application associated with the system. To modularize this logic, different levels bricks can be developed, e.g., application, applet, semantic and elemental, as shown in FIG. 28. Each of these different types of bricks will now be described in more detail, along with some examples.
At a highest level is an application brick. In the system example of FIG. 27, an application corresponds to a metadata type, e.g., a music application for delivering music to an end user, a movie application for delivering on-demand movies to an end-user, etc. The application movie brick provides an entry hierarchy which allows users to browse/search/find movie metadata which acts as a mini-application that describes the full interaction between the end user and movie metadata. Similarly, the movie application brick describes the full interaction between the end user and music metadata. Thus, an application brick is essentially the definition of a distributed class associated with a particular type of metadata, for the exemplary system of FIG. 27, and provides a specific mechanism for identifying and partitioning the relevant source metadata 2702. Once an application brick is generated, it can be reused by creating a separate instance of that application brick which is customized by passing in new parameters. For example, after a movie application brick is created for handling, among other things, metadata parsing, generation of a user interface, and user requests, for movies provided on demand by CinemaNow, another instance of that brick can be used to handle the provision of movies by another provider (e.g., Movielink) by passing different parameters into another instance of that brick. An application brick can thus be considered as a self-contained, system wide construct that fully manipulates a top-level metadata category. Each of the different functional icons illustrated in FIG. 16 can be associated with a different application brick.
Descending one level in the layers of FIG. 28, an application brick will be composed of several applet bricks. Applet bricks are self-contained, system-wide software constructs which either fully manipulate a second-level metadata category or fully express a metadata specific function. In this context, second-level metadata refers to the types of metadata available within the context of the high level metadata domain, e.g., for a high level metadata of movies, second-level metadata can include movie titles, stars, runtime, etc. A metadata specific function refers to a function which is tied to a particular high level metadata, e.g., browse/play for a movie or browse/put into a shopping cart for shopping metadata For example, a navigation screen full of bookshelves associated with a particular application may be defined using a bookshelf navigation applet brick. This navigation applet brick maps all of the relevant metadata organized in a manner which is appropriate for its higher level application brick. For example, all of the offerings provided by a particular movie provider can be depicted as a layout of bookshelves in accordance with the available metadata as defined in a movie navigation applet brick. Another instance of the same movie navigation applet brick can be used to generate a similar user interface screen, and handle interactions, for offerings provided by a different movie provider. The applet bricks provide a linkage between the relevant metadata (as previously organized by the application brick), and a scene layout for the user interface to control various aspects of the interface, e.g., the bookshelf dimensions, cover art dimensions, etc. The applet brick can also control functional interactions between a user and the system at this level, e.g., the manner in which the bookshelf reacts to a cursor being paused over its display region (see, e.g., FIG. 24).
Each applet brick can be composed of several semantic bricks, which are intended to operate as self-contained system-wide constructs that fully encapsulate a particular semantic interaction associated with the system. For example, whereas an applet brick may be associated with a particular metadata ontology, e.g., for a navigation bookshelf user interface screen such as that of FIG. 24, a semantic brick may do the same for a specific bookshelf, e.g., that shown in FIG. 25. Thus semantic brick may include details of item (e.g., cover art image) sizing, cover art details, semantic hover details (i.e., how to generate a hoverzoom when a user pauses a cursor over a particular cover art image to generate the result shown in FIG. 25), title details, etc.
Consider the following example of a semantic brick. Specifically, consider that a semantic brick has been instantiated by a brick engine to display information about a particular person (e.g., an actor in a movie which can be selected using an interface). This semantic brick displays to the user of the system the following information: name, birthdate, a short biography, and relevant work, e.g., the movies that he or she starred in, which are attributes of this semantic brick. The biography also contains a scrollable text box (which can be created using the lowest order, elemental brick referred to in FIG. 28). This semantic brick can be reused for any generic metadata type that supports the attributes described above. Also note that this semantic brick may show thumbnail images for the relevant work. However the semantic brick could further define the functionality that it would pre-cache a larger image associated with each thumbnail in case the user clicks on the thumbnail to go to that view, so that latency is reduced to get to that scene. This can be viewed as analogous to an OO class in that the “person” class has different instantiations depending on whether the creator is a musician, musical group, actor, director, or author. However this semantic brick may only need to show the cover art for the relevant work and so any type of generic metadata that supports name, birthdate, short bio, and cover art can reuse this brick. In the case where there is a relevant work but cover art is not available to represent that work, the brick could be structured to instead show a placeholder image on the user interface when called. In fact, a different type of placeholder image could be employed depending on the metadata type (e.g., looks like a movie reel or a book). This illustrates the error handling capability of the brick.
As mentioned above, elemental bricks are self-contained, system-wide constructs that encapsulate primitive interactions. Examples of elemental bricks are text boxes, buttons, a picture, a scroll list, etc.
The above-described exemplary embodiments are intended to be illustrative in all respects, rather than restrictive, of the present invention. Thus the present invention is capable of many variations in detailed implementation that can be derived from the description contained herein by a person skilled in the art. All such variations and modifications are considered to be within the scope and spirit of the present invention as defined by the following claims. No element, act, or instruction used in the description of the present application should be construed as critical or essential to the invention unless explicitly described as such. Also, as used herein, the article “a” is intended to include one or more items.