This invention relates to focus navigation and, in particular, to directional focus navigation.
A graphical user interface (GUI) may include multiple elements that can receive input. The target of the input is said to have focus, and elements eligible for focus within a document are called focusable elements. In a GUI, input that is received is processed according to an element that currently has focus. For example, if a GUI contains two editable text boxes, when a user enters text through a keyboard, the text is rendered in the text box that currently has focus.
In a GUI that receives input from a pointing device, such as a mouse, the currently focused element is determined based on the location of a cursor or pointer when a user submits a select command (e.g., clicks a mouse button). If the cursor or pointer is over a text box when the user clicks the mouse button, then that text box is given focus and future input, such as through a keyboard, is directed to the text box until the focus is moved to another element.
In a GUI that receives input from devices without pointing mechanisms, such as keyboards or remote controls, an alternative mechanism is implemented to allow a user to specify an element to be given focus, and thus, the target of keyboard (or other device) input.
Browser or other rendering engine applications that provide a mechanism for displaying documents that include focusable elements (e.g., hypertext markup language (HTML) documents, dynamic HTML documents, or web pages) are implemented to support navigation between focusable elements in a document by way of a mouse or other similar pointing device. Some such documents also support navigation through the focusable elements of the document according to a tab order that defines a sequential order that determines which element is focused next when a user presses a tab key.
Recently introduced technology allows web pages (and potentially other types of documents that may include focusable elements) to be rendered using devices other than computers. These other devices may include, but are not limited to, a television set, a cellular phone display, and a multi-modal device, such as the NOKIA 9110 COMMUNICATOR. Such devices do not typically include a pointer input device, such as a mouse.
Rendering engines implemented on such devices may be programmed to receive an input that behaves like the press of a tab key to allow a user to move sequentially from one focusable element in a document to another. However, cascading style sheets, dynamic HTML, and the like support dynamic rendering of documents in such a way that the visual layout of a rendered document may differ from the layout that was intended when the document was generated and possibly, when the tab order of focusable elements in the document was established. Accordingly, moving from one focusable element to another either sequentially or based on an established tab order may not result in visually logical focus navigation, and it is desirable for a rendering engine to support directional focus navigation based on directional inputs, such as an arrow key press.
Directional focus navigation is described. A browser or other rendering engine that supports directional focus navigation determines focusable elements that are candidates to receive focus based on a directional input and a display layout of the focusable elements. After candidate focusable elements are identified, the rendering engine determines the most appropriate candidate, and moves the focus to that element.
The same numbers are used throughout the drawings to reference like features and components.
a illustrates a portion of an exemplary method for implementing directional focus navigation.
b illustrates a portion of an exemplary method for implementing directional focus navigation.
a illustrates a view of a scrollable document with several focusable elements.
b illustrates a view of a scrollable document with several focusable elements.
c illustrates a view of a scrollable document with several focusable elements.
a illustrates a view of a parent document that includes a scrollable element.
b illustrates a view of a parent document that includes a scrollable element.
The following discussion is directed to directional focus navigation among IS focusable elements of a document. Directional focus navigation as described herein may be implemented using a browser or other interface or rendering engine that renders documents (e.g., HTML or dynamic HTML) that may contain focusable elements to provide a mechanism for navigating among those focusable elements using directional inputs (e.g., left, right, up, and down arrow keys) instead of a traditional pointing device such as a mouse. For example, directional focus navigation may be implemented to support web site navigation using a television remote control, when an HTML rendering engine is implemented as part of a television or television set-top box. In another example, directional focus navigation may be implemented to support web site navigation using a cellular telephone or multi-modal device.
Method for Directional Focus Navigation
Directional focus navigation may be described in the general context of computer-executable instructions, such as application modules, being executed by a computer. Generally, application modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. Directional focus navigation may be implemented using any number of programming techniques and may be implemented in local computing environments or in distributed computing environments where tasks are performed by remote processing devices that are linked through various communications networks based on any number of communication protocols. In such a distributed computing environment, application modules may be located in both local and remote computer storage media including memory storage devices.
a and 1b illustrate a method 100 for implementing directional focus navigation among focusable elements and/or focusable areas of a rendered document.
Documents that may be rendered as described herein can be represented by a hierarchical tree structure that illustrates the relationships between elements of the document. For example, the document itself may be represented by an element (e.g., an HTML document element) that contains the other elements of the document, such as buttons, hyperlinks, tables, and so on. Per the hierarchical structure, elements within the document element may also contain other elements. For example, a table element may contain cell elements. When a document is first rendered, the focus is given to some element associated with the document. That element may be the document element, or it may be any other focusable element contained within the document element
At block 102, a browser (or other type of rendering application) receives an indication of a directional input. For example, the indication is received when a user presses an arrow key (e.g., up, down, left, or right) on a keyboard, remote control, phone keypad, or other such device.
At block 104 the browser determines whether or not the current focus is on an element associated with the currently rendered document. If the browser determines that the current focus is not on an element associated with the document (the “No” branch from block 104), then at block 106 the browser identifies the first focusable element. The method continues at block 108, setting the focus to the identified first focusable element. In the described implementation, the first focusable element of a document is a focusable element that is a child of the element representing the entire document that is displayed above and to the left of any other focusable elements that are children of the element representing the entire document.
If the browser determines that there is a currently focused element associated with the document (the “Yes” branch from block 104), then at block 110, the browser identifies the element that currently has the focus.
At block 112 the browser determines whether a focus candidate exists. In the described implementation, a focus candidate is a focusable element that is within the viewable area of the document and displayed at a location that is indicated by the received directional input in relation to the display location of the currently focused element. If it is determined that there is not a focus candidate associated with the current document (the “No” branch from block 112), then the method continues at block 122 (shown in
At block 114 the browser selects a focus candidate that is associated with the current document.
At block 116 the browser determines whether or not the selected element is the most appropriate element to receive the focus based on the display location of the selected element in relation to the display location of the element that currently has the focus. If the selected element is an appropriate element to receive the focus based on its display location, and no other focusable element has been identified as an appropriate element, then the selected element is identified as the most appropriate element thus far. If the selected element is an appropriate element to receive the focus based on its display location, and another focusable element has previously been identified as the most appropriate element thus far, then the browser compares the locations of the two identified appropriate elements to determine which one is the most appropriate element thus far. If the browser determines that the selected element is the most appropriate element to receive the focus (the “Yes” branch from block 116), then the method continues at block 118. On the other hand, if the browser determines that the selected element is not the most appropriate element to receive the focus (the “No” branch from block 116), then the method continues at block 120.
At block 118, the browser identifies the selected element as the element to which the focus will be given.
At block 120 the browser determines whether or not there is another focusable element within the viewable area of the document that has not been selected as described with reference to block 114. If the browser determines that there is another focusable element that has not been selected (the “Yes” branch from block 120), then the method continues at block 114, as described above. On the other hand, if the browser determines that there are no other focusable elements that have not been selected (the “No” branch from block 120), then the method continues at block 108, setting the current focus to the focusable element identified as described above with reference to block 118.
As described above with reference to block 112, if the browser determines that there is not another focusable element within the viewable area of the document, then at block 122 (shown in
At block 126 the browser determines whether the document associated with currently focused element is a child document associated with another parent document. If the browser determines that there is not a parent document (the “No” branch from block 126), then at block 128, the method terminates with the focus remaining on the currently focused element. On the other hand, if the browser determines that there is a parent document (the “Yes” branch from block 126), then the method continues at block 130.
At block 130 the browser determines whether there are focusable elements within the viewable area of the parent document that are appropriate based on the display location of the elements in relation to the display location of the element with the current focus based on the direction indicated by the directional input. If there are no appropriate focusable elements within the viewable area of the parent (the “No” branch from block 130), then at block 132 the browser sets the focus to the parent document. On the other hand, if there is at least one appropriate focusable element within the viewable area of the parent document (the “Yes” branch from block 130), the method continues at block 134.
At block 134 the browser identifies the most appropriate focusable element in the parent document to receive the focus.
At block 136 the browser sets the current focus to the element identified as described with reference to block 134.
Horizontal Focus Navigation
Horizontal focus navigation pertains to the moving of focus from one element to another element in response to a left or right directional input. In an implementation of directional focus navigation, when moving focus horizontally, preference is given to a focusable element with a display position that has more vertical overlap with the display position of the currently focused element over a focusable element that has less vertical overlap with the display position of the currently focused element. Details of a horizontal focus navigation implementation will be described below with reference to
Right Directional Input
The above (and all following) equations are written using the assumption that the display area can be mapped to a two-dimensional grid in which the upper-left-most corner has (x,y) coordinates (0,0).
If a focusable element E meets the following criteria, then E is considered a focus candidate (an appropriate element to receive focus) when the directional input indicates moving focus to the right:
Comparing the horizontal midpoints (E.mid>F.mid) provides a mechanism for handling the special case in which two focusable elements have overlapping display areas.
Given the document illustrated in
As described above with reference to block 116 shown in
In this implementation, preference is given to an element that vertically overlaps the currently focused element because most documents have a visual layout that suggests reading the document first from left to right and then from top to bottom. In alternate implementations, the specific algorithm can be modified to support other document layouts. Given the calculations above, in the document shown in
Left Directional Input
Similarly, if a focusable element E meets the following criteria, then E is considered an appropriate element to receive focus when the directional input indicates moving focus to the left:
Element E is determined to be the most appropriate element to receive the focus when compared to element W when a left directional input is received if any of the following is true:
Vertical Focus Navigation
Vertical focus navigation pertains to the moving of focus from one element to another element in response to an up or down directional input. In an implementation of directional focus navigation, when moving focus vertically, preference is first given to a focusable element that is structurally close to the element that currently has the focus. Preference is next given to a focusable element that has horizontal coordinates closest to the horizontal coordinates of the last element that got focus by means other than an up or down directional input. Details of a vertical focus navigation implementation will be described below with reference to
Down Directional Input
Documents that can be rendered using a rendering engine can be defined in terms of hierarchical tree structure of elements. For example, the document 304 illustrated in
When the rendering engine receives a down directional input (e.g., a user presses a down arrow key), the rendering engine determines which, if any, of the other focusable elements (i.e., hyperlink 310 or button 312) is the most appropriate element to receive the focus next. (This process is described above with reference to blocks 102-120 shown in
The following mathematical definitions will be used below in vertical focus navigation equations that may be used in an implementation of a rendering engine that supports directional focus navigation:
The above (and all following) equations are written using the assumption that the display area can be mapped to a two-dimensional grid in which the upper-left-most corner has (x,y) coordinates (0,0).
If a focusable element E meets the following criteria, then E is considered an appropriate element to receive focus when the directional input indicates moving focus down:
Given the document illustrated in
As described above with reference to block 116 shown in
In this implementation, preference is given first to an element that is structurally closer to the currently focused element, as defined by the tree structure that represents the document. If a decision between multiple focusable elements remains, preference is then given to an element that is horizontally closer to the element that had focus when up/down navigation began. Therefore, in the document shown in
Link02-Link22 are all identified as appropriate elements to receive focus in response to the down directional input because each is within the focusable area of the document (indicated by box 404), the midpoint of each is below the midpoint of the currently focused element, and the top of each is below the bottom of the currently focused element.
When the focus is on element 406 and the rendering engine receives a down directional input, element 408 is given focus because it is vertically closer to the currently focused element 406 than any of the other elements that were determined appropriate for receiving focus.
When the focus is on element 408 and the rendering engine receives another down directional input, Link03-Link22 are all identified as appropriate elements to receive focus because they are below the currently focused element, 408. Because they are vertically closer to the currently focused element, Link03 and Link 04 are given preference. To decide between Link03 and Link04, the rendering engine compares the horizontal location of each with the horizontal location of the element that had focus when the up/down navigation began, which is hyperlink 406. Because Link04 is horizontally closer to hyperlink 406 than is Link03, hyperlink 410 (Link04) is given the focus in response to a down directional input received when the focus is on hyperlink 408.
Similarly, when the focus is on element 410 and the rendering engine receives another down directional input, Link05-Link22 are all identified as appropriate elements to receive focus because they are below the currently focused element, 410. Because they are vertically closer to the currently focused element, Link05-Link 07 are given preference. To decide between Link05, Link06, and Link07, the rendering engine compares the horizontal location of each with the horizontal location of the element that had focus when the up/down navigation began, which is hyperlink 406. Because Link07 is horizontally closer to hyperlink 406 than are Link05 or Link06, hyperlink 412 (Link07) is given the focus in response to a down directional input received when the focus is on hyperlink 410.
By the same rationale, if the rendering engine receives another down directional input while focus is on element 412, focus will move to element 414. When focus is on element 414 and the rendering engine receives another down directional input, focus moves to element 416. Finally, when focus is on element 414 and the rendering engine receives another down directional input, focus move to element 418.
Up Directional Input
Similarly, if a focusable element E meets the following criteria, then E is considered an appropriate element to receive focus when the directional input indicates moving focus up:
Element E is determined to be the most appropriate element to receive the focus when an up directional input is received if any of the following is true:
Directional Focus Navigation Among Focusable Areas
A rendering engine as described herein can be implemented to support directional focus navigation across arbitrarily shaped areas, such as areas defined for an image map.
The following mathematical definitions will be used below in focus navigation equations that may be used in an implementation of a rendering engine that supports directional focus navigation across areas of arbitrary shape:
The above (and all following) equations are written using the assumption that the display area can be mapped to a two-dimensional grid in which the upper-left-most corner has (x,y) coordinates (0,0).
Right Directional Input
If a focusable area E meets the following criteria, then E is considered an appropriate area to receive focus when the directional input indicates moving focus to the right:
As described above with reference to block 116 shown in
In the described implementation, the representative point of an area has a horizontal coordinate and a vertical coordinate. In one implementation, the horizontal coordinate of the representative point of an area is the average of the horizontal coordinates of all of the points of the area, while the vertical coordinate of the representative point is the average of the vertical coordinates of all of the points of the area. In an alternate implementation, the representative point of an area may be the midpoint of a particular side of the area. Although the described implementation suggests representative points that are calculated as average points, it is recognized that any number of methods and/or calculations may be used to determine a representative point associated with an area.
Left Directional Input
If a focusable area E meets the following criteria, then E is considered an appropriate area to receive focus when the directional input indicates moving focus to the left:
In this implementation, area E is determined to be the most appropriate area to receive the focus when a left directional input is received if the distance between the representative point of E and the representative point of F is shorter than the distance between the representative point of F and any of the other appropriate elements or areas to receive focus.
Down Directional Input
If a focusable area E meets the following criteria, then E is considered an appropriate area to receive focus when the directional input indicates moving focus down:
As described above with reference to block 116 shown in
In the described implementation, the representative point of an area, as described above, has a horizontal coordinate and a vertical coordinate. In the described implementation, when comparing focus candidates in response to vertical directional input (e.g., up or down), the horizontal component of the representative point of F is derived from the horizontal coordinate of the representative point of the element that had the focus when the first of the current sequence of up and/or down directional inputs was received.
By defining the horizontal coordinate of the vertical representative point of F in this way, priority is given to areas and/or elements that are horizontally closest to the area or element that had focus when a user first submitted an up or down directional input.
Up Directional Input
If a focusable area E meets the following criteria, then E is considered an appropriate area to receive focus when the directional input indicates moving focus Up:
Area E is determined to be the most appropriate area to receive the focus when an up directional input is received if the distance between the representative point of E and a vertical representative point of F is shorter than the distance between the vertical representative point of F and any of the other appropriate elements or areas to receive focus.
Focus Navigation in Scrollable Documents
a illustrates a scrollable document that includes focusable elements. Window 602 represents a display area, box 604 represents the document. The document 604 is vertically larger than the display area 602. To view portions of the document 604 that are outside of the display area 602, the document can be scrolled.
In typical browser implementations, in response to directional input (e.g., an arrow key press), if the document currently being rendered is scrollable, then the browser scrolls the document in the direction indicated by the directional input. In the described browser implementation that supports directional focus navigation, when directional input is received, the browser can move focus from one focusable element to another, as described above, or can cause the document to scroll in the direction indicated by the directional input.
In the example illustrated in
b illustrates the scrollable document that is described above with respect to
In an alternate implementation, in response to a directional input, the browser may both scroll and move focus. For example, if a browser receives a down directional input, and the only focus candidate element is located outside the viewable area, the browser may scroll the document in the down direction until the focus candidate is within the viewable area, and then move focus to that element.
c illustrates the scrollable document that is described above with respect to
a illustrates a document that includes focusable elements where one of the focusable elements is scrollable and contains other focusable elements. Document 702 includes focusable elements 704 (a hyperlink), 706 (a scrollable frame), and 708 (a button). Element 706 is representative of any scrollable element, which, in HTML, may include, but is not limited to, frame elements, object elements, div elements, and iframe elements. Document 710 is displayed in scrollable frame 706 and includes focusable elements 712, 714, 716, and 718 (all hyperlinks). Assuming in this example that the browser or rendering engine that is rendering document 702 receives several sequential down directional inputs. Assuming that focus is initially on hyperlink 702 (as indicated by the dashed line), when the browser receives the first down directional input, the browser identifies scrollable frame 706 as the most appropriate element to receive the focus. Because scrollable frame 706 includes focusable elements, the focus moves to the first identified focusable element in document 710, which is hyperlink 712.
When the browser receives a second down directional input, the focus moves to hyperlink 714.
When the browser receives a third down directional input, the focus remains on hyperlink 714 and the browser scrolls document 710. Depending on the amount that document 710 is scrolled, several down directional inputs may be received before 716 is scrolled into the displayable area (indicated by the boundaries of scrollable frame 706). Once 716 is within the displayable area, when another down directional input is received, focus moves to hyperlink 716.
A similar process of receiving down directional inputs and scrolling is repeated until hyperlink 718 is within the displayable area, and then in response to the next down directional input hyperlink 718 receives the focus.
Once hyperlink 718 has focus, additional down directional inputs will continue to scroll document 710 (while the focus remains on hyperlink 718) until the lowermost portion of document 710 is within the displayable area.
b illustrates the visual state of document described with reference to
If the browser receives an additional down directional input, focus moves from hyperlink 718 to button 708. The browser determines that there is not another focusable element below hyperlink 718 and within the displayable area of document 710, and the browser determines that document 710 cannot be scrolled in the down direction any further Accordingly, the browser identifies button 708 as the next focusable element below scrollable frame 706 in document 710.
Focus Navigation Between Frames
If focus is on hyperlink 812 (as indicated by the dashed line) when a browser application that is rendering the document receives a down directional input, focus moves to hyperlink 814.
However, if focus is on hyperlink 812 when a browser application that is rendering the document receives a right directional input, focus moves to hyperlink 818 because even though it is associated with a different document, it is the first focusable element that has a display location that is to the right of hyperlink 812.
Allowing Standard Behavior in Focusable Elements
Some focusable elements may, as a standard, exhibit a particular behavior in response to directional input. For example, if a pop-up menu has focus and a user presses a down arrow key, the next element down in the menu may become highlighted, indicated that it is the currently selected menu item. As another example, if a text box has focus and a user presses a right arrow key, a cursor in the text box may shift one character to the right.
In the implementation described, the browser application is implemented to support directional focus navigation while allowing standard behavior in those focusable elements that may respond in and of themselves to directional input. To allow this behavior, the browser may be implemented such that when an element has focus, a user can send a “select” input to indicate that directional inputs from that point on are to be directed to the focused element. A subsequent “select” input can be used to indicate that directional inputs from that point are to be directed to the document in such a way that they drive focus navigation.
For example, when a pop-up menu element receives the focus, a user may press the Enter key (or other designated “select” input device) to select the pop-up menu as the element that will be affected by subsequent directional input. Once the user has navigated to the desired menu item, the user may press the Enter key again to indicate that a selection has been made and to signify to the browser that additional directional inputs will now drive focus navigation. To cancel a selection, the user may press the Escape key (or other designated “cancel” input device).
Exemplary Implementation
Client device 902 includes a wireless receiving port 906 (e.g., an infrared (IR) wireless port) for receiving wireless communications from a remote control device 908, a handheld device 910 (such as a personal digital assistant (PDA) or handheld computer), or other wireless device, such as a wireless keyboard. Additionally, a wired keyboard 912 is coupled to client device 902 for communicating with the client device. In alternate embodiments, remote control device 908, handheld device 910, and/or keyboard 912 may us an RF communication link (or other mode of transmission) to communicate with client 1S device 902. In another alternate embodiment, client device 902, display device 904, and an input device may be implemented as a single hardware unit, such as a cellular telephone or as a handheld device similar to handheld device 910.
In the exemplary system described herein, directional inputs are provided by pressing keys arrow keys 914 of remote control 908, arrow keys or other inputs associated with handheld device 910, or arrow keys 916 of keyboard 912. Such keys might comprise four orthogonal direction keys (up, down, left, and right), or might allow movement in additional directions. Alternatively, a particular input device might utilize a joystick or similar mechanism for directional input, and such an input device might provide increased angular granularity as opposed to the four-direction granularity provided by four orthogonal direction keys.
Client device 902 includes a rendering engine for generating a video signal and/or an audio signal, both of which are communicated to display device 904. The video signals and audio signals can be communicated from client device 902 to display device 904 via an RF (radio frequency) link, S-video link, composite video link, component video link, or other communication link. Although not shown in
Client device 902 includes a processor 1002, a memory 1004, a display interface 1006, an audio output 1008, and a video output 1010. Operating system 1012, rendering engine 1014, and other applications 1016 are stored in memory 1004 and executed on processor 1002. Display interface 1006 provides a mechanism for client device 902 to display documents rendered by rendering engine 1014.
To support directional focus navigation, rendering engine 1014 includes directional input interface 1018 for receiving directional input, candidate element identifier 1020 for identifying focusable elements that may receive focus based on received directional input, and optimal element selector 1022 for selecting an optimal focusable element to receive focus when multiple candidate elements are selected based on received directional input.
Conclusion
Although the systems and methods have been described in language specific to structural features and/or methodological steps, it is to be understood that the invention defined in the appended claims is not necessarily limited to the specific features or steps described. Rather, the specific features and steps are disclosed as preferred forms of implementing the claimed invention.
This application claims priority under 35 U.S.C. §120 as a continuation of U.S. patent application Ser. No. 10/293,447, filed Nov. 13, 2002, and titled “Directional Focus Navigation”, the entire disclosure of which is hereby incorporated by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
5548703 | Berry et al. | Aug 1996 | A |
5625763 | Cirne | Apr 1997 | A |
5808604 | Robin | Sep 1998 | A |
6034689 | White et al. | Mar 2000 | A |
6072485 | Barnes et al. | Jun 2000 | A |
6101510 | Stone et al. | Aug 2000 | A |
6154205 | Carroll et al. | Nov 2000 | A |
6249284 | Bogdan | Jun 2001 | B1 |
6256030 | Berry et al. | Jul 2001 | B1 |
6317144 | Pabla et al. | Nov 2001 | B1 |
6549221 | Brown et al. | Apr 2003 | B1 |
6606106 | Mendenhall et al. | Aug 2003 | B1 |
6614457 | Sanada et al. | Sep 2003 | B1 |
6622306 | Kamada | Sep 2003 | B1 |
6822664 | Vale | Nov 2004 | B2 |
6862598 | Higashigawa | Mar 2005 | B2 |
6892360 | Pabla et al. | May 2005 | B1 |
6907574 | Xu et al. | Jun 2005 | B2 |
6947062 | Cuijpers et al. | Sep 2005 | B2 |
6966037 | Fredriksson et al. | Nov 2005 | B2 |
7055108 | Goloshubin et al. | May 2006 | B2 |
20010017634 | Scott | Aug 2001 | A1 |
20020178142 | Higashigawa | Nov 2002 | A1 |
20030014401 | Goloshubin et al. | Jan 2003 | A1 |
20030174170 | Jung et al. | Sep 2003 | A1 |
20030231210 | Anderson et al. | Dec 2003 | A1 |
20030233652 | Hsieh | Dec 2003 | A1 |
20040041837 | Yamaguchi et al. | Mar 2004 | A1 |
20040210851 | Premchandran et al. | Oct 2004 | A1 |
20050097479 | Takabe et al. | May 2005 | A1 |
20050114788 | Fabritius | May 2005 | A1 |
20050125740 | Clow et al. | Jun 2005 | A1 |
20050210399 | Filner et al. | Sep 2005 | A1 |
20060112346 | Miksovsky et al. | May 2006 | A1 |
20080222530 | Lakshmanan et al. | Sep 2008 | A1 |
Number | Date | Country | |
---|---|---|---|
20060206832 A1 | Sep 2006 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10293447 | Nov 2002 | US |
Child | 11380905 | US |