This application is related to the following patent documents, the contents of each of which are incorporated into this disclosure: (i) U.S. patent application Ser. No. 13/083,470, entitled “Systems and Methods for Accurate User Foreground Video Extraction,” filed Apr. 8, 2011 and published Oct. 13, 2011 as U.S. Patent Application Pub. No. US2011/0249190, (ii) U.S. patent application Ser. No. 13/076,264, entitled “Systems and Methods for Embedding a Foreground Video into a Background Feed based on a Control Input,” filed Mar. 30, 2011 and published Oct. 6, 2011 as U.S. Patent Application Pub. No. US2011/0242277, and (iii) unpublished U.S. Patent Application entitled “System and Methods for Persona Identification Using Combined Probability Maps,” filed Dec. 31, 2013 and having Ser. No. 14/145,874.
Online data communications are quite prevalent and pervasive in modern society, and are becoming more so all the time. Moreover, developments in software, communication protocols, and peripheral devices (e.g., video cameras), along with developments in other computing disciplines, have collectively enabled and facilitated the inclusion of multimedia experiences as part of such communications. These multimedia experiences take forms such as audio chats, video chats (that are usually also audio chats), online meetings (e.g., web meetings), and the like.
Using the context of online meetings as an illustrative example, it is often the case that one of the participants is the designated presenter, and often this designated presenter opts to include some visual materials as part of the offered presentation. Such visual materials may take the form of (or at least include) visual aids such as shared desktops, multiple-slide presentations, and the like. In some instances, from the perspective of another attendee at the online meeting, only such visual materials are presented on the display of the online meeting, while the presenter participates only as an audio voiceover. In other instances, the presenter may be shown in one region of the display while the visual materials are shown in another. And other similar examples exist as well.
Described herein are methods and systems for visually deemphasizing a displayed persona. A “persona” of a presenter may be extracted from a video feed of a video camera that is capturing video of the presenter. The extracted persona, which in some examples appears as a depiction of the presenter from the torso up (i.e., upper torso, shoulders, arms, hands, neck, and head), may then be visually combined with content such as a multiple-slide presentation such that the presenter appears to attendees of an online meeting to be superimposed over the content, thus personalizing and otherwise enhancing the attendees' experiences.
At least one embodiment of the present disclosure takes the form of a method carried out by a user-interface (UI)-management device. The UI-management device identifies a persona region of a displayed persona and identifies a user-interface-focus location. The UI-management device then makes a persona-deemphasize determination based on the identified persona region and the identified user-interface-focus location. In response to making the persona-deemphasize determination, the UI-management device visually deemphasizes the displayed persona.
At least one embodiment takes the form of a UI-management device that includes a wireless-communication interface, a processor, and data storage containing instructions executable by the processor for causing the UI-management device to carry out at least the functions described in the preceding paragraph. Moreover, any of the variations and permutations that are described in the ensuing paragraphs and anywhere else in this disclosure can be implemented with respect to any embodiments, including with respect to any method embodiments and with respect to any system embodiments. Furthermore, this flexibility and cross-applicability of embodiments is present in spite of the use of slightly different language (e.g., process, method, steps, functions, set of functions, and the like) to describe and/or characterize such embodiments.
In at least one embodiment, the persona-deemphasize determination is based on a calculated display distance between a persona-region location of the persona region and the user-interface-focus location.
In at least one such embodiment, the persona-deemphasize determination is further based on a determination that the calculated display distance is less than a proximity threshold.
In at least one other such embodiment, the persona-region location is a location within the persona region that is closest to the user-interface-focus location. In at least one other such embodiment, the persona-region location is a centroid of the persona region. In at least one other such embodiment, the persona-region location is a centroid of a minimum-bounding box that encloses the persona region.
In at least one embodiment, the persona-deemphasize determination is based on a determination that the user-interface-focus location is within a deemphasize region of the persona region.
In at least one embodiment, the user-interface-focus location is a mouse-pointer location. In at least one other embodiment, the user-interface-focus location is a stylus location.
In at least one embodiment, the user-interface-focus location is a location within a user-interface region.
In at least one such embodiment, the user-interface region is a focus-window region of a focus window. In at least one such embodiment, the focus window is a selected window. In at least one such embodiment, the selected window is a moved window.
In at least one other such embodiment, the user-interface-focus location is a location within the user-interface region that is closest to the persona-region location. In at least one other such embodiment, the user-interface-focus location is a centroid of the user-interface region. In at least one other such embodiment, the user-interface-focus location is a centroid of a minimum-bounding box that encloses the user-interface region.
In at least one other such embodiment, the persona-deemphasize determination comprises a partial-overlap determination that the user-interface region at least partially overlaps the persona region. In at least one other such embodiment, the persona-deemphasize determination comprises a total-overlap determination that the user-interface region totally overlaps the persona region.
In at least one embodiment, visually deemphasizing the displayed persona comprises decreasing an opacity of the displayed persona. In at least one other embodiment, visually deemphasizing the displayed persona comprises decreasing a display size of the displayed persona.
In at least one embodiment, the UI-management device presents a user-interface-adjustment menu in response to making the persona-deemphasize determination. The user-interface-adjustment menu (i) at least partially overlaps the persona region and (ii) comprises a set of one or more user-interface-adjustment-menu options that are associated with respective user-interface-adjustment functions. In at least one such embodiment, one of the respective user-interface-adjustment functions comprises muting an audio presentation associated with the displayed persona. In at least one other such embodiment, one of the respective user-interface-adjustment functions comprises hiding the displayed persona.
The above overview is provided by way of example and not limitation, as those having ordinary skill in the relevant art may well implement the disclosed systems and methods using one or more equivalent components, structures, devices, and the like, and may combine and/or distribute certain functions in equivalent though different ways, without departing from the scope and spirit of this disclosure.
A more detailed understanding may be had from the following description, which is presented by way of example in conjunction with the following drawings, in which like reference numerals are used across the drawings in connection with like elements.
The present systems and methods will now be described with reference to the figures. It should be understood, however, that numerous variations from the depicted arrangements and functions are possible while remaining within the scope and spirit of the claims. For instance, one or more elements may be added, removed, combined, distributed, substituted, re-positioned, re-ordered, and/or otherwise changed. Further, where this description refers to one or more functions being implemented on and/or by one or more devices, one or more machines, and/or one or more networks, it should be understood that one or more of such entities could carry out one or more of such functions by themselves or in cooperation, and may do so by application of any suitable combination of hardware, firmware, and/or software. For instance, one or more processors may execute one or more sets of programming instructions as at least part of carrying out of one or more of the functions described herein.
The described alpha masks correspond in name with the definition of the “A” in the “RGBA” pixel-data format known to those of skill in the art, where “R” is a red-color value, “G” is a green-color value, “B” is a blue-color value, and “A” is an alpha value ranging from 0 (complete transparency) to 1 (complete opacity). In a typical implementation, the “0” in the previous sentence may take the form of a hexadecimal number such as 0x00 (equal to a decimal value of 0 (zero)), while the “1” may take the form of a hexadecimal number such as 0xFF (equal to a decimal value of 255); that is, a given alpha value may be expressed as an 8 bit number that can be set equal to any integer that is (i) greater than or equal to zero and (ii) less than or equal to 255. Moreover, a typical RGBA implementation provides for such an 8 bit alpha number for each of what are known as the red channel, the green channel, and the blue channel; as such, each pixel has (i) a red (“R”) color value whose corresponding transparency value can be set to any integer value between 0x00 and 0xFF, (ii) a green (“G”) color value whose corresponding transparency value can be set to any integer value between 0x00 and 0xFF, and (iii) a blue (“B”) color value whose corresponding transparency value can be set to any integer value between 0x00 and 0xFF. And certainly other pixel-data formats could be used, as deemed suitable by those having skill in the relevant art for a given implementation.
The extracted persona may be merged with content (such as the content projected by projector 108) in a manner consistent with these conventions. In particular, on a pixel-by-pixel (i.e., pixel-wise) basis, the merging is carried out using pixels from the captured video frame for which the corresponding alpha-mask values equal 1, and otherwise using pixels from the content. Pixel data structures will typically include (or be associated with) one or more other values corresponding respectively to one or more other properties of the pixel, where brightness is an example of one such property. In some embodiments, the brightness value is the luma component of the image or video frame. In other embodiments, the brightness value is the pixel values of one of an R, G, or B color channel, or other similar color space (e.g., gamma compressed RGB, or R′G′B′, or YUV, or YCbCr, as examples). In other embodiments, the brightness value may be a weighted average of pixel values from one or more color channels. And other approaches exist as well.
UI-management device 302 includes a processor 310, a data storage 312, and a communication interface 314, all of which are interconnected via a system bus 332. Those of skill in the art will appreciate that UI-management device 302 could include different and/or additional elements. UI-management device 302 could take the form of a desktop computer, a laptop computer, a server computer, a tablet computer, and/or a smartphone, among other examples.
Processor 310 may include one or more processors of any type deemed suitable by those of skill in the relevant art, some examples including a microprocessor and a dedicated digital signal processor (DSP).
Data storage 312 may take the form of any non-transitory computer-readable medium or combination of such media, some examples including flash memory, read-only memory (ROM), and random-access memory (RAM) to name but a few, as any one or more types of non-transitory data-storage technology deemed suitable by those of skill in the relevant art could be used. As depicted in
Communication interface 314 may include any necessary hardware (e.g., chipsets, antennas, Ethernet cards, and the like), any necessary firmware, and/or any necessary software for conducting one or more forms of communication. Communication interface 314 may be configured to communicate according to one or more wired- and/or wireless-communication types and/or protocols described herein or otherwise deemed suitable by those having skill in the relevant art for a given implementation or in a given context.
Keyboard 304, display 306, and mouse 308 are depicted as being communicatively connected to communication interface 314 via communication links 326, 328, and 330, respectively, and processor 310, data storage 312, and communication interface 314 are shown as being interconnected via a communication link 332 (which could be a system bus, for example). Any one or more of communication links 326-332 could take the form of one or more wired and/or wireless communication links, and communication over communication links 326-332 may comply with one or more communication protocols such as IEEE 802.15 (Bluetooth), universal serial bus (USB), IEEE 1394 (FireWire), IEEE 802.11 (Wi-Fi), 802.3 (Ethernet), digital visual interface (DVI), and/or high-definition multimedia interface (HDMI), among numerous other possibilities. One or more routers, switches, gateways, hubs, repeaters, and the like may be disposed along communication links 326-332. For example, even though keyboard 304, display 306, and mouse 308 are depicted as being directly connected to communication interface 314, these components could instead (or additionally) be communicatively connected to user-interface device 302 via a network 120, which could take the form of a packet-switched network (such as the Internet) or any other suitable network.
In at least one embodiment, the persona-deemphasize determination is based on a calculated display distance between an identified persona-region location and the identified user-interface-focus location. For example,
Persona-region location 502 is a location within a persona region 550 that is closest to a given user-interface-focus location. In the illustrated embodiment, persona-region location 502 is located at an endpoint of line segment 516, which represents the shortest-possible line segment between persona region 550 and a user-interface region 552 (which described in additional detail below).
Persona-region location 504 is a centroid of a minimum-bounding box that encloses persona region 550 (as depicted in and explained below in connection with
Persona-region location 506 is a location at an endpoint, on the border of persona region 550, of the line segment 522, which is the shortest possible line segment between persona-region location 504 and user-interface-focus location 512 (which, in the illustrated embodiment, is the centroid of user-interface region 552).
User-interface-focus location 508 is a location of mouse pointer 204 which, as illustrated in
In at least one embodiment, the user-interface-focus location is a location within user-interface region 552, which, in the embodiment illustrated in
In at least one embodiment, user-interface region 552 is a focus-window region of a focus window, which could be, e.g., a window (or other component) that will receive any input (e.g., via a keyboard or mouse). For example, text entered at keyboard 304 or pasted from a clipboard may be sent to the focus window. Focus may be withdrawn from the focus window by giving another window (or other user-interface element) the focus, e.g., by determining that another window or other user-interface element has been selected. Accordingly, the focus window may be a selected window—perhaps selected by a mouse click (via mouse 308) on another window or user-interface element that can receive focus. In at least one embodiment, the selected window is a moved window—moved, for example, by selecting a window and dragging the selected window to another location.
Though various embodiment have been described in which the persona-deemphasize determination made at step 406 is based on a calculated display distance between an identified persona-region location and the identified user-interface-focus location, those of skill in the art will appreciate that the persona-deemphasize determination may be made by different and/or additional means. For example, the persona-deemphasize determination may be based on a determination that the user-interface-focus location is within a deemphasize region of the persona region. To illustrate,
As another example, the persona-deemphasize determination may be a partial-overlap determination that the user-interface region at least partially overlaps the persona region. To illustrate,
With respect to step 408, visually deemphasizing displayed persona 202 could include, for example, decreasing an opacity of the displayed persona, changing a position of the displayed persona, and/or decreasing a display size of the displayed persona, among numerous other possibilities that will be known to those of skill in the art.
The magnitude of the visual de-emphasis could be based on the calculated display distance (among other possible factors). In an embodiment, the magnitude of the visual de-emphasis is proportional to the calculated display distance. For example, the opacity and/or size of displayed persona 202 could gradually decrease as the calculated display distance gradually decreases (e.g., as mouse pointer 204 gradually moves closer to persona region 550).
The magnitude of the visual de-emphasis could be based on the displayed contents of window 206 and/or another UI element. For example, a change of a displayed slide in a multi-slide presentation and/or a movement of a character in a video game (as displayed in window 206) could result in a greater amount of visual de-emphasis as compared to no change of the displayed contents of window 206. Relatedly, the persona-deemphasize determination could be made in response to (and/or based on) a change of the displayed contents of window 206 and/or another UI element.
In the embodiment depicted in
Although features and elements are described above in particular combinations, those having ordinary skill in the art will appreciate that each feature or element can be used alone or in any combination with the other features and elements without departing from the scope and spirit of the present disclosure.
Number | Name | Date | Kind |
---|---|---|---|
5001558 | Burley et al. | Mar 1991 | A |
5022085 | Cok | Jun 1991 | A |
5343311 | Morag | Aug 1994 | A |
5506946 | Bar | Apr 1996 | A |
5517334 | Morag et al. | May 1996 | A |
5534917 | MacDougall | Jul 1996 | A |
6119147 | Toomey | Sep 2000 | A |
6150930 | Cooper | Nov 2000 | A |
6618444 | Haskell | Sep 2003 | B1 |
6664973 | Iwamoto et al. | Dec 2003 | B1 |
6760749 | Dunlap | Jul 2004 | B1 |
7124164 | Chemtob | Oct 2006 | B1 |
7518051 | Redmann | Apr 2009 | B2 |
7755016 | Toda et al. | Jul 2010 | B2 |
7770115 | Gallmeier | Aug 2010 | B2 |
7773136 | Ohyama et al. | Aug 2010 | B2 |
7821552 | Suzuki | Oct 2010 | B2 |
7831087 | Harville | Nov 2010 | B2 |
8094928 | Graepel | Jan 2012 | B2 |
8225208 | Sprang | Jul 2012 | B2 |
8264544 | Chang | Sep 2012 | B1 |
8345082 | Tysso | Jan 2013 | B2 |
8379101 | Mathe | Feb 2013 | B2 |
8411148 | Hsieh | Apr 2013 | B2 |
8411149 | Maison | Apr 2013 | B2 |
8533593 | Grossman | Sep 2013 | B2 |
8533594 | Grossman | Sep 2013 | B2 |
8533595 | Grossman | Sep 2013 | B2 |
8588515 | Bang | Nov 2013 | B2 |
8649932 | Mian | Feb 2014 | B2 |
8659658 | Vassigh | Feb 2014 | B2 |
8666153 | Hung | Mar 2014 | B2 |
8701002 | Grossman | Apr 2014 | B2 |
8818028 | Nguyen | Aug 2014 | B2 |
8874525 | Grossman | Oct 2014 | B2 |
8890923 | Tian | Nov 2014 | B2 |
8890929 | Paithankar | Nov 2014 | B2 |
8913847 | Tang | Dec 2014 | B2 |
8994778 | Weiser | Mar 2015 | B2 |
9008457 | Dikmen | Apr 2015 | B2 |
9065973 | Graham | Jun 2015 | B2 |
9088692 | Carter | Jul 2015 | B2 |
20020158873 | Williamson | Oct 2002 | A1 |
20060259552 | Mock | Nov 2006 | A1 |
20080266380 | Gorzynski | Oct 2008 | A1 |
20090199111 | Emori | Aug 2009 | A1 |
20110242277 | Do | Oct 2011 | A1 |
20110304632 | Evertt | Dec 2011 | A1 |
20120314077 | Clavenna, II | Dec 2012 | A1 |
20130094780 | Tang | Apr 2013 | A1 |
20130129205 | Wang | May 2013 | A1 |
20140245190 | Bragdon | Aug 2014 | A1 |
20140300630 | Flider | Oct 2014 | A1 |
20160196675 | Kosmach | Jul 2016 | A1 |
Number | Date | Country |
---|---|---|
2013019259 | Feb 2013 | WO |
Entry |
---|
Working screenshot of Snagit manufactured by Techsmith, relsesed Apr. 18, 2014. |
Number | Date | Country | |
---|---|---|---|
20160196675 A1 | Jul 2016 | US |