Many current applications executed on a modern computer feature an audio component. A media player application may play a digital song file. A video game may have music and incidental game sounds. A video chat program may render a voice transmission from a remote party.
Most modern computers may have a set of limited audio resources to execute those audio components. Each audio stream of the application may compete for use of the audio resources. The computer may merge the audio streams from each active application and play the combined audio back to the user.
This Summary is provided to introduce a selection of concepts in a simplified form that is further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.
Embodiments discussed below relate to a computer audio system that may use a playback manager to manage the audio streams of audio applications. A playback manager may receive an audio stream category declaration from an audio application. The playback manager may determine a media focus status for an audio stream of the audio application based in part on the audio stream category declaration.
In order to describe the manner in which the above-recited and other advantages and features can be obtained, a more particular description is set forth and will be rendered by reference to specific embodiments thereof which are illustrated in the appended drawings. Understanding that these drawings depict only typical embodiments and are not therefore to be considered to be limiting of its scope, implementations will be described and explained with additional specificity and detail through the use of the accompanying drawings.
Embodiments are discussed in detail below. While specific implementations are discussed, it should be understood that this is done for illustration purposes only. A person skilled in the relevant art will recognize that other components and configurations may be used without parting from the spirit and scope of the subject matter of this disclosure. The implementations may be a machine-implemented method, a tangible machine-readable medium having a set of instructions detailing a method stored thereon for at least one processor, or a computer audio system for a computing device.
If a user may interact with one application via a visual or graphical interface at a time, a computer audio system may treat an application that plays audio, or an audio application, differently from applications that do not play audio. An audio application is any application that has an audio output, whether as the primary purpose of the application or as an incidental result of the application.
A foreground application is an application that executes while visually displaying a graphical user interface to the user. A background application is an application that executes without visually displaying a graphical user interface to the user. A background application may be moved to the foreground by producing a graphical user interface for the application to the user, while the foreground application may be moved to the background by minimizing or removing the graphical user interface for the application.
Audio applications may continue to provide audio streams while being executed in the background. The computer audio system may use a playback manager to determine which audio applications may still be heard by the user while operating in the background. The playback manager may also route any media control commands to the proper audio application and manage utilization of audio resources. The media control commands may originate from a physical input, an operating system user interface, or other sources outside the application.
An audio application may declare the category of the audio being played back to a playback manager and register for media focus status notifications and transport control notifications. The playback manager may reside alongside the audio server and monitor the audio applications. The audio server is a set of audio services provided by the operating system. The playback manager may use the self-declared audio stream category to determine a media focus status of the audio application and make routing decisions for transport control notifications. The media focus status describes if an audio application or an audio stream of the audio application has media focus and is played to the user, or does not have media focus and is not played to the user. The playback manager may arbitrate the use of audio resources between audio applications on the system.
The playback manager may assign an audio stream category on an application basis or an individual audio stream basis. By declaring an application audio stream category for the audio application, the audio application may apply a relevant audio stream category to each audio stream from that audio application. The playback manager may assign an application audio stream category to an audio application based on the audio stream categories of the audio streams of the audio application.
The playback manager may assign a primary audio stream category or a secondary audio stream category to an audio stream based on the audio stream category declaration. A primary audio stream category indicates an audio stream to be played regardless of the shell state of the audio application. The shell state describes whether the audio application is in the foreground or in the background. A secondary audio stream category indicates an audio stream to be played when the audio application is in a foreground shell state. A first audio application with a primary audio stream category in the background shell state may have its audio stream merged with a second audio application in the foreground shell state. Alternatively, if the second audio application has a secondary audio stream category, the first audio application may interrupt any audio stream from the second audio application.
Additionally, an audio application may have a priority audio stream category, allowing that audio application to interrupt the audio streams of any other audio applications. The priority audio stream category may be used for an application having a communication audio stream category, such as a voice over internet protocol (VOIP) application. Conversely, an audio application may have a null audio stream category, preventing any of the audio streams of that audio application being played. The null audio stream category may be applied to programs with nonessential audio, unidentified audio, or even malware.
The playback manager may use the application audio stream category of an audio application to determine the media focus status of that audio application. An audio application having media focus has audio streams that may be heard by the user. An audio application having lost media focus has no audio streams that are heard by the user. The audio application may or may not receive an error notification indicating whether the audio streams are being heard by the user. An audio application without media focus may have any audio streams from that audio application muted in a manner transparent to the application.
The playback manager implements media focus status and other audio policies for the computer audio system. The playback manager may have a communication channel with the operating shell to get information about the shell state of each application, such as visibility to the user. Using the shell state and the audio stream category, the playback manager may play audio streams from an audio application having a primary audio stream category even when that audio application is in a background shell state. The playback manager may attenuate or fully mute the other audio streams when a communications stream is activated. The playback manager may also coordinate with other operating system components to arbitrate audio resources among multiple audio applications based on a shell state and audio categories. An audio application with a secondary audio stream category may lose media focus when moved to the background.
The playback manager may notify an audio application when media focus is granted or revoked. Upon the loss of media focus, the audio application may choose to perform optimizations in order to save processing and network resources and improve the user experience of the application. The audio application may pause the current activity after a change in media focus status, such as pausing media playback or pausing a game. The audio application may receive notifications by registering callbacks with the playback manager. An audio application may choose to ignore the media focus status notifications sent by the playback manager.
The playback manager may intelligently route transport control commands that originate from a transport control input, such as outside applications or hardware. For example, the playback manager may route signals from a remote control, a keyboard, or a soft button provided by the operating system user interface to the audio application, even when in a background shell state. The playback manager may route commands to an audio application that has been granted media focus. The operating system user interface may determine whether to display soft transport control buttons by querying the playback manager for active audio applications having media focus.
Audio applications may consume limited hardware or software resources. The playback manager may improve the efficiency of those resources by favoring allocation of limited audio resources to the audio application with media focus. The audio application, upon notification of a revoked media focus, may optimize operations to consume fewer resources.
Thus, in one embodiment, a computer audio system may use a playback manager to manage the audio streams of audio applications. A playback manager receives audio with a certain specified or computed audio stream category from an audio application. The playback manager may determine a media focus status for an audio stream of the audio application based in part on the audio stream category.
The processor 120 may include at least one conventional processor or microprocessor that interprets and executes a set of instructions. The memory 130 may be a random access memory (RAM) or another type of dynamic storage device that stores information and instructions for execution by the processor 120. The memory 130 may also store temporary variables or other intermediate information used during execution of instructions by the processor 120. The ROM 140 may include a conventional ROM device or another type of static storage device that stores static information and instructions for the processor 120. The storage device 150 may include any type of tangible machine-readable medium, such as, for example, magnetic or optical recording media and its corresponding drive. The storage device 150 may store a set of instructions detailing a method that when executed by one or more processors cause the one or more processors to perform the method. The storage device 150 may also be a database or a database interface for storing an audio stream category for an audio application.
The input device 160 may include one or more conventional mechanisms that permit a user to input information to the computing device 100, such as a touch screen, a keyboard, a mouse, a voice recognition device, a microphone, a headset, etc. The output device 170 may include one or more conventional mechanisms that output information to the user, including a display, a printer, one or more speakers, a headset, or a medium, such as a memory, or a magnetic or optical disk and a corresponding disk drive. Additionally, the output device 170 may be one or more audio resources to produce an audio output for the computer, such as speakers or a headphone jack. The communication interface 180 may include any transceiver-like mechanism that enables processing device 100 to communicate with other devices or networks. The communication interface 180 may include a network interface or a mobile transceiver interface. The communication interface 180 may be a wireless, wired, or optical interface.
The computing device 100 may perform such functions in response to processor 120 executing sequences of instructions contained in a computer-readable medium, such as, for example, the memory 130, a magnetic disk, or an optical disk. Such instructions may be read into the memory 130 from another computer-readable medium, such as the storage device 150, or from a separate device via the communication interface 180.
The computer device 100 may implement an audio resource management layout to control the use of the computer audio resources by an application.
The operating shell 208 may send a notification to the playback manager 202 of any changes to the shell state of any of the audio applications 204. An audio application 204 may change from a foreground shell state, with the audio application active and displayed to a user, to a background shell state, with the audio application active but not fully displayed to the user. The operating shell 208 may send a shell change notification to the playback manager 202 if an audio application 204 changes shell state. The playback manager 202 may use the shell state, in conjunction with an audio stream category of an audio application, to determine the media focus status for an audio application 204 or an audio stream of the audio application 204. The audio stream category may refer to an application audio stream category of an audio application 204 or an audio stream category of an audio stream of the audio application 204. If the audio application 204 has a primary audio stream category, the audio application 204 may be granted media focus regardless of the shell state. If the audio application 204 has a secondary audio stream category, the audio application 204 may be granted media focus while having a foreground shell state but may lose media focus while having a background shell state.
A transport control input 210 may send transport control commands and other media control commands to the playback manager 202. The playback manager 202 may send those transport control commands and other media control commands to those audio applications 204 having a media focus. Additionally, the transport control input 210 may be used by the user to directly change the audio stream category or the media focus status of an audio application 204 by sending a user override.
An audio application 204 may declare the audio stream category of the audio streams that the audio application 204 is playing. The playback manager 202 may use the audio stream category to determine whether the audio application 204 has media focus and to prioritize audio streams among audio applications 204.
The category field 330 may identify an audio application 204 or an audio stream as media 331, communications 332, narration 333, alerts 334, game media 335, game effects 336, sound effects 337, or other 338. Media 331 may describe any multimedia content, such as music players or video players. Communications 332 may describe voice over internet protocol applications or chat applications. Narration 333 may describe narration software, book readers, or other text-to-audio readers. Alerts 334 may describe ring tones, low battery alerts, or other notification software or components. Game media 335 may describe background game audio and other sounds associated with games. Game effects 336 may specifically describe the sound effects in games. Sound effects 337 may describe sound effects used in non-game applications. Other 338 may be used as a default category to describe any unclassified audio streams. Some categories 330, such as media 331, communications 332, or narration 333, may be considered primary media categories, while the other categories 330 may considered secondary media categories.
For a computer audio system that applies an audio stream category on an application-by-application basis, the category field 330 may describe the general type of the audio streams being provided, but not every audio stream. For example, an audio application 204 identified as game media 335 may provide one or more communications 332 audio streams in addition to several game media 335 streams.
If the audio stream does not have media focus (Block 412), the playback manager 202 may deny an audio resource for the audio stream (Block 414). If the audio stream has media focus (Block 412), the playback manager 202 may grant audio resources to the audio stream (Block 416). The playback manager 202 may direct a transport control notification to the audio application 204 with the audio stream having media focus (Block 418). The playback manager 202 may send a media focus status notification to the audio application 204 (Block 420).
Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms for implementing the claims.
Embodiments within the scope of the present invention may also include non-transitory computer-readable storage media for carrying or having computer-executable instructions or data structures stored thereon. Such non-transitory computer-readable storage media may be any available media that can be accessed by a general purpose or special purpose computer. By way of example, and not limitation, such non-transitory computer-readable storage media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to carry or store desired program code means in the form of computer-executable instructions or data structures. Combinations of the above should also be included within the scope of the non-transitory computer-readable storage media.
Embodiments may also be practiced in distributed computing environments where tasks are performed by local and remote processing devices that are linked (either by hardwired links, wireless links, or by a combination thereof) through a communications network.
Computer-executable instructions include, for example, instructions and data which cause a general purpose computer, special purpose computer, or special purpose processing device to perform a certain function or group of functions. Computer-executable instructions also include program modules that are executed by computers in stand-alone or network environments. Generally, program modules include routines, programs, objects, components, and data structures, etc. that perform particular tasks or implement particular abstract data types. Computer-executable instructions, associated data structures, and program modules represent examples of the program code means for executing steps of the methods disclosed herein. The particular sequence of such executable instructions or associated data structures represents examples of corresponding acts for implementing the functions described in such steps.
Although the above description may contain specific details, they should not be construed as limiting the claims in any way. Other configurations of the described embodiments are part of the scope of the disclosure. For example, the principles of the disclosure may be applied to each individual user where each user may individually deploy such a system. This enables each user to utilize the benefits of the disclosure even if any one of a large number of possible applications do not use the functionality described herein. Multiple instances of electronic devices each may process the content in various possible ways. Implementations are not necessarily in one system used by all end users. Accordingly, the appended claims and their legal equivalents should only define the invention, rather than any specific examples given.