Embodiments described herein generally relate to computers. More particularly, embodiments relate to a mechanism for facilitating watermarking-based management of echoes for content transmission at communication devices.
Echoes can be very disturbing and are often regarded as the worst type of impairment during conversations. Although various conventional echo cancellation techniques are employed at today's communication devices, these conventional techniques are inefficient as they are not known for complete elimination of echoes.
Embodiments are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings in which like reference numerals refer to similar elements.
In the following description, numerous specific details are set forth. However, embodiments, as described herein, may be practiced without these specific details. In other instances, well-known circuits, structures and techniques have not been shown in details in order not to obscure the understanding of this description.
Embodiments provide for extraction and/or suppression of communication signals (e.g., audio signal) that are classified as echoes (also referred to as “echo signals”) from a mixture of signals based on watermarking of the audio signals, where the mixture of signals being communicated between computing/communication devices (e.g., smartphones, table computers, etc.) over a network. In one embodiment, an audio signal, regarded as an echo, is watermarked prior to exiting the communication device so it may be recognized as the echo and sufficiently suppressed upon re-entering the communication device. For example, watermarking may be assigned according to a binary representation by using two different echo kernels (e.g., “one” kernel and “zero” kernel) to convolve the carrier audio signal. The two kernels may differ in the delay of the inserted echo and accordingly, at decoding, the bit value of each timeframe is recovered by comparing the presence of echo at the two expected delay values in the watermarked signal. Having taken into various account human ear capabilities and features, this novel and innovative technique of watermarking by echo-hiding may remain transparent to the human ear.
Computing device 100 may include any number and type of communication devices, such as large computing systems, such as server computers, desktop computers, etc., and may further include set-top boxes (e.g., Internet-based cable television set-top boxes, etc.), global positioning system (GPS)-based devices, etc. Computing device 100 may include mobile computing devices serving as communication devices, such as cellular phones including smartphones (e.g., iPhone® by Apple®, BlackBerry® by Research in Motion®, etc.), personal digital assistants (PDAs), tablet computers (e.g., iPad® by Apple®, Galaxy 3® by Samsung®, etc.), laptop computers (e.g., notebook, netbook, an Ultrabook™ system, etc.), e-readers (e.g., Kindle® by Amazon®, Nook® by Barnes and Nobles®, etc.), smart televisions, wearable devices (e.g., watch, bracelet, smartcard, etc.), media players, etc.
Computing device 100 may include an operating system (OS) 106 serving as an interface between hardware and/or physical resources of the computer device 100 and a user. Computing device 100 further includes one or more processors 102, memory devices 104, network devices, drivers, or the like, as well as input/output (I/O) sources 108, such as touchscreens, touch panels, touch pads, virtual or regular keyboards, virtual or regular mice, etc. It is to be noted that terms like “node”, “computing node”, “server”, “server device”, “cloud computer”, “cloud server”, “cloud server computer”, “machine”, “host machine”, “device”, “computing device”, “computer”, “computing system”, and the like, may be used interchangeably throughout this document. It is to be further noted that terms like “application”, “software application”, “program”, “software program”, “package”, and “software package” may be used interchangeably throughout this document. Similarly, terms like “job”, “input”, “request” and “message” may be used interchangeably throughout this document.
In some embodiments, computing device 100 may contain any number and type of other components to working with echo mechanism 110 to perform various conventional and non-conventional tasks. Many of such components are not discussed here and may include (but not limited to) equalizer dynamic control (EDC), speech intelligibility enhancement (SIE), signal and noise estimation (SNE), acoustic echo cancellation (AEC), gain loss control (GLC), noise reduction component including residual echo suppression component, or the like.
A communication signal (such as an audio signal (e.g., telephone voice signal, etc.), an audio/video signal (e.g., FaceTime® communication signal, Tango® communication signal, etc.), or the like) may be communicated between computing device 240 within far-end acoustic environment 220 and computing device 100 within near-end acoustic environment 250 over one or more communication networks, such as network 230 (e.g., telecommunication network, Internet, cloud network, etc.). It is contemplated that the communication between computing devices 100, 240 may be facilitated via one or more communication software applications, such as software application 241, provided by one or more telecommunication companies (e.g., Skype®, FaceTime® by Apple®, Tango®, Viber®, AT&T®, Verizon®, etc.). It is contemplated that one or more user interfaces, such as user interfaces 217, 243, provided by software applications, such as software application 242, may be used at computing devices 100, 240, to facilitate communication of signals (such as regular telephone calls, FaceTime® calls, etc.).
It is contemplated that although the illustrated embodiment provides for having echo mechanism 110 employed at computing device 100 serving as a near device where a communication signal is received from computing device 240, serving as a far device, for echo processing and filtering purposes, embodiments are not limited to this particular arrangement and that the tasks may be reversed between computing devices 110 and 240 and that any number and type of other computing devices (without or without echo mechanism 110) may be included in the fold along with any number and type of networks.
Once the communication signal (or simply “signal”) is received at computing device 100, it is to be passed and sounded through a listening device, such as listening device 213 (e.g., loudspeaker, etc.), at computing device 100 and which may then expected to create an echo once it has left listening device 213 and received at or fed back into a speaking device, such as speaking device 215 (e.g., microphone). In one embodiment, upon receiving the communication signal at computing device 100, it may then be detected and evaluated by signal detection and evaluation logic 201 to be considered as a potential echo. For example, as the communication signal is received at computing device 100 and going through typical communication components towards listening device 213, it may be detected by signal detection and evaluation logic 201 prior to reaching listening device 213 so that it may then be evaluated for possible watermarking prior to being mixed with other exterior signals, such as the voice of the user of computing device 100 at the receiving end and any other noise (e.g., traffic, crowd, television, etc.) that may be part of near-end acoustic environment 220.
In one embodiment, upon detection of the signal and the subsequent evaluation that it is regarded to be an echo, watermark assignment logic 203 assigns the signal a watermark for future recognition when it is returned to computing device 100 as an echo via speaking device 215. In one embodiment, echo monitoring and reception logic 205 continuously monitors the watermarked echo as it leaves listening device 213 and travels through the air and reaches speaking device 215 where it is received by echo monitoring and reception logic 205. It is contemplated the watermarked echo may not be the only sound received at speaking device 215 and that any number and types of other sounds may also be received and converged into becoming a mixture signal, including (but not limited to) human voice of a first user of computing device 100 and other noises and sounds, such as other human voices, traffic, etc., that may fall within near-end acoustic environment 220 and the reach of speaking device 215.
Upon receiving the watermarked echo at speaking device 215, the watermarked echo is detected by watermark detection logic 207 to be the echo as opposed to other noises and sounds that are entered through speaking device 215. In one embodiment, the detected watermarked echo is then processed to be dynamically filtered by filter and processing logic 209. For example, in some embodiments, the watermarked echo may be completely suppressed (also referred to as “cancelled”, “eliminated”, “removed”, or “hidden”); while, in some embodiments, the watermarked echo may be partially suppressed from reaching a second user of computing device 240, such as certain portions (e.g., certain words, frequency segments, etc.) may be eliminated or not eliminated and allowed to pass. For example, certain frequency segments may not be audible to the human ear and thus, there may not be a need to watermarked or eliminate them. In yet other embodiments, the watermarked echo may not be suppressed at all and allowed to pass on to computing device 240 over network 230, while, in yet other embodiments, only the watermarked echo may be kept and allowed to pass, while all other noises and sounds may be suppressed, such as when the watermarked echo is being used for detective purposes or in security situations, such as in police detective work, military work, etc.
In one embodiment, the signal may be broken down into segments and the segments may be selectively watermarked by watermark assignment logic 203, where each segment may represent or include a frequency band. For example, in some embodiments, the watermark may not be applied to the entire spectrum of the signal and that it may be applied selectively to any number and type of segments depending on the frequencies they represent. Accordingly, when the watermarked echo is detected by watermark detection logic 207, this allows for subsequent echo estimation at bands or sub-bands rather than on the entire signal or the mixture of sounds, which allows for filtering and processing logic 209 to perform frequency responses varying in time.
In one embodiment, the communication signal includes a loudspeaker signal that is obtained and decoded form network 230 which is to be sent to listening device 213. As aforementioned, the mixture signal, entering through speaking device 215, may include a sum of (but not limited to): (i) the echo, such as the loudspeaker signal after playback (ii) the environmental noise of near-end acoustic environment 220, and (iii) the useful speech from near-end speaker, such as first user. As will be further described with reference to
Echo Kernel
As aforementioned, watermark assignment logic 203 may be used to track segments (e.g., frequency brands) of the communication signal may be watermarked after being tracked and detected by signal detection and evaluation logic 201. For example, an “echo kernel” may refer to an expression of a delay line as a linear filter, while a “sub-band echo kernel” (“sub-band kernel” or simply “sub-kernel”) may refer to a subset of contiguous frequency bins of a band echo kernel, and a “full-band echo kernel” (“full-band kernel” or simply “full kernel”) may be an echo kernel. For example, a sub-kernel may be derived from an echo kernel which may have been shifted, scaled, and enforced to have a real valued impulse response.
In one embodiment, sub-kernels equivalent to a full kernel may be derived, where the targeted echo kernel includes a set of independent sub-kernels. For example, a different kernel may be used in each sub-kernel, while choosing and using a single type of kernel for all sub-kernels ensures that the resulting full kernel is equivalent to the echo kernel.
In one embodiment, let us suppose an echo refers to a feed-forward comb filter, of which unit sample response is as follows:
where α is the scaling factor (e.g., the amplitude of the echo), and D is the echo delay in samples. In one embodiment, let us suppose that α<1 and D>0. For instance, the coefficients of a 50% echo of 4 samples (e.g., α=0.5 and D=4) are:
h=[1 0 0 0.5].
As mentioned, for example, the set of sub-kernels be equivalent to a full kernel to ensure acceptable distortions in the watermarked signal. Further, essentials for latter detection of the watermark are to constrain the sub-kernels to have an echo kernel form as well. An echo kernel may have the following frequency response:
H(ω)=1+αe−jωD=1+α cos(ωD)−jα sin(ωD).
Within each interval [kπ; (k+1)π], kε, H is periodic of period
Let us consider a frequency-shifted version of H to be as follows:
where K is the desired number of bands, and k−0 . . . K−1. If one chooses K=D/2q, where integer q is the number of periods per band:
because H is
periodic within [0; π]
The filter is then frequency scaled by factor
This frequency response is truncated over [0; π]:
From this truncated frequency response, a sub-band filter H′ may be defined by assuming that its time-domain coefficients are real, which may be imposed by wanting of the sub-band filter to be of the echo kernel form. We chose K=D/2q:
which is periodic of period
Accordingly, interval [0; π] spans a whole number q of periods of H
Consequently, the frequency response of sub-band filter H′ is periodic and equal to that of H
From this point of view, H′ is a sub-kernel with delay
derived from the full-kernel H.
Communication Signal Watermarking
As will be further described with respect to
Detecting Presence of Watermarked Echo Upon Passing Speaking Device 215
The detection of the watermark in the microphone signal may be based on the cepstral analysis, [See Gruhl et al., Echo Hiding, 1996], except that, in one embodiment, it may be performed in sub-kernels (as opposed to on an entire broadband communication signal) and further, the watermarked signal may be detected from a mixture of signals, such as noise and sounds of near-end acoustic environment 220, containing the watermarked signal.
Echo Detection
Computation of the cepstrum of the watermarked signal W(n) may allow for the separation of the echo kernel H from the original signal X(n) it has been convolved with as follows:
where {tilde over (w)}, {tilde over (x)} and {tilde over (h)} refer to the complex cepstra of w, x and h, respectively. For example, cepstral analysis converts the convolution operation to an addition operation. Regarding the cepstrum of h:
The two terms in the inverse Fourier transforms:
are both periodic of period
Therefore, according to the Fourier analysis, their inverse Fourier transforms show in {tilde over (h)}(n) one strong component at its fundamental frequency n=D. A first option to detect if an echo of delay D is present in {tilde over (w)} is may be to look at value {tilde over (w)}(D).
However, because of the presence of the log function in both terms, additional components may show as well in {tilde over (h)}(n) at its harmonic frequencies n=2D, 3D, etc. Accordingly, to further improve the detection of an echo of delay D in {tilde over (w)}, the autocorrelation R{tilde over (w)}{tilde over (w)}(n) of {tilde over (w)} is usually computed to obtain the power of the signal found at each delay n. For example, one can decide on the presence of an echo of delay D by looking at whether a power spike is present at value R{tilde over (w)}{tilde over (w)}(D).
One Embodiment of an Implementation of Echo Detector in Sub-Band Echoes
In one embodiment, the frequency analysis of the microphone signal y(n) may be performed based on a Short-Term Fourier Transform (STFT), such as:
Y(l,m)=w(l,m)+S(l,m)+Z(l,m),
where W is the watermarked loudspeaker signal, S is the near-end speech signal (useful speech, such as by the first user of computing device 100), Z is the environmental noise signal, such as from near-end acoustic environment 220, l is the frequency bin, and m is the temporal frame index.
Following the aforementioned sub-band kernel watermarking approach, for each temporal frame m, Y is decomposed in
sub-band signals
Y
k
=Y[k2qDπ . . . (k|1)2qDπ], k=0 . . . K−1 (index m is omitted for clarity):
A frequency shifting by
followed by a frequency scaling of
is applied to each Y_k, to obtain Yk′. Therefore, in this particular case for which Y−W, signals Yk′ are equal to the product of Xk′ with H′, such as xk′ convolved with the sub-kernel h′ of delay D′=2q.
For each temporal frame, for example, a Discrete Fourier Transform (DFT) may be performed on N points. Based on the previous remarks, the N/2+1 useful frequency bins may be grouped in K bands of an equal number of frequency bins
Filter Based on Watermarking Presence
For one temporal frame (omitting index m):
Y
k(l)=Wk(l)+Sk(l)+Zk(l), with Wk′(l)=Xk′(l)Hk′(l).
Cepstrum of yk′ then equals:
When Wk′(l)>>Sk′(l)+Zk′(l), then:
Due to the presence of the echo kernel {tilde over (h)}k′ of delay D′, R{tilde over (y)}
On the opposite, when Wk′(l)<<Sk′(l)+Zk′(l), then:
{tilde over (y)}
k′(n)≈τ−1 log [Sk′(l)+Zk′(l)]
and assuming that Sk′+Zk′ does not result from the convolution with an echo kernel of delay D′, R{tilde over (y)}
In the following, let us suppose that the desired behavior is to remove signal Wk′ from Yk′, but a similar reasoning can be used to keep Wk′ only. A simple, binary gain rule consists in setting a threshold τ above which Yk′ is considered as being mainly composed of Wk′.
For instance, setting gmin=0 and gmax=1 results in a filter which removes bands mainly composed of watermarked signal while keeping other bands.
By extension, one can define a smoother gain rule based on two thresholds τmin and τmax:
This gain rule verifies that for Rý
Filtering of Watermarked Echo Received Via Speaking Device 215
With regard to filtering of watermarked echo (e.g., “microphone signal”) that is received via speaking device 215, any filtering method (e.g., Inverse Discrete Fourier Transform (IDFT), Overlap-Add (OLA), Analysis-Synthesis Filter-Bank (ASFB), Filter-Bank Equalizer (FBE), Low Delay Filter (LDF), etc.) may be used to apply the gain rule defined in the analysis stage as described above. For example, the hop size used for the STFT at analysis may be chosen to match that of the filtering method. Also, because analysis may need rather long frames to be efficient, the frames used at the filtering stage may be centered on that used for analysis.
Computing devices 100, 240 may further include any number and type of touch/image components, where these touch/image components may include (but not limited to) image capturing devices (e.g., one or more cameras, etc.) and image sensing devices, such as (but not limited to) context-aware sensors (e.g., temperature sensors, facial expression and feature measurement sensors, etc.) working with one or more cameras, environment sensors (such as to sense background colors, lights, etc.), biometric sensors (such as to detect fingerprints, facial points or features, etc.), and the like. Computing devices 100, 240 may also include one or more software applications, such as business applications, social network websites (e.g., Facebook®, Google+®, Twitter®, etc.), business networking websites (e.g., LinkedIn®, etc.), communication applications (e.g., Skype®, Tango®, Viber®, etc.), games and other entertainment applications, etc., offering one or more user interfaces (e.g., web user interface (WUI), graphical user interface (GUI), touchscreen, etc.), while ensuring compatibility with changing technologies, parameters, protocols, standards, etc.
Communication/compatibility logic 211 may be used to facilitate dynamic communication and compatibility between computing device 100 and any number and type of other computing devices (such as a mobile computing device, a desktop computer, a server computing device, etc.), storage devices, databases and/or data sources (such as data storage devices, hard drives, solid-state drives, hard disks, memory cards or devices, memory circuits, etc.), networks (e.g., cloud network, the Internet, intranet, cellular network, proximity networks, such as Bluetooth, Bluetooth low energy (BLE), Bluetooth Smart, Wi-Fi proximity, Radio Frequency Identification (RFID), Near Field Communication (NFC), Body Area Network (BAN), etc.), wireless or wired communications and relevant protocols (e.g., Wi-Fi®, WiMAX, Ethernet, etc.), connectivity and location management techniques, software applications/websites, (e.g., social and/or business networking websites, such as Facebook®, LinkedIn®, Google+®, Twitter®, etc., business applications, games and other entertainment applications, etc.), programming languages, etc., while ensuring compatibility with changing technologies, parameters, protocols, standards, etc.
Although one or more terms or examples (e.g., communication signals, loudspeaker signals, microphone signals, watermarked signals, echoes, echo kernels, sub-kernels, full-kernels, segments including frequency bands, telephones, smartphones, table computers, etc.) may be discussed throughout this document for brevity, clarity, and ease of understanding, it is contemplated that embodiments are not limited to any particular number and type of gestures, display panels, computing devices, users, network or authentication protocols or processes, or the like. For example, embodiments are not limited to any particular network security infrastructures or protocols (e.g., single-sign-on (SSO) infrastructures and protocols) and may be compatible with any number and type of network security infrastructures and protocols, such as security assertion markup language (SAML), OAuth, Kerberos, etc.
Throughout this document, terms like “logic”, “component”, “module”, “framework”, “engine”, “point”, and the like, may be referenced interchangeably and include, by way of example, software, hardware, and/or any combination of software and hardware, such as firmware. Further, any use of a particular brand, word, term, phrase, name, and/or acronym, such as “echo cancellation” or “EC”, “watermark echo cancellation” or “WEC”, “gain watermark echo cancellation” or “GWEC”, “watermark echo filtering” or “WEF”, “communication signal”, “loudspeaker signal”, “microphone signal”, “watermark” or “watermarking”, “watermarked signal”, “echo” or “watermarked echo”, “echo kernel”, “sub-band echo kernel” or “sub-kernel”, “full-band echo kernel” or “full kernel”, “segment” or “frequency band”, “telephone”, “smartphone”, “tablet computer”, etc., should not be read to limit embodiments to software or devices that carry that label in products or in literature external to this document.
It is contemplated that any number and type of components may be added to and/or removed from echo watermarking and filtering mechanism 110 to facilitate various embodiments including adding, removing, and/or enhancing certain features. For brevity, clarity, and ease of understanding of echo watermarking and filtering mechanism 110 and flexible wraparound display 120, many of the standard and/or known components, such as those of a computing device, are not shown or discussed here. It is contemplated that embodiments, as described herein, are not limited to any particular technology, topology, system, architecture, and/or standard and are dynamic enough to adopt and adapt to any future changes.
Upon entering the air, the watermarked signal turns into a watermarked echo (e.g., watermarked segments or bands, such as full band echoes, sub-band echoes, etc.) which may then be returned and fed back into computing device 100 via speaking device 215 (e.g., microphone, etc.) as part of a mixture of signals including (but not limited to) useful sound (e.g., user's voice), other noises/sounds (e.g., kids, market noises, traffic sounds, office chatter, background television sound, etc.) within the acoustic environment of computing device 100. The watermarked echo is monitored and then received at speaking device 215 as a mixture of voice, noise, and watermarked echo. The monitoring and receiving is performed by echo monitoring and reception logic 205 of gain watermark echo cancellation (GWEC) engine 323.
In one embodiment, additional components, such as equalizer dynamic control 303B, signal and noise estimation 305, acoustic echo cancellation 307, noise reduction 309, residual echo suppression 311, and gain loss control 313 may also be employed to perform their respective tasks. In another embodiment, components 301, 303A-B, 305, 307, 309, 311, 313 may not be needed and instead, they be replaced by other components or simply WEC 312 and GWEC 323 of echo mechanism 110. It is contemplated that components 301, 303A-B, 305, 307, 309, 311, 313 and their corresponding connections, paths, and tasks are merely shown and/or discussed as examples for brevity, clarity, and ease of understanding and that embodiments are not limited to any of these or other such components. For example, GWEC 323 may be placed or allowed to function before or after noise reduction 309 and similarly, before or after acoustic echo cancellation 307, etc.
In one embodiment, GWEC having echo monitoring and reception logic 205, watermark detection logic 207, filtering and processing logic 209, and communication/compatibility logic 211 perform any number of tasks as described with reference to
For example, as user second user 351 speaks into speaking device 353 (e.g., microphone) at computing device 240, it generates communication signal 331 that is communicated over network 230 and received at computing device 100. In one embodiment, communication signal 331 is detected by WEC engine 321 where it is assigned a watermark as it leaves through listening device (e.g., loudspeaker) 213. Watermarked signal 333, upon departing computing device 100 via listening device 213, turns into watermarked echo 335 and enters back into computing device 100 via speaking device 215 (e.g., microphone). As illustrated, watermarked echo 335 may not be the only sound that may enter through speaking device 215 as it may be joined by other sounds, such as voice 337 of first user 331 speaking into speaking device 215, and other noise/sounds (e.g., traffic noise, chatter, background music, dog barking, etc.) within near-end acoustic environment 220.
These sounds 335, 337, 339 may enter computing device 100 as mixed signals 341 where, as aforementioned, watermarked echo is identified or detected by GWEC engine 323 and separated from mixed signals 341 for further processing. In one embodiment, the watermarked echo may be processed and filtered, at GWEC engine 323, to be completely or partially cancelled or, in another embodiment, it may not be filtered and allowed to proceed. In one embodiment, filtered or final signal 343 is then facilitated to be transmitted on to computing device 240 over network 230. At computing device 240, filtered signal 343 is broadcast to second user 351 through listening device (e.g., loudspeaker) 355.
Referring to
Method 500 begins at block 505 with receiving of a communication signal at a first computing device (e.g., smartphone, tablet computer, etc.) from a second computing device (e.g., smartphone, tablet computer, etc.). At block 510, the communication signal's presence is detected within the first computing device. At block 515, in one embodiment, a watermark is assigned to the detected communication signal before it leaves the first computing device via loudspeaker (other any other listening device), wherein the watermarked signal is regarded or referred to as a watermarked echo once it departs the first computing device through its loudspeaker and gets into the air. In one embodiment, the signal may be sorted by or divided into any number of segments, where each segment refers to a frequency band. Accordingly, in one embodiment, any number of such segments (e.g., minority of segments, majority of segments, etc.) may be watermarked as opposed to watermarking the entire signal. In another embodiment, the entire signal may be watermarked or the entire signal may not be watermarked. For example, certain frequency bands may not be of concern if they are not audible to the human ear and thus they may not be watermarked for being not likely to translate into or act as an echo. At block 520, the watermarked echo is continuously monitored and subsequently, at block 525, it is received back at first computing device via its microphone (or any other speaking device).
It is contemplated that the watermarked echo may not be the only signal or sound entering the first computing device and that it may be mixed with other sounds, such as a first user's voice as s/he speaks into the microphone and other environmental sounds, such as traffic noise, background chatter, etc., that are found to be within a proximity of the first computing device. At block 530, in one embodiment, the watermarked echo is identified or detected out of the mix of sounds and signals. At block 535, the detected watermarked echo is separated from the mix to be further processed for filtering purposes.
At block 540, in one embodiment, a determination is made as to whether the watermarked echo is to be filtered. If the watermarked echo is not to be filtered, at block 545, the watermarked echo is allowed to pass as a final signal to the second computing device. For example, in some embodiments, the watermarked echo may not be filtered for any number of reasons, such as when preferred or desired by the user or when the watermarked echo may be used for specific purposes, such as security measures, police/detective or military purposes, science research, research and development or experimentation, etc. At block 550, the final signal (having the watermarked echo) is allowed to be transmitted to the second computing device.
Referring back to block 540, if the watermarked echo is to be filtered, the process continues with block 555 where another determination is made as to whether the watermarked echo is to be filtered completely or partially. If the entire watermarked echo is to be filtered, at block 560, the watermarked echo is completely filtered and cancelled/suppressed and subsequently, at block 550, the final signal (without having any of the watermarked echo) is transmitted on to the second computing device. Referring back to block 555, if the watermarked echo is to be partially filtered (e.g., certain segments or frequency bands are to be filtered out or cancelled/suppressed, while other segments are allowed to remain and pass), a final signal having partially filtered watermarked echo is facilitated to be transmitted on to the second computing device at block 550.
Now referring to
Computing system 400 includes bus 405 (or, for example, a link, an interconnect, or another type of communication device or interface to communicate information) and processor 410 coupled to bus 405 that may process information. While computing system 400 is illustrated with a single processor, electronic system 400 and may include multiple processors and/or co-processors, such as one or more of central processors, graphics processors, and physics processors, etc. Computing system 400 may further include random access memory (RAM) or other dynamic storage device 420 (referred to as main memory), coupled to bus 405 and may store information and instructions that may be executed by processor 410. Main memory 420 may also be used to store temporary variables or other intermediate information during execution of instructions by processor 410.
Computing system 400 may also include read only memory (ROM) and/or other storage device 430 coupled to bus 405 that may store static information and instructions for processor 410. Data storage device 440 may be coupled to bus 405 to store information and instructions. Data storage device 440, such as magnetic disk or optical disc and corresponding drive may be coupled to computing system 400.
Computing system 400 may also be coupled via bus 405 to display device 450, such as a cathode ray tube (CRT), liquid crystal display (LCD) or Organic Light Emitting Diode (OLED) array, to display information to a user. User input device 460, including alphanumeric and other keys, may be coupled to bus 405 to communicate information and command selections to processor 410. Another type of user input device 460 is cursor control 470, such as a mouse, a trackball, a touchscreen, a touchpad, or cursor direction keys to communicate direction information and command selections to processor 410 and to control cursor movement on display 450. Camera and microphone arrays 490 of computer system 400 may be coupled to bus 405 to observe gestures, record audio and video and to receive and transmit visual and audio commands.
Computing system 400 may further include network interface(s) 480 to provide access to a network, such as a local area network (LAN), a wide area network (WAN), a metropolitan area network (MAN), a personal area network (PAN), Bluetooth, a cloud network, a mobile network (e.g., 3rd Generation (3G), etc.), an intranet, the Internet, etc. Network interface(s) 480 may include, for example, a wireless network interface having antenna 485, which may represent one or more antenna(e). Network interface(s) 480 may also include, for example, a wired network interface to communicate with remote devices via network cable 487, which may be, for example, an Ethernet cable, a coaxial cable, a fiber optic cable, a serial cable, or a parallel cable.
Network interface(s) 480 may provide access to a LAN, for example, by conforming to IEEE 802.11b and/or IEEE 802.11g standards, and/or the wireless network interface may provide access to a personal area network, for example, by conforming to Bluetooth standards. Other wireless network interfaces and/or protocols, including previous and subsequent versions of the standards, may also be supported.
In addition to, or instead of, communication via the wireless LAN standards, network interface(s) 480 may provide wireless communication using, for example, Time Division, Multiple Access (TDMA) protocols, Global Systems for Mobile Communications (GSM) protocols, Code Division, Multiple Access (CDMA) protocols, and/or any other type of wireless communications protocols.
Network interface(s) 480 may include one or more communication interfaces, such as a modem, a network interface card, or other well-known interface devices, such as those used for coupling to the Ethernet, token ring, or other types of physical wired or wireless attachments for purposes of providing a communication link to support a LAN or a WAN, for example. In this manner, the computer system may also be coupled to a number of peripheral devices, clients, control surfaces, consoles, or servers via a conventional network infrastructure, including an Intranet or the Internet, for example.
It is to be appreciated that a lesser or more equipped system than the example described above may be preferred for certain implementations. Therefore, the configuration of computing system 400 may vary from implementation to implementation depending upon numerous factors, such as price constraints, performance requirements, technological improvements, or other circumstances. Examples of the electronic device or computer system 400 may include without limitation a mobile device, a personal digital assistant, a mobile computing device, a smartphone, a cellular telephone, a handset, a one-way pager, a two-way pager, a messaging device, a computer, a personal computer (PC), a desktop computer, a laptop computer, a notebook computer, a handheld computer, a tablet computer, a server, a server array or server farm, a web server, a network server, an Internet server, a work station, a mini-computer, a main frame computer, a supercomputer, a network appliance, a web appliance, a distributed computing system, multiprocessor systems, processor-based systems, consumer electronics, programmable consumer electronics, television, digital television, set top box, wireless access point, base station, subscriber station, mobile subscriber center, radio network controller, router, hub, gateway, bridge, switch, machine, or combinations thereof.
Embodiments may be implemented as any or a combination of: one or more microchips or integrated circuits interconnected using a parentboard, hardwired logic, software stored by a memory device and executed by a microprocessor, firmware, an application specific integrated circuit (ASIC), and/or a field programmable gate array (FPGA). The term “logic” may include, by way of example, software or hardware and/or combinations of software and hardware.
Embodiments may be provided, for example, as a computer program product which may include one or more machine-readable media having stored thereon machine-executable instructions that, when executed by one or more machines such as a computer, network of computers, or other electronic devices, may result in the one or more machines carrying out operations in accordance with embodiments described herein. A machine-readable medium may include, but is not limited to, floppy diskettes, optical disks, CD-ROMs (Compact Disc-Read Only Memories), and magneto-optical disks, ROMs, RAMs, EPROMs (Erasable Programmable Read Only Memories), EEPROMs (Electrically Erasable Programmable Read Only Memories), magnetic or optical cards, flash memory, or other type of media/machine-readable medium suitable for storing machine-executable instructions.
Moreover, embodiments may be downloaded as a computer program product, wherein the program may be transferred from a remote computer (e.g., a server) to a requesting computer (e.g., a client) by way of one or more data signals embodied in and/or modulated by a carrier wave or other propagation medium via a communication link (e.g., a modem and/or network connection).
References to “one embodiment”, “an embodiment”, “example embodiment”, “various embodiments”, etc., indicate that the embodiment(s) so described may include particular features, structures, or characteristics, but not every embodiment necessarily includes the particular features, structures, or characteristics. Further, some embodiments may have some, all, or none of the features described for other embodiments.
In the following description and claims, the term “coupled” along with its derivatives, may be used. “Coupled” is used to indicate that two or more elements co-operate or interact with each other, but they may or may not have intervening physical or electrical components between them.
As used in the claims, unless otherwise specified the use of the ordinal adjectives “first”, “second”, “third”, etc., to describe a common element, merely indicate that different instances of like elements are being referred to, and are not intended to imply that the elements so described must be in a given sequence, either temporally, spatially, in ranking, or in any other manner.
The following clauses and/or examples pertain to further embodiments or examples. Specifics in the examples may be used anywhere in one or more embodiments. The various features of the different embodiments or examples may be variously combined with some features included and others excluded to suit a variety of different applications. Examples may include subject matter such as a method, means for performing acts of the method, at least one machine-readable medium including instructions that, when performed by a machine cause the machine to performs acts of the method, or of an apparatus or system for facilitating hybrid communication according to embodiments and examples described herein.
Some embodiments pertain to Example 1 that includes an apparatus to facilitate echo watermarking and filtering, comprising: watermark assignment logic to assign a watermark to a communication signal, wherein the watermarked communication signal transforms into a watermarked echo upon exiting the apparatus; echo monitoring and reception logic to receive the watermarked echo; filtering and processing logic to filter the watermarked echo such that the watermarked echo is cancelled out of a final signal; and communication/compatibility logic to transmit the final signal that is free of the watermarked echo.
Example 2 includes the subject matter of Example 1, further comprising signal detection and evaluation logic to detect the communication signal, wherein the signal detection and evaluation logic is further to evaluate the detected communication signal as having a capacity to be transformed into the watermarked echo upon exiting the apparatus into the air, wherein the watermarked communication signal exits through a listening device including a loudspeaker.
Example 3 includes the subject matter of Example 1, wherein the echo monitoring and reception logic is further to continuously monitor the watermarked echo while the watermarked echo is in the air prior to its reception at the apparatus via a speaking device including a microphone.
Example 4 includes the subject matter of Example 1 or 3, further comprising watermark detection logic to detect the watermarked echo upon its reception via the speaking device, wherein the watermark detection logic is further to separate the detected watermarked echo from one or more sounds received via the speaking device.
Example 5 includes the subject matter of Example 4, wherein the one or more sounds comprises one or more of a first sound including a voice spoken into the speaking device by a user, and a second sound including noise being generated within a proximity of the speaking device, wherein the noise includes one or more of traffic noise, human chatter, music, and street noise.
Example 6 includes the subject matter of Example 1, wherein the watermark assignment logic is further to detection a plurality of segments relating to the communication signal, wherein each segment of the plurality of segments refers to a frequency brand, wherein the watermark assignment logic is further to assign the watermark to one or more of the plurality of segments.
Example 7 includes the subject matter of Example 6, wherein the communication signal is completely watermarked if each segment of the plurality of segments is assigned the watermark, wherein the communication signal is partially watermarked if one or more of the plurality of segments are assigned the watermarked, and wherein the communication signal is not watermarked if the plurality of segments is not assigned the watermark.
Example 8 includes the subject matter of Example 1 or 6, wherein filtering further comprises filtering out the plurality of segments to cancel out the watermarked echo from the final signal, wherein each of the plurality of segments is assigned the watermark.
Example 9 includes the subject matter of Example 1 or 6, wherein filtering further comprises filtering out one or more of the plurality of segments to partially cancel out the watermarked echo from the final signal, wherein the one or more of the plurality of segments include the watermarked one or more of the plurality of segments.
Example 10 includes the subject matter of Example 1 or 6, wherein filtering further comprises allowing the watermarked echo to remain within the final signal.
Some embodiments pertain to Example 11 that includes a method for facilitating echo watermarking and filtering, comprising: assigning a watermark to a communication signal, wherein the watermarked communication signal transforms into a watermarked echo upon exiting a computing device; receiving the watermarked echo; filtering the watermarked echo such that the watermarked echo is cancelled out of a final signal; and transmitting the final signal that is free of the watermarked echo.
Example 12 includes the subject matter of Example 11, further comprising: detecting the communication signal; and evaluating the detected communication signal as having a capacity to be transformed into the watermarked echo upon exiting the computing device into the air, wherein the watermarked communication signal exits through a listening device including a loudspeaker.
Example 13 includes the subject matter of Example 11, further comprising continuously monitoring the watermarked echo while the watermarked echo is in the air prior to its reception at the computing device via a speaking device including a microphone.
Example 14 includes the subject matter of Example 13, further comprising: detecting the watermarked echo upon its reception via the speaking device; and separating the detected watermarked echo from one or more sounds received via the speaking device.
Example 15 includes the subject matter of Example 14, wherein the one or more sounds comprises one or more of a first sound including a voice spoken into the speaking device by a user, and a second sound including noise being generated within a proximity of the speaking device, wherein the noise includes one or more of traffic noise, human chatter, music, and street noise.
Example 16 includes the subject matter of Example 11, further comprising detecting a plurality of segments relating to the communication signal, wherein each segment of the plurality of segments refers to a frequency brand, wherein watermark is assigned to one or more of the plurality of segments.
Example 17 includes the subject matter of Example 16, wherein the communication signal is completely watermarked if each segment of the plurality of segments is assigned the watermark, wherein the communication signal is partially watermarked if one or more of the plurality of segments are assigned the watermarked, and wherein the communication signal is not watermarked if the plurality of segments is not assigned the watermark.
Example 18 includes the subject matter of Example 11, wherein filtering further comprises filtering out the plurality of segments to cancel out the watermarked echo from the final signal, wherein each of the plurality of segments is assigned the watermark.
Example 19 includes the subject matter of Example 11, wherein filtering further comprises filtering out one or more of the plurality of segments to partially cancel out the watermarked echo from the final signal, wherein the one or more of the plurality of segments include the watermarked one or more of the plurality of segments.
Example 20 includes the subject matter of Example 11, wherein filtering further comprises allowing the watermarked echo to remain within the final signal.
Example 21 includes at least one machine-readable medium comprising a plurality of instructions that in response to being executed on a computing device, causes the computing device to carry out operations according to any one of the aforementioned examples 11 to 20.
Example 22 includes at least one non-transitory or tangible machine-readable medium comprising a plurality of instructions that in response to being executed on a computing device, causes the computing device to carry out operations according to any one of the aforementioned examples 11 to 20.
Example 23 includes a system comprising a mechanism to carry out operations according to any one of the aforementioned examples 11 to 20.
Example 24 includes an apparatus comprising means to carry out operations according to any one of the aforementioned examples 11 to 20.
Example 25 includes a computing device arranged to carry out operations according to any one of the aforementioned examples 11 to 20.
Example 26 includes a communications device arranged to carry out operations according to any one of the aforementioned examples 11 to 20.
Some embodiments pertain to Example 27 includes a system comprising a storage device having instructions, and a processor to execute the instructions to facilitate a mechanism to perform one or more operations comprising: assigning a watermark to a communication signal, wherein the watermarked communication signal transforms into a watermarked echo upon exiting a computing device; receiving the watermarked echo; filtering the watermarked echo such that the watermarked echo is cancelled out of a final signal; and transmitting the final signal that is free of the watermarked echo.
Example 28 includes the subject matter of Example 27, wherein the one or more operations comprise detecting the communication signal; and evaluating the detected communication signal as having a capacity to be transformed into the watermarked echo upon exiting the computing device into the air, wherein the watermarked communication signal exits through a listening device including a loudspeaker.
Example 29 includes the subject matter of Example 27, wherein the one or more operations comprise continuously monitoring the watermarked echo while the watermarked echo is in the air prior to its reception at the computing device via a speaking device including a microphone.
Example 30 includes the subject matter of Example 29, wherein the one or more operations comprise detecting the watermarked echo upon its reception via the speaking device; and separating the detected watermarked echo from one or more sounds received via the speaking device.
Example 31 includes the subject matter of Example 30, wherein the one or more sounds comprises one or more of a first sound including a voice spoken into the speaking device by a user, and a second sound including noise being generated within a proximity of the speaking device, wherein the noise includes one or more of traffic noise, human chatter, music, and street noise.
Example 32 includes the subject matter of Example 27, wherein the one or more operations comprise detecting a plurality of segments relating to the communication signal, wherein each segment of the plurality of segments refers to a frequency brand, wherein watermark is assigned to one or more of the plurality of segments.
Example 33 includes the subject matter of Example 32, wherein the communication signal is completely watermarked if each segment of the plurality of segments is assigned the watermark, wherein the communication signal is partially watermarked if one or more of the plurality of segments are assigned the watermarked, and wherein the communication signal is not watermarked if the plurality of segments is not assigned the watermark.
Example 34 includes the subject matter of Example 27, wherein filtering further comprises filtering out the plurality of segments to cancel out the watermarked echo from the final signal, wherein each of the plurality of segments is assigned the watermark.
Example 35 includes the subject matter of Example 27, wherein filtering further comprises filtering out one or more of the plurality of segments to partially cancel out the watermarked echo from the final signal, wherein the one or more of the plurality of segments include the watermarked one or more of the plurality of segments.
Example 36 includes the subject matter of Example 27, wherein filtering further comprises allowing the watermarked echo to remain within the final signal.
Some embodiments pertain to Example 37 includes an apparatus comprising: means for assigning a watermark to a communication signal, wherein the watermarked communication signal transforms into a watermarked echo upon exiting a computing device; means for receiving the watermarked echo; means for filtering the watermarked echo such that the watermarked echo is cancelled out of a final signal; and means for transmitting the final signal that is free of the watermarked echo.
Example 38 includes the subject matter of Example 37, further comprising: means for detecting the communication signal; and means for evaluating the detected communication signal as having a capacity to be transformed into the watermarked echo upon exiting the computing device into the air, wherein the watermarked communication signal exits through a listening device including a loudspeaker.
Example 39 includes the subject matter of Example 37, further comprising continuously monitoring the watermarked echo while the watermarked echo is in the air prior to its reception at the computing device via a speaking device including a microphone.
Example 40 includes the subject matter of Example 39, further comprising means for detecting the watermarked echo upon its reception via the speaking device; and means for separating the detected watermarked echo from one or more sounds received via the speaking device.
Example 41 includes the subject matter of Example 40, wherein the one or more sounds comprises one or more of a first sound including a voice spoken into the speaking device by a user, and a second sound including noise being generated within a proximity of the speaking device, wherein the noise includes one or more of traffic noise, human chatter, music, and street noise.
Example 42 includes the subject matter of Example 37, further comprising means for detecting a plurality of segments relating to the communication signal, wherein each segment of the plurality of segments refers to a frequency brand, wherein watermark is assigned to one or more of the plurality of segments.
Example 43 includes the subject matter of Example 32, wherein the communication signal is completely watermarked if each segment of the plurality of segments is assigned the watermark, wherein the communication signal is partially watermarked if one or more of the plurality of segments are assigned the watermarked, and wherein the communication signal is not watermarked if the plurality of segments is not assigned the watermark.
Example 44 includes the subject matter of Example 37, wherein the means for filtering further comprises means for filtering out the plurality of segments to cancel out the watermarked echo from the final signal, wherein each of the plurality of segments is assigned the watermark.
Example 45 includes the subject matter of Example 37, wherein the means for filtering further comprises means for filtering out one or more of the plurality of segments to partially cancel out the watermarked echo from the final signal, wherein the one or more of the plurality of segments include the watermarked one or more of the plurality of segments.
Example 46 includes the subject matter of Example 37, wherein the means for filtering further comprises means for allowing the watermarked echo to remain within the final signal.
The drawings and the forgoing description give examples of embodiments. Those skilled in the art will appreciate that one or more of the described elements may well be combined into a single functional element. Alternatively, certain elements may be split into multiple functional elements. Elements from one embodiment may be added to another embodiment. For example, orders of processes described herein may be changed and are not limited to the manner described herein. Moreover, the actions any flow diagram need not be implemented in the order shown; nor do all of the acts necessarily need to be performed. Also, those acts that are not dependent on other acts may be performed in parallel with the other acts. The scope of embodiments is by no means limited by these specific examples. Numerous variations, whether explicitly given in the specification or not, such as differences in structure, dimension, and use of material, are possible. The scope of embodiments is at least as broad as given by the following claims.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2014/012119 | 1/17/2014 | WO | 00 |