This invention relates to a method and apparatus for enhanced stimulation of the limbic response to audio signals. More specifically, the invention results in an identifiable physiological effect through technical means of sound production.
Music in its many forms is recognized as one of the great sources of pleasure for mankind. The phrase “music to my ears” is understood to generalize to any welcome sensory input. The lullabies of mothers are the first experience of the power of music to soothe for newborns, and empowerment for mothers and fathers. The power of music to soothe humans even when brains are at the very earliest stages of development is never lost.
Music has been recognized as a source of emotional comfort at times of major loss. Thus, requiems such as those of Mozart or Verdi, as well as the chants of Gregorian monks and singers from many religions, are recognized for their power to diminish the sense of loss and vulnerability in those who have experienced the death of beloved relatives or friends, and to relieve anxiety by creating a sense of community and link to powerful historical forces.
The therapeutic benefits of music have been acknowledged for centuries by many cultures and religions. The power of music to facilitate healing sick is recognized by the discipline of music therapy, which is now well-established as of provable benefit to many who are ill, including those with coronary artery heart disease and serious mental disorders, such as major depression and schizophrenia.
Music from a variety of genres, including jazz, blues, rock, opera, classical, country, bluegrass, folk, and heavy metal, is a highly valued way to experience pleasure. Extensive scientific research in the last 50 years has established that pleasure results from stimulating activity in specific areas of the medio-temporal lobes of the brain known as the limbic system. The limbic system is a key part of the human neural apparatus, as it enables us to respond emotionally and cognitively to various stimuli, threatening as well as pleasure-giving, in the environment.
The limbic system is a set of brain structures, including the hippocampus, amygdala, anterior thalamic nuclei, and limbic cortex, which support a variety of functions, including emotion, behavior, long term memory, and olfaction. For most, the pleasure experienced from listening to music, whether live or recorded, and the capacity of music to make the listener feel, think and remember its special qualities, results from the individual's limbic system response. However, music also can sometimes be aversive because of subjective responses to its nature as combinations of sounds based on tonalities, timing, and rhythms, painful associations of an idiosyncratic nature with the music, of aspects of its production, e.g. volume, repetition, and, finally, the quality of the recorded sound and its reproduction by man-made equipment.
Accordingly, what is needed is a method, and accompanying apparatus, to enhance the stimulation of the limbic system response in listeners of recorded audio signals, and produce an identifiable physiological effect through technical means of sound production.
Various exemplary embodiments of the present invention, as described below, are directed to the optimization of sound production so as to achieve limbic and cortical arousal, leading to the experiences of authenticity and pleasure. This includes, but is not limited to, sound production through loudspeakers.
In one embodiment, the present invention comprises the use of a resistance-controlled (or partially mass-controlled) woofer system, a mass-controlled (or partially resistance-controlled) midrange system, and a resistance-controlled tweeter system. This system may further comprise crossover networks of a particular configuration. By use of unsymmetrical networks of low order, it is possible to obtain a complete system which exhibits flat delay response.
In addition, in the middle and high-frequency ranges the correct combination (or combinations) of these elements will result in an electrical input impedance to the system which is relatively independent of frequency in both magnitude and phase. This improves the sound reproduction because many types of power amplifiers used to drive a loudspeaker system may be adversely affected by a widely varying load impedance (as presented by the loudspeaker). The specific performance degradation in the power amplifier will affect both the transient response and the frequency response. Flat magnitude and phase of the loudspeaker impedance will reduce or eliminate these problems.
Another exemplary embodiment of a loudspeaker system with these elements comprises one or more bending-wave transducers, one or more mid-range transducers, and two woofers in opposition. The bending-wave transducers and mid-range transducers may optionally be placed in a line-array.
The reproduction of music by means of technical apparatus and procedures should stimulate both the autonomic and cognitive centers of the brain. This is because, while the limbic (autonomic) response is immediate, it is quickly followed by a cognitive response as well. Failure to engage the limbic response results in reduction of emotional connection in the listener with the music. While limbic system arousal is essential to the ability of music to arouse emotions in listeners, the limbic system utilizes the frontal cortex of the brain to process the experience. Cortical regions of the brain enable the listener to understand and evaluate the complexity of music in concert with the hippocampus of the limbic system, where long term memory storage is mainly located. This combination allows the listener to compare current and past performance of the same and different performances of the same music, with different music, or with specific events associated with the music. Indeed, as shown in
The sounds of live music impacts the listener with immediacy and, in most circumstances, will decay rapidly, leaving a true music lover with a unique feeling of an authentic aesthetic experience. Non-live music, because it must be played through electronic devices, e.g., a CD player, amplifiers, loudspeakers, or ear phones, often deprives the listener of the feeling of authenticity. The listener often knows that listening to recorded music is a derivative experience, with much of the content of the live performance missing.
For recorded music to produce as close to optimal pleasure as possible, it must stimulate the limbic system and also activate higher cortical areas of the brain. The listener can then make judgments and integrate emotional and cognitive information to experience something close to an authentic listening experience. This results from immediate and, within the right range, intense stimulation of the limbic response. Thinking about music, without limbic arousal, cannot produce the pleasure which comes from hearing it and having it arouse limbic system chemical and electrical changes, which are believed to be mediated by the neurotransmitter, dopamine. Dopamine is the primary pleasure chemical which the limbic system is geared to produce in the right amounts at the right time. Music stimulates the release of dopamine, as well as other pleasure enhancers such as the endorphins.
There is a threshold for the experience of authenticity in listening to reproduced sound which must be met and exceeded in order to stimulate the limbic system response effectively. This is a function of the ability of the electronic sound reproduction system to reproduce the intensity, color, timbre, timing, and multidirectional nature of the sound the listener experiences in the live music setting. The quality of the loudspeaker which sends sound waves to the listener is a critical component of effort to achieve authenticity through limbic arousal.
A surrogate marker or indicator for the limbic response is the measurement of physiological responses in the body and brain, such as skin conductance, heart rate, and changes in the EEG. Limbic response can also be measured by changes in brain activity using modern magnetic resonance imaging methods; however, this is very costly. Studies have been performed comparing limbic system arousal with music produced by the linear Pace Conditioning Mode (PCM), which is digitized music. Music which is generated through an MP3-encoded version of the same music always fails to evoke as great a physiological response, as demonstrated by heart rate, galvanic skin response, and other measures.
Various exemplary embodiments of the present invention, as described below, are directed to the optimization of sound production so as to achieve limbic and cortical arousal, leading to the experiences of authenticity and pleasure. This includes, but is not limited to, sound production through loudspeakers.
A limbic response to sounds or audio signals can be stimulated in a variety of ways. First, one of the more obvious stimulants is “suddenness.” This evokes what is described in psychological research as the “startle” response or reflex. “Suddenness” in a sound can be found, for example, in a gunshot, the snap of a twig, or the clap of hands. These may be described technically as “transient” sounds, as distinct from continuous or “steady-state” sounds. For music, examples may include the clang of a symbol or the sound of a violin string being bowed and then abruptly stopping its vibration.
A second stimulant is “loudness” (i.e., high volume sounds), which may combine with suddenness. A gunshot, for example, combines loudness and suddenness. Intensity at close range, such as standing near a passing train or in front of the speakers at a loud rock concert, can evoke a sense of being overwhelmed or of great danger, and result in an unpleasant, frightening, or even painful experience.
In contrast, a third stimulant may be “softness” (i.e., low intensity sounds). “Softness” may cause heightened attention, such as in listening for the approach of a predator, or straining to hear a sound played gently. Softness can evoke a soothing or calming response, but also unpleasant over stimulation. The range of loudness to softness is described technically as “dynamic range”.
Spectrum, or the distribution of sounds with respect to frequency, is another key dimension. The middle frequencies, which are occupied by the human voice, are strongly related to both limbic and cognitive response in humans (e.g., hearing and responding to maternal and paternal voices). The extreme frequencies, both high and low, are more strongly related to the limbic response. For example, the driving beat of music, the footfalls of marching soldiers, the rumble of a vehicle all involve frequencies below the range of the human voice. In contrast, transient sounds, such as those mentioned above, are rich in higher frequencies above the range of the human voice.
The technological art of recording and reproducing sound is based upon both what is objectively measurable and what is subjectively describable. Objective measurements are useful as a tool for improving sound recording and reproducing devices in order to establish basic technical characteristics. However, the measurements do not completely capture the resulting sound quality or capacity to produce pleasure. Subjective description, by definition, requires cognitive processing. The widespread use of jury-based comparative ratings in the audio field is based upon cognitive processing. This is subjective and will sometimes produce disagreement among experts, causing some to question the value or even the validity of comparative listening tests, blind or otherwise. Objective measurements of reproduced sound and cognitive judgment have the potential to facilitate sound that is emotionally-involving.
In one exemplary embodiment, the present invention comprises a method, and related apparatus, for the optimization of transient reproduction, dynamic range, and spectral extent. These components are closely linked, although their optimization is not always congruent.
With regard to transient reproduction, it has been thought that the criterion for good transient response is wide frequency response. This comes from the Fourier transform which establishes the relationship between time and frequency for linear time-invariant systems. However, loudspeaker systems operating in real rooms are not linear, time-invariant systems. Instead, embodiments of the present invention use the simple equation F=ma, force equals mass times acceleration. Transient sounds are characterized by the rapid acceleration of the air by some physical object. In the case of loudspeakers, it is the diaphragm of the loudspeaker which must be accelerated to move the air, thus producing the sound. Since a=F/m, it follows that to have high acceleration in order to accurately reproduce transient sounds, the mass of the diaphragm must be very low, and the force available to move it must be very high. In fact, music and other sounds are discontinuous, resulting in “jerk,” which is the derivative of acceleration (i.e., the rate of change of acceleration), just as acceleration is the derivative of velocity.
The mass of the speaker diaphragm may be reduced by simply making it smaller. Unfortunately, this increases the radiation resistance to the point where it is not possible to impart enough acoustic power to the air to obtain the required loudness. Radiation resistance is proportional to wavelength (and inversely proportional to frequency), so the loudspeaker system is divided into parts. A large diaphragm is used for the low frequencies, in order to radiate enough power. A smaller diaphragm can be used for the middle frequencies. For the high frequencies, it is usually not sufficient to simply further reduce the diaphragm size, and some other approach must be used. This is because the range of human hearing covers a ratio of about 1000:1 in wavelength, and it is clearly not practical that the reproducers (i.e., diaphragms) would span that range of physical size. As a result, multi-way loudspeaker systems, such as shown in
With regard to dynamic range, the dynamic range of a loudspeaker is the range from the softest sound it will reproduce to the loudest sound it will reproduce. The response generally is linear over the whole dynamic range. That is, a given increase in the electrical input produces the same increase in the acoustic output. When this is not the case there is said to be compression.
There are two primary compression mechanisms, both of which should be avoided in loudspeaker construction. The first is instantaneous compression, which is due to the motor of the speaker having a non-linear reduction in force near the limits of its excursion. The second is long-term compression, which is usually thermal in origin. Here, the voice-coil of the motor heats up and its resistance rises. Since force is proportional to current, the increasing resistance diminishes the available force.
The upper end of the dynamic range (i.e., the highest acoustic power) is limited not only by the motor, but also by the ability of the diaphragm to withstand the accelerative forces. This is why the diaphragm cannot be too light. In the woofer (low frequency) and midrange drivers, this can be addressed by proper selection of diaphragm material and geometry. However, heavy woofer diaphragms and soft midrange diaphragms do not lead to good transient reproduction, as described above.
For a tweeter (high frequency), one solution is to use numerous tweeters arranged in a line. For a given sound pressure, the required acceleration from each tweeter is reduced according to the number of tweeters. Another approach abandons the attempt at unitary motion of the diaphragm in favor of the propagation of a bending wave.
With regard to spectral extent, this is often referred to as frequency response in connection with loudspeaker technology. In the context of the present invention, however, it has much greater implications. Frequency response is customarily defined as the sound pressure amplitude on some specified axis, usually perpendicular to the front panel, at a specified distance, as a function of frequency. The measurement is usually performed at a specified input voltage so that the voltage sensitivity may also be obtained. This is generally what is called a small-signal characteristic.
It is also necessary to insure that the frequency response is maintained dynamically. That is, it must not change as a function of loudness. This is a requirement for good dynamic range.
The frequency response also should be maintained spatially. As shown in
A loudspeaker apparatus in accordance with an exemplary embodiment of the present invention comprises the simultaneous application of numerous techniques as described below.
For low-frequency sounds, a woofer system is implemented in one or more configurations based on physical size and acoustic output. In general, the woofer systems of various embodiments are arranged so that the fundamental resonance frequency (fs) is at the upper end of the operating frequency range. Because of this, the system is stiffness-controlled rather than mass-controlled. When the system is stiffness-controlled, the response is not flat but rather decreases monotonically with frequency at a rate of 40 dB/decade. When this response is equalized by a biquadratic network with equal and opposite response, by superposition the reactances cancel. The woofer is therefore operating resistively over the range of interest. This results in flatter group delay which corresponds to superior transient response, as shown in
Further, in one exemplary embodiment, as seen in
For mid-frequency sounds, the “limbic” optimization of the midrange reproducer is performed, in order, for the following: (1) transient response; (2) dynamic range; and (3) frequency response.
The mass of the moving parts is reduced as much as possible through the use of lightweight but stiff diaphragm material, and a low-mass voice-coil former and winding. Electrical inductance in the voice-coil causes two problems. First, this inductance reflected to the mechanical system is indistinguishable from mass. Second, this inductance, and therefore its reactance, tends to be modified by the instantaneous voice-coil position. This results in signal-dependent amplitude-modulation of high frequencies when strong low frequencies are simultaneously being reproduced. This is called amplitude intermodulation distortion and it is very audible. When it is reduced or eliminated, the sound is perceived as being less congested and more clear.
As shown in
In another embodiment, the thickness of the shorting ring 22 should be made as thin or as small as possible. The shorting ring conducts significant current at high frequencies, and if its AC resistance is too high, it will not be effective.
The above solution with the widened gap requires more magnet material to overcome the increased reluctance in the gap, and thus increases expense.
The dynamic range optimization comprises of two parts. First, the linear excursion of the motor (i.e., the length of the stroke with uniform force) must be great enough to support the required diaphragm excursion to the lowest frequency of interest. This avoids instantaneous compression. Second, the sensitivity of the speaker must be high enough that the highest required acoustic output will not result in significant heating of the voice-coil. This, combined with adequate ventilation of the voice-coil, avoids thermal compression.
In one embodiment, the frequency optimization cannot be done in the loudspeaker unit itself. The first two optimizations result in a non-flat frequency response which must be corrected in the frequency-dividing (crossover) network, as previously shown in
Depending on the total dynamic-range requirements of the system, several midrange drivers 40 as described may be used in a line-array, as seen in
With regard to high frequency optimization, the primary difficulties are extension of frequency response, and production of sufficient acoustic power output. In a conventional tweeter, the diaphragm diameter is about one inch and the voice-coil is placed at the outer diameter. This is conventionally known as a “dome” tweeter. Such a design will not produce enough acoustic power due to deformation of the diaphragm during the very high accelerations. One solution is to use more than one such tweeter, usually many more, arranged in a line-array, as seen in
An alternative method of high-frequency reproduction is possible in the form of a bending-wave transducer 50, shown in
Another advantage of this type of transducer is that the area of the film diaphragms 52 can be quite large. Because the bending wave produces motion perpendicular to the surface, the acoustic radiation efficiency is quite high. This has the advantage that very little electric power is required in the motor so very little heat is produced. As a result, there is essentially no thermal compression.
The bending-wave transducer 50 operates in the resistive domain rather than the mass-controlled domain of conventional direct-radiator tweeters. This causes the acoustic output to be in-phase with the electrical input, rather than lagging in quadrature. As with the tweeter above, further advantage can be realized by using several of the transducers, as shown in
A bending-wave transducer also may be used for the midrange. Similarly, the electromagnetic mechanical advantage of the mass-reducing inductance-lowering features described for the midrange above may also be used with the woofer system. Digital signal processing also may be used on the woofer system to reduce size and weight.
The use of a resistance-controlled (or partially mass-controlled) woofer system, a mass-controlled (or partially resistance-controlled) midrange system, and a resistance-controlled tweeter system requires crossover networks of a particular configuration. By use of unsymmetrical networks of low order, it is possible to obtain a complete system which exhibits flat delay response. In one embodiment, this is a fundamental requirement for good transient reproduction because flat delay means that the various elements of a transient sound are preserved in their original time relationships. This may be observed in several ways. First, flat delay corresponds to flat phase response after the causal delay has been removed from the system (see
In addition, in the middle and high-frequency ranges the correct combination (or combinations) of these techniques will result in an electrical input impedance to the system which is relatively independent of frequency in both magnitude and phase. This improves the sound reproduction because many types of power amplifier used to drive a loudspeaker system may be adversely affected by a widely varying load impedance (as presented by the loudspeaker). The specific performance degradation in the power amplifier will affect both the transient response and the frequency response. Flat magnitude and phase of the loudspeaker impedance will reduce or eliminate these problems.
Another exemplary embodiment of a loudspeaker system 60 with these elements as described above is shown in
It should be understood that the embodiments and examples described herein have been chosen and described in order to best illustrate the principles, methods, and processes of the invention and its practical applications to thereby enable one of ordinary skill in the art to best utilize the invention in various embodiments and with various modifications as are suited for particular uses contemplated. Even though specific embodiments of this invention have been described, they are not to be taken as exhaustive. There are several variations that will be apparent to those skilled in the art.
This application claims benefit of and priority to U.S. Provisional Application Nos. 61/164,482, filed Mar. 30, 2009, and 61/179,078, filed May 18, 2009, and is entitled to those filing dates in whole or in part for priority. The specification, figures and complete disclosure of U.S. Provisional Application Nos. 61/164,482 and 61/179,078 are incorporated herein by specific reference for all purposes.
Number | Date | Country | |
---|---|---|---|
61164482 | Mar 2009 | US | |
61179078 | May 2009 | US |