The present invention relates to a process for adjusting the sound volume of a digital sound recording reproduced by an item of equipment. This process is essentially intended to be used during the reproduction of a digital recording in the form of a data file by means of a sound card, for example, of an audiovisual reproduction system, such as a jukebox.
In the prior art, it is known that digital recordings, such as compact disks (CD), are not reproduced with the same sound volume for a specified sound setting level. This is essentially due to the type of music and the way in which the piece of music was recorded. Indeed, a sound frame is composed of an electrical signal comprising a succession of oscillations and peaks. Each peak corresponds to a voltage value. The higher the voltage in terms of absolute value, the higher the volume and the higher the slope of the signal variation, the higher the frequency of the sound reproduced. When such a recording is recorded in the form of a digital file and then reproduced on a sound system by means of a digital sound card on a computer, the same maximum variation phenomena are observed since the data contained in the file is approximately the same as that recorded on a CD. Consequently, between two recordings of different types of music, it is necessary to modify the sound level setting between two recordings, to obtain a reproduction with the same sound level for two different recordings with different original sound levels.
Therefore, the purpose of the present invention is to remedy the disadvantages of the prior art by proposing a process for adjusting the sound level of a digital sound recording making it possible to obtain identical sound levels in different recordings, irrespective of the differences in the digital sound recording level existing initially between each of the recordings.
This purpose is achieved by the fact that the process comprises:
According to another feature, the maximum amplitude value determination step comprises:
According to another feature, n is determined so that the degradation of the reproduction quality of the recording is not perceptible to the human ear.
According to another feature, n is of the order of 10 and preferably equal to 4 or 5.
According to another feature, the maximum amplitude value determination step comprises:
According to another feature, the maximum amplitude value determination step comprises:
According to another feature, the psycho-acoustic mask(s) is/are applied using the MPEG-1 Layer 3 process.
According to another feature, the reproduction step comprises a dynamic reproduction sound level adjustment step on the recording consisting of authorising a specified gain for the low-pitched and/or high-pitched sounds in the recording, the gain corresponding approximately to the attenuation applied during the reproduction of the recording.
Another purpose of the invention consists of a use of the process according to the invention in an audiovisual reproduction system, such as a digital jukebox.
This purpose is achieved by the fact that the automatic volume adjustment process is used on a digital audiovisual reproduction system, this use being characterised in that the recording is stored in memory in the reproduction system with the corresponding calculated gain and audiovisual reproduction system reading means giving access to the gain value to control the gain circuits of the digital signal processing processor of the digital audiovisual reproduction system to adjust the sound level accordingly.
Other features and advantages of the present invention will be understood more clearly upon reading the description below with reference to the appended drawings, wherein:
Before starting the description of the invention, it is necessary to give some notes on digital recording. First of all, sound reproduction by a loud speaker consists of applying voltages of specified levels to said loud speaker, according to a specified frequency to vibrate a membrane and, therefore, produce the sound corresponding to the specified frequency. For a given amplification value, the root mean square voltage value defines the sound volume or sound level.
A sound frame, represented in
In this way, depending on the type of music, the curve C representing the frequency of the reproduced sound, defined by the slope of the curve C and the corresponding voltage value of the maximum sound levels, for the same sound amplification circuit setting, the output level of the loud speakers will be different. Indeed, the maximum root mean square voltages observed for a first recording will not necessarily be of the same order as the maximum root mean square voltages observed for a second recording. Therefore, the purpose of the invention is to provide a solution for this disadvantage such that, between two recordings, the volume or sound level perceived by the listener is automatically adjusted so that the sound level is the same from one recording to another.
The invention requires, firstly, a preliminary analysis of each recording liable to be reproduced on an audiovisual reproduction system or on a computer and, secondly, a correction of the amplification level during the sound reproduction of the recording, according to the analysis.
A first solution consists of searching, in absolute values, the maximum voltage observed on each recording, and using this value to amplify the recordings such that, for a specified sound level setting, this values reaches the same voltage value for all the recordings. However, a sound frame of a recording comprises sounds with frequencies that are both audible and inaudible for the human ear. In this way, if the maximum amplitude corresponds to an inaudible frequency, the adjustment of the volume will not be adapted.
Therefore, the process according to the invention consists, in a first step, of determining, for a recording, the maximum amplitude only for frequencies audible for the human ear. In a first embodiment variant, this maximum amplitude is determined by analysing the digital recording to classify the number of samples of the recording for each amplitude, in increasing order of amplitude, in absolute values. This classification is represented in
Empirically, it is observed that a recording corresponding to a song only comprises a few samples, of the order of ten, located in the portion B of the curve C1, with the highest amplitudes in the recording. In this way, the portion B of the curve C1 is represented with dashes to show that all the values of the numbers representing the voltages of the corresponding analogue signal are not represented. Similarly, it is observed that 90% of the samples of a recording have a low amplitude, i.e. located in the portion A of the curve C1.
According to the invention, the maximum amplitude is selected, in the classification carried out, as the amplitude n ranks less with reference to the rank of the maximum amplitude sample of the recording. In other words, if 1 corresponds to the rank of the number representing the amplitude and K is the rank of the number representing the maximum amplitude found on the digital recording, then the amplitude selected as the maximum amplitude for the process corresponds to the rank number K-n, from the classification defined and corresponding to the curve C1. In this way, the n−1 samples, located on portion B of the curve C1 are not taken into account, using the maximum amplitude as a basis, implying that these samples do not appear in the final reproduction. Then, the recording volume correction, i.e. the possible volume gain Gv for the recording is determined by applying the following formula:
Gv=20log(A2/Am) a
In this formula, A2 is the selected amplitude and Am is the maximum amplitude of the recording.
In practice, the higher the value of n, the more degraded the recording reproduction quality. Indeed, the higher the value of n, the higher the number of high-amplitude samples that will not be taken into account, and the higher the probability of the samples not taken into account corresponding to audible signals. Consequently, when the gain calculated using the above formula is applied to the recording, some sound frequencies will be over-amplified, resulting in a saturation phenomenon on the loud speakers and, therefore, in a degradation of the reproduction quality. It has been observed that a value of n of the order of 10, preferably equal to 4 to 5, does not induce a perceptible degradation during the reproduction of the recording after applying the gain calculated using the formula above. This variant can only be applied effectively to digital recordings that have not undergone prior compression or processing aiming to optimise the volume level.
On the basis of the classification carried out above, another variant for determining the value of the selected amplitude may be carried out. According to this variant, the value of the selected amplitude corresponds to the mean value Amean of the n′ highest amplitudes occurring at least k′ times in the recording. Then, the value of the possible volume gain Gv for the recording is determined by applying the formula a above, replacing A2 by Amean.
The experiment showed that, by choosing n′ equal to 2 and k′ equal to 4, the sound recording reproduction did not show any degradation audible for the human ear. The higher the values of n′ and k′, the higher the degradation of the sound recording reproduction.
For recordings having undergone optimisation processing, the determination step of the maximum amplitude for audible signals consists of compressing the recording according to a compression process using at least one psycho-acoustic mask making it possible to eliminate inaudible sounds from the recording. For example, it is possible to use the known MPEG-1 Layer 3 process or any other compression process such as AAC. Indeed, it is known that the MPEG compression process uses masks to eliminate any unnecessary data from the recording. The unnecessary data in the sound recording includes all the inaudible frequencies and all the sound variations which are not perceptible to the human ear. Then, the recording is decompressed and the value of the maximum amplitude is located in this decompressed recording. In this way, during the decompression, the decompressed recording only contains audible frequency sounds. Searching the maximum amplitude in this decompressed recording does not necessarily produce a maximum amplitude Am for an audible frequency. In this embodiment variant, it is also advisable to store in memory before compression, the maximum amplitude of the recording, for all frequencies combined, in order to be able to calculate the gain according to the formula a. This second embodiment variant may be applied to any type of recording, since the MPEG compression process is indifferent to the initial recording type.
The gain value calculated by means of the formula a is then stored in memory with the sound recording produced, for example, on a server or on the audiovisual reproduction system, and used during the recording reproduction by the reproduction system. Indeed, during the reproduction of the initial digital recording, the gain calculated for this recording is added during the sound setting.
The process according to the invention is particularly used when digital recordings are reproduced by means of a sound card of a computer or an audiovisual data reproduction system. Therefore, the process according to the invention requires having determined the gain either arbitrarily or using a preliminary analysis of each recording liable to be reproduced by the sound card. As described above, this analysis consists of determining the gain liable to be applied to each recording during its reproduction. The gain is, for example, stored in memory in a database on storage means of the computer or reproduction system and can be accessed by the sound card management program, such that each recording stored on the storage means of the computer or the reproduction system is associated with a gain in the database. In this way, before the reproduction of a specified recording, the sound card management program consults its database and collects the data representing the gain calculated for this recording. During the setting of the sound of the recording, the level selected by the user will be automatically adjusted by a value corresponding to the calculated gain Gv, such that the real sound level indeed corresponds to the level selected by the user and is homogeneous for all the recordings contained in the storage means. The adjustment may be made by a positive or negative value.
Another variant of the process according to the invention consists of adjusting the gain for the sound signals of a recording corresponding to low-pitched and/or high-pitched sounds. The aim of the process is to increase, when possible, the gain for low-pitched and/or high-pitched sounds without exceeding the sound level selected by the user and without exceeding a maximum gain set for low-pitched and/or high-pitched sounds. It is necessary to underline that, in this variant, only low-pitched and/or high-pitched sounds are concerned by the dynamic gain adjustment, when the reproduction enables independent setting of the general sound level and the sound level of low-pitched and/or high-pitched sounds. In this way, when the sound level of low-pitched and/or high-pitched sounds is less than the sound level selected by the user, an additional gain is authorised on low-pitched and/or high-pitched sounds to increase the perception of low-pitched and/or high-pitched sounds to improve the reproduction quality of the recording. This additional gain will be at most equal to the gain requested by the user for low-pitched and/or high-pitched sounds.
The maximum volume is obtained when the incoming signal on the amplifier is not attenuated, i.e. at a gain of 0 dB. So as to obtain a gain for low-pitched and/or high-pitched sounds systematically, the overall maximum volume for the recording may be less than zero dB and the maximum volume of low-pitched and/or high-pitched sounds is determined so that the incoming gain in the amplifier can be equal to zero dB. Consequently, it is always possible to obtain a gain for low-pitched and/or high-pitched sounds corresponding to the absolute value of the recording volume attenuation. In this way, for example, if the recording volume attenuation is −3 dB, the gain for low-pitched and/or high-pitched sounds is 3 dB. So as to limit the influence of the dynamic adjustment of low-pitched and/or high-pitched sounds, the maximum low-pitched and/or high-pitched sound gain is limited, for example to 12 dB. In this way, even if, for a specified volume, the gain for low-pitched and/or high-pitched sounds may be 16 dB, for example, it will only actually be 12 dB.
For example,
Once the dynamic low-pitched and/or high-pitched sound adjustment has been carried out, the digital signal 4110 is applied to the input of a digital/analogue converter 412, 422, 423, the output of which is connected to the input of an amplifier 51, 52, 53 on which loud speakers 61, 62, 63 are connected.
It is understood that the process according to the invention makes it possible, after prior determination of the possible volume gain for each recording, to reproduce all the digital recordings analysed, with the same sound level, for the same sound setting selected by a user.
It must be clear for those experienced in the art that the present invention enables embodiments in many other specific forms without leaving the field of the invention as claimed. Consequently, the present embodiments must be considered as illustrations, but may be modified in the field defined by the scope of the claims attached, and the invention must not be limited to the details given above.
Number | Date | Country | Kind |
---|---|---|---|
00 01905 | Feb 2000 | FR | national |
This application is a continuation of application Ser. No. 09/583,864, filed Jun. 1, 2000, now U.S. Pat. No. 7,107,109, the entire content of which is hereby incorporated by reference in this application.
Number | Name | Date | Kind |
---|---|---|---|
3982620 | Kortenhaus | Sep 1976 | A |
4186438 | Benson | Jan 1980 | A |
4232295 | McConnell | Nov 1980 | A |
4335809 | Wain | Jun 1982 | A |
4335908 | Burge | Jun 1982 | A |
4412292 | Sedam | Oct 1983 | A |
4521014 | Sitrick | Jun 1985 | A |
4528643 | Freeny | Jul 1985 | A |
4558413 | Schmidt | Dec 1985 | A |
4572509 | Sitrick | Feb 1986 | A |
4582324 | Koza | Apr 1986 | A |
4593904 | Graves | Jun 1986 | A |
4597058 | Izumi | Jun 1986 | A |
4636951 | Harlick | Jan 1987 | A |
4652998 | Koza | Mar 1987 | A |
4654799 | Ogaki | Mar 1987 | A |
4658093 | Hellman | Apr 1987 | A |
4667802 | Verduin | May 1987 | A |
4675538 | Epstein | Jun 1987 | A |
4677311 | Morita | Jun 1987 | A |
4677565 | Ogaki | Jun 1987 | A |
4703465 | Parker | Oct 1987 | A |
4707804 | Leal | Nov 1987 | A |
4722053 | Dubno | Jan 1988 | A |
4761684 | Clark | Aug 1988 | A |
4766581 | Korn | Aug 1988 | A |
4787050 | Suzuki | Nov 1988 | A |
4792849 | McCalley | Dec 1988 | A |
4811325 | Sharples | Mar 1989 | A |
4825054 | Rust | Apr 1989 | A |
4829570 | Schotz | May 1989 | A |
4868832 | Marrington | Sep 1989 | A |
4920432 | Eggers | Apr 1990 | A |
4922420 | Nakagawa | May 1990 | A |
4924378 | Hershey | May 1990 | A |
4926485 | Yamashita | May 1990 | A |
4937807 | Weitz | Jun 1990 | A |
4949187 | Cohen | Aug 1990 | A |
4956768 | Sidi | Sep 1990 | A |
4958835 | Tashiro | Sep 1990 | A |
4999806 | Chernow | Mar 1991 | A |
5012121 | Hammond | Apr 1991 | A |
5041921 | Scheffler | Aug 1991 | A |
5058089 | Yoshimaru et al. | Oct 1991 | A |
5138712 | Corbin | Aug 1992 | A |
5155847 | Kirouac | Oct 1992 | A |
5163131 | Row | Nov 1992 | A |
5166886 | Molnar | Nov 1992 | A |
5191573 | Hair | Mar 1993 | A |
5191611 | Lang | Mar 1993 | A |
5192999 | Graczyk | Mar 1993 | A |
5197094 | Tillery | Mar 1993 | A |
5203028 | Shiraishi | Apr 1993 | A |
5237157 | Kaplan | Aug 1993 | A |
5237322 | Heberle | Aug 1993 | A |
5239480 | Huegel | Aug 1993 | A |
5250747 | Tsumura | Oct 1993 | A |
5252775 | Urano | Oct 1993 | A |
5260999 | Wyman | Nov 1993 | A |
5262875 | Mincer et al. | Nov 1993 | A |
5276866 | Paolini | Jan 1994 | A |
5315161 | Robinson | May 1994 | A |
5339413 | Koval | Aug 1994 | A |
5341350 | Frank | Aug 1994 | A |
5355302 | Martin | Oct 1994 | A |
5357276 | Banker | Oct 1994 | A |
5369778 | SanSoucie | Nov 1994 | A |
5375206 | Hunter | Dec 1994 | A |
5418713 | Allen | May 1995 | A |
5420923 | Beyers | May 1995 | A |
5428252 | Walker | Jun 1995 | A |
5431492 | Rothschild | Jul 1995 | A |
5445295 | Brown | Aug 1995 | A |
5455926 | Keele | Oct 1995 | A |
5457305 | Akel | Oct 1995 | A |
5465213 | Ross | Nov 1995 | A |
5475835 | Hickey | Dec 1995 | A |
5481509 | Knowles | Jan 1996 | A |
5495610 | Shing | Feb 1996 | A |
5496178 | Back | Mar 1996 | A |
5499921 | Sone | Mar 1996 | A |
5511000 | Kaloi | Apr 1996 | A |
5513117 | Small | Apr 1996 | A |
5548729 | Akiyoshi | Aug 1996 | A |
5550577 | Verbiest | Aug 1996 | A |
5555244 | Gupta | Sep 1996 | A |
5557541 | Schulhof | Sep 1996 | A |
5559505 | McNair | Sep 1996 | A |
5559549 | Hendricks | Sep 1996 | A |
5561709 | Remillard | Oct 1996 | A |
5566237 | Dobbs | Oct 1996 | A |
5570363 | Holm | Oct 1996 | A |
5579404 | Fielder et al. | Nov 1996 | A |
5583994 | Rangan | Dec 1996 | A |
5592551 | Lett | Jan 1997 | A |
5594509 | Florin | Jan 1997 | A |
5612581 | Kageyama | Mar 1997 | A |
5613909 | Stelovsky | Mar 1997 | A |
5619247 | Russo | Apr 1997 | A |
5619698 | Lillich | Apr 1997 | A |
5623666 | Pike | Apr 1997 | A |
5642337 | Oskay | Jun 1997 | A |
5644714 | Kikinis | Jul 1997 | A |
5644766 | Coy | Jul 1997 | A |
5668592 | Spaulding | Sep 1997 | A |
5668788 | Allison | Sep 1997 | A |
5684716 | Freeman | Nov 1997 | A |
5691778 | Song | Nov 1997 | A |
5697844 | Von Kohorn | Dec 1997 | A |
5703795 | Mankovitz | Dec 1997 | A |
5708811 | Arendt | Jan 1998 | A |
5712976 | Falcon | Jan 1998 | A |
5726909 | Krikorian | Mar 1998 | A |
5734719 | Tsevdos | Mar 1998 | A |
5734961 | Castille | Mar 1998 | A |
5761655 | Hoffman | Jun 1998 | A |
5762552 | Vuong | Jun 1998 | A |
5774668 | Choquier et al. | Jun 1998 | A |
5774672 | Funahashi | Jun 1998 | A |
5781889 | Martin | Jul 1998 | A |
5790172 | Imanaka | Aug 1998 | A |
5790671 | Cooper | Aug 1998 | A |
5790856 | Lillich | Aug 1998 | A |
5793980 | Glaser | Aug 1998 | A |
5798785 | Hendricks | Aug 1998 | A |
5802599 | Cabrera | Sep 1998 | A |
5808224 | Kato | Sep 1998 | A |
5809246 | Goldman | Sep 1998 | A |
5832287 | Atalla | Nov 1998 | A |
5835843 | Haddad | Nov 1998 | A |
5845104 | Rao | Dec 1998 | A |
5848398 | Martin | Dec 1998 | A |
5854887 | Kindell | Dec 1998 | A |
5862324 | Collins | Jan 1999 | A |
5864870 | Guck | Jan 1999 | A |
5867714 | Todd | Feb 1999 | A |
5884028 | Kindell | Mar 1999 | A |
5884298 | Smith | Mar 1999 | A |
5887193 | Takahashi | Mar 1999 | A |
5913040 | Rakavy | Jun 1999 | A |
5915094 | Kouloheris | Jun 1999 | A |
5915238 | Tjaden | Jun 1999 | A |
5917537 | Lightfoot | Jun 1999 | A |
5917835 | Barrett | Jun 1999 | A |
5923885 | Johnson | Jul 1999 | A |
5930765 | Martin | Jul 1999 | A |
5931908 | Gerba | Aug 1999 | A |
5949688 | Montoya | Sep 1999 | A |
5959869 | Miller | Sep 1999 | A |
5959945 | Kleiman | Sep 1999 | A |
5966495 | Takahashi | Oct 1999 | A |
5978855 | Metz | Nov 1999 | A |
6002720 | Yurt | Dec 1999 | A |
6009274 | Fletcher | Dec 1999 | A |
6018337 | Peters | Jan 2000 | A |
6018726 | Tsumura | Jan 2000 | A |
6072982 | Haddad | Jun 2000 | A |
6151634 | Glaser | Nov 2000 | A |
6341166 | Basel | Jan 2002 | B1 |
6498855 | Kokkosoulis et al. | Dec 2002 | B1 |
6522707 | Brandstetter et al. | Feb 2003 | B1 |
6744882 | Gupta et al. | Jun 2004 | B1 |
7107109 | Nathan et al. | Sep 2006 | B1 |
20010016815 | Takahashi et al. | Aug 2001 | A1 |
Number | Date | Country |
---|---|---|
0498130 | Aug 1992 | EP |
0498130 | Aug 1992 | EP |
0632371 | Jan 1995 | EP |
0817103 | Jan 1998 | EP |
0841616 | May 1998 | EP |
0982695 | Mar 2000 | EP |
WO 9612255 | Apr 1996 | WO |
WO 9612257 | Apr 1996 | WO |
Number | Date | Country | |
---|---|---|---|
20060265093 A1 | Nov 2006 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 09583864 | Jun 2000 | US |
Child | 11495620 | US |