Not applicable.
Not applicable.
Not applicable.
Not applicable.
1. Field of the Invention
The present patent application for industrial invention relates to a system used to reproduce the sound of a stringed instrument, in particular a piano, by means of modeling and digital synthesis of the oscillatory components, or partials, due to the excitation of the string constrained together with the other strings of the instrument, as in the case of the piano strings.
2. Description of Related Art Including Information Disclosed Under 37 CFR 1.97 and 37 CFR 1.98.
The most common methodology used in the digital synthesis of the sound of musical instruments consists of storing a collection of sounds sampled from real musical instruments in the memory of a synthesis device. The samples can be pre-processed before storage and are successively reproduced in real time, during the synthesis, adding a post-processing that aims at adapting them to the player's requirements. Said processing modifies the recorded sounds in variable extent through dedicated computing resources, thus permitting proportional processing before storage. With the increase of computing resources, samples can be simplified in wavetable format or additionally reduced to few data in the memory according to wave-shaping techniques.
A methodology alternative to the use of samples provides for completely synthesizing the sound of the instrument by means of physical models. By simulating the dynamics of specific components of the musical instrument, these models imitate what happens in reality when a component typically identified as “exciter” stresses the remaining part of the model, identified as “resonator”. In the case of a piano, the use of hammer-string models based on digital waveguides is known, which are able to reproduce the motion of the string at the bridge, starting from the information on the impact velocity of the hammer on the string; then, the corresponding motion signal is processed by a discrete-time realization of a model of the soundboard without feedback effects on the string model (see Bank et al., EURASIP journal on Applied Signal Processing, vol. 2003, pp. 941-952, 2003).
In between the two aforementioned methodologies is the class of methods that make use of physical models, whose excitation is performed by injecting in the model a signal that is an indirect function of the force impressed by the player. With reference to the piano, the known models are the digital waveguide model excited by the hammer-soundboard-instrument body assembly (known as commuted synthesis, see Smith, U.S. Pat. No. 5,777,255), and the additive synthesis model of damped sinusoidal components informed by finite elements of the string-soundboard assembly, excited by signals measured directly from the piano, i.e. obtained from simulations made on physical models comparable to the ones described in the previous paragraph (see Guillaume, U.S. Pat. No. 7,915,515 B2).
The aforementioned prior art, in the context of piano simulation, does not provide for the realization of a method by means of a digital device, in which a scalable model of the strings (resonator) is stressed by a hammer model (exciter) according to the force impressed on the key by the player, thus generating a sound that is then sent to a post-processing step that takes into account the action of the soundboard-instrument body on the previously generated sound. The theoretical foundations of such a method are known from the literature (see Balázs Bank, Stefano Zambon, and Federico Fontana, pp 809-821, IEEE Transactions on Audio, Speech and Language Processing, Vol 18, No 4, May 2010): in particular, the same theory guarantees the rendering of all partials generated by the strings of a standard 88-key piano, as well as of the oscillatory components deriving from the longitudinal motion of the strings.
EP 2 261 891 discloses a method used to synthesize tone signals and a system used to generate tone signals, in particular for electronic pianola.
The primary purpose of the present invention is to eliminate the drawbacks of the prior art and realize a system based on the interconnection of a hammer, strings, and soundboard-instrument body model to synthesize digital piano sounds through the rendering of all oscillatory partial and transient longitudinal components of the instrument in different playing conditions.
An additional purpose is to provide a realization of the hammer, strings, and soundboard-instrument body model as much accurate as possible in terms of sound realism, and as much efficient as possible in terms of computational cost.
Another purpose is to provide a realization of the hammer and strings model allowing for fine tuning of the simulated instrument, similarly to what occurs in the real instrument when hammers and strings are tuned.
The present invention is based on certain assumptions that are known from the vast literature that quantifies the measurable characteristics of piano sound depending on the mechanical characteristics of the instrument and its operation in different playing conditions. Based on these assumptions, and using the quantitative results proposed by the same literature that can be used to set the operating parameters, the present invention provides for modeling:
The system of the invention leads to two main advantages:
For a better understanding the description of the system of the invention continues with reference to the enclosed drawings, wherein:
Referring to
The system (1) comprises a number N of note modules equal to the number of hammers of the musical instrument. If the stringed instrument is a piano, for instance, the number N of note modules is 88, just like a standard 88-key piano provided with 88 hammers hitting the strings.
Each note module comprises a hammer module (100), a primary resonator and longitudinal motion module (200), a secondary resonator module (300) and a duplex resonator module (400).
The information on the impact velocity of the hammer of each key played on a keyboard is instantaneously directed to the corresponding hammer module (100). In the keyboards of standard digital pianos, such information is typically detected by measuring the flight time of the hammer between two predefined points, one of them being situated immediately in the proximity of the impact point on the corresponding strings.
Referring to
As it is known in the prior art, the force signal (ff) can be determined from measurements made on real musical instruments, or from simulations made on physics-based sound models able to simulate the hammer-string system of the piano in different playing conditions, including ff dynamics (see Balázs Bank, Stefano Zambon, and Federico Fontana, pages 809-821, IEEE Transactions on Audio, Speech and Language Processing, Vol 18, No 4, May 2010).
Instead, the resonance impulse (Imp) can be obtained in a known way as residual signal from the same measurements or simulations, through the use of techniques capable of de-correlating the harmonic part from a note attack transient.
The force signal (ff) is divided in two portions with complementary amplitude by means of respective gain blocks (120, 130). The first gain block (120) has gain (g) comprised between 1 and 0, whereas the second gain block (130) has gain (1−g). The purpose of the two gains (120, 130) is to weigh, upon varying the impact velocity of the hammer associated with the key, the action of two low-pass filters (140, 160) with variable cut-off frequency. The first low-pass filter (140) has a slope of 6 dB (140) and is installed downstream the second gain block (130). The second low-pass filter (160) has a slope of 18 dB and is installed downstream an adder (150) that sums the output from the first gain block (120) to the output of the first low-pass filter (140).
The gain blocks (120, 130) can be designed according to standard digital signal processing techniques: by controlling gain g in a range from 0 to 1 proportionally to the velocity, and summing by means of the adder (150) the outputs from the respective weighed branches, downstream the second filter (160), an equivalent low-pass effect is obtained with slope of 6+18=24 dB for null velocity and gain because of the action in series of filters (140) and (160) on the force signal (110).
Vice versa, for g values close to 1, a low-pass filtering with slope of 18 dB would be obtained, due to the action of the second filter (160) only, the first filter (140) being no longer fed by a sufficiently wide input.
In actual fact, such a system provides for progressively increasing the cut-off frequency of both filters (140, 160) upon increasing the gain g: in such a way, the low-pass effect due to the first filter (140) is progressively attenuated together with the amplitude of the signal entering the filter, whereas likewise the low-pass effect due to the second filter (160) is attenuated contextually to a proportional increase of the amplitude of the force signal (ff) directly entering the second filter (160).
The global effect of such a control on gain g and simultaneously on the cut-off frequencies of filters (140) and (160) is the optimization of the slope of the spectrum of the force signal (ff) at the different hammer velocities exerted by the player. A global scaling of the signal is operated by means of a third gain block (170) installed downstream the second filter (160). A force signal Fh is generated by the third gain block (170). The third gain block (170) is a function of the impact velocity of the hammer. This also optimizes the amplitude of the force signal Fh outgoing from the hammer module (100).
In parallel to the force signal (ff), the resonance impulse (Imp) is subject to the action of a third low-pass filter (185) identical to the first low-pass filter (140) and successively to the action of a fourth gain block (190) identical to the third gain block (170). The fourth gain block (190) generates a resonance impulse signal (Fh,res) as a function of the impact velocity of the hammer.
Both the third filter (185) and the fourth gain block (190) are controlled by the impact velocity of the hammer just like their respective equivalents (140) and (170). The presence of the third filter (185) and the fourth gain block (190) allows for simultaneously reducing resonances and controlling the amplitude of the resonance impulse (180), respectively. In this way, an evolution of the resonance impulse signal (Fh,res) as a function of the impact velocity of the hammer is obtained.
Referring to
As shown in
The signal of the quadratic components (Fquad) is scaled by means of a first gain block (250). Then, the two signals are summed by means of an adder (255) and the signal obtained is scaled again by means of a second gain block (260) obtaining in output the primary component signal (Fprim+quad).
The spectral components which lay above one fourth of the sampling frequency of the system are removed from the force signal (Fh) by means of a low-pass filter (230). As it is known from the digital signal processing theory, the signal outgoing from the low-pass filter (230) can be squared by means of a multiplier (235), without incurring in the known frequency aliasing phenomenon. The squared signal is filtered by means of a high-pass filter (240) with cut-off frequency set one octave below the fundamental longitudinal frequency of the K-th note, in such manner to obtain an excitation signal (Fexc). Moreover, having removed the continuous component of the signal outgoing from the multiplier (235) by means of the filter (240), the excitation signal (Fexc) meets the conditions to feed the longitudinal resonators (270) that synthesize the longitudinal oscillatory components. The longitudinal resonators (270) generate a longitudinal component signal (Flong) that contains the longitudinal oscillatory components of the K-th note.
Referring to
b0=Ak exp(−1/(FsTk))sin(2πfk/Fs)
a1=−2 exp(−1/(FsTk))cos(2πfk/Fs)
a2=exp(−2/(FsTk))
Said input/output relation is realized by the filter illustrated in
The system of the invention uses resonator filters, such as the one illustrated above, with decay time parameters Tk controlled in such a way to dynamically vary the decay of each partial oscillatory component of the simulated instrument further to excitation of the corresponding string by the hammer. For each partial oscillatory component the decay dynamics is governed by alternatively selecting three values Tk′, Tk″ and Tk′″ for the respective resonator filter, which are predefined during the design stage based on data from decay measurements of partial components in a real piano.
Referring to the time/amplitude envelope diagram illustrated in
Additionally,
The force signal (Fh) is scaled by the corresponding gain (340), whose value is determined by the theory in the cited literature (BANK, ZAMBON & FONTANA).
A switch (380) is connected to each secondary resonator filter (360), switching between a position (A) in which it connects the gain (340) and another position (B) in which it connects a gain (350) where an active note signal (Fc) is fed.
Referring to
Going back to
When the force signal (Fh) becomes constantly null, the switches (380) change state (moving to position (B)), allowing for circulation in the filter bank (360) of the active note signal (Fc) containing the primary oscillatory components, as well as the resonance impulses of the keys active in that moment, globally scaled by gain (350) before introduction into the respective filter (360).
The outputs of all filters (360) are added by means of an adder (370) that superimposes the output of all filters (360), forming a signal of secondary components (Fsec) outgoing from the secondary resonator module (300).
Going back to
Each secondary resonator module (300) receives the active note signal (Fc) when the respective hammer is not active. In view of the second adder (960) each secondary module (300) collects the primary oscillatory component signals (Fprim+quad) coming from all active primary modules (200), each of them scaled through a respective gain (800).
In view of the first adder (920), each secondary module (300) collects the resonance impulse signals (Fh,res) coming from all active hammers (100), each of them scaled through a gain (750).
The outputs from the first adder (920) and the second adder (960) are summed by means of the third adder (940) and globally scaled by a gain (900), in such manner to form the active note signal (Fc) that contains information from the strings and hammers of each active note. Because of such a mechanism, the system of the invention controls the synthesis of partial components produced by means of sympathetic resonance by all strings, as well as the synthesis caused by the harmonic part of the hammer strike peculiar of each hammer.
The various signals outgoing from the filters (410) of the duplex module are summed by means of an adder (420) and the resulting signal is scaled by means of a gain (430) in such a way to obtain a duplex signal (Fduplex) that is emitted in output from the duplex module (400).
Referring to
The signal values outgoing from each filter (220) are squared by a multiplier (222) in such a way to obtain the corresponding quadratic oscillatory components. Said quadratic oscillatory components are globally superimposed by means of an adder (226) and finally sent to a high-pass filter (227) from which a quadratic signal (Fquad) is generated. Likewise filter (240), the high-pass filter (227) is adapted to remove the very low frequency continuous components from the signals outgoing from the multipliers (222). In such away, the quadratic signal (Fquad) containing the tension modulation harmonics of the strings of the K-th note is emitted by the primary resonator module (210).
As shown in
Vice versa, the filters (220) of the bank associated with a tension modulation harmonic always correspond to a secondary resonator suitably tuned in its parameters, adapted to control the envelope beats of the corresponding partial component; for the same reason, the signal entering these filters is not scaled by the constant factor expressed by the gain (213). In both cases, the outputs from the resonator are globally superimposed by means of a second adder (225) to form a primary output signal (Fprim) containing the primary oscillatory components of the K-th note.
Referring to
Each free resonator filter (273) of the bank, suitably set in resonance parameters, receives the excitation force signal (Fexc) from the hammer downstream the multiplication by a gain (271). The global superimposition of the oscillatory components, obtained by means of an adder (275), represents the set of the longitudinal components for the strings of the K-th note.
In case of the aforementioned longitudinal oscillatory components, the system of the invention provides for realizing the synthesis differently from what provided for by the prior art. In fact, the theory provides for realizing a certain number of signal products (or “loop modulations”) between certain partial oscillatory components belonging to the same string; each of these products represents an excitation force component of a corresponding longitudinal mode of the string. Now the force component is filtered through a “forming” pass-band filter, the impulse response of which models the free response of the same longitudinal mode. Therefore a corresponding forced longitudinal oscillatory component of the string is present at the output of the forming filter.
Unlike this procedure, the system of the present invention provides for using the signal outgoing from the high-pass filter (240) of
fk=fm+fn
Tk=(TmTn)/(Tm+Tn)
Ak=|H(fk)|(AmAn)1/2
wherein |H(f)| is the free amplitude response of the longitudinal mode, whose value in correspondence of frequency fk of the excitation force component is selected.
The novelty introduced to synthesize the longitudinal oscillatory components compared to the prior art consists of the fact the forming filters respond to the signal coming from the hammers and therefore resonate excessively during the note attack step. For this reason the system of the invention provides for excluding these filters from the synthesis chain of the longitudinal oscillatory components. Nevertheless, the transient components produced by them are crucial for the accurate sound synthesis due to the longitudinal motion of the strings.
The solution of the invention is to add a second forming filter bank (277) that reproduces the free response components in parallel to the resonator bank (273) for reproduction of the forced longitudinal oscillatory components. The outputs from the free resonator filters (277) are added by means of an adder (280) and the resulting signal is scaled by means of a gain (282).
In this way, the transient components caused by the free response can be synthesized and simultaneously kept under control thanks to the gain (282), which scales their amplitude without affecting the longitudinal oscillatory components. The outputs from the two banks (273, 277) are finally superimposed by means of an adder (285) and scaled by a gain (290) to form a longitudinal component signal (Flong).
Consequently, the system of the invention proposes an approximate solution to the problem, consisting of grouping multiple notes in the same split, the size of which varies inversely with the available computing power. To each pair of adjacent splits a specific two-step structure based on digital filters is associated, the realization of which is known in the prior art (see Bank, Zambon and Fontana). The first step is composed of two filtering modules (738) and (750), whereas the second step is composed of a convolution module (760).
In view of the novelty proposed by the present system, the design of the first step provides for a considerable saving in computations compared to the use of only signal convolution, regardless of the efficiency of the convolution techniques used, for which reference is made to the literature of the prior art.
Specifically, each split (705) adds the input signals (Ftot) associated with it, and at the same time selects an optimal lateralization parameter for the sound produced by the notes that are active in that moment on the split. The value of said parameter lateralizes, that is to say shifts laterally with respect to an ideal point located on the center in front of the listener, the position of the acoustic source represented by the sound. In case of a split of piano notes, at every moment this value can correspond to the center of the region that originates the sound, which is a function of the notes of the split that are played in the same moment. Finally, the lateralization parameter value determines a temporal delay value that is used by a block (710) to define two instances of the input signal, one of them being delayed with respect to the other one by the corresponding value. Discrete-time lateralization models of the acoustic source based on the relative delay between pairs of otherwise identical signals, which contain the same source sound, are known in the prior art.
The signals outgoing from the left channel of each block (710) of the corresponding split pair are summed by means of an adder (720) and processed by the first step, consisting of a bank of N all-zero second-order digital filters (738) and a bank of N all-pole second-order digital filters (750). Assuming to define an even split number P, with reference to the P/2-th split pair of
Being the poles of the i-th filter of each bank common to all splits, the inputs at the bank filters having the same index i can be individually processed by the all-zero part (738) of the filter, the output of which is sent to processing by the common-pole part (750) of the i-th filter.
In practical terms, the signal outgoing from the adder (720) of the P/2-th pair is processed by the N all-zero filters (738), each of them being respectively characterized by coefficient b0,i,P/2 of a gain block (722), and in parallel by the delay element z−1 (725) in series to coefficient b1,i,P/2 (730). When all signals outgoing from the respective adders (720) have been processed by the corresponding all-zero filters (738), the sum (740) of P/2 signals, each of them outgoing from the i-th all-zero filter of the bank is sent to the i-th all-pole filter (750), respectively characterized by coefficients −a1,i and −a2,i to complete the filtering operation.
The second processing step completes the global characterization of the soundboard-instrument body for the left signal of all notes. The convolution module (760) receives the sum (755) of the outputs from the all-pole parts (750) of the N filters, in parallel to the sum (745) of the P/2 inputs at the corresponding filter banks, scaled by the respective gains (735) forming the second order transfer characteristic HiP/2(z) illustrated above. A similar processing is carried out for the right signal outgoing from all blocks (710), which undergoes a structurally identical processing, except for the values assumed by the coefficients of the respective common zero-pole second-order transfer characteristics and by the parameters of the relative convolution block.
Although the identification and extraction of 2N common poles from a set of transfer characteristics is illustrated and motivated by the prior art (see Y. Haneda, S. Makino, and Y. Kaneda, “Multiple-point equalization of room transfer functions by using common acoustical poles,” IEEE Trans. Speech Audio Processing, vol. 5, no. 4, pp. 325-333, 1997), the specific application of such a methodology to a multidimensional pole-zero model able to characterize the soundboard and the instrument body is a novelty proposed in the system of the invention.
In the specific case of filter banks with N elements as illustrated above, the system of the present invention provides for selecting 2N common poles from a set of target transfer characteristics. The characteristics are obtained from measurements of the response of the soundboard-instrument body: first, by disaggregating from the measurements information common to all responses, which characterizes the second step, or convolution; successively, by identifying the aforementioned characteristics as residues of the same responses after extracting the common information from them.
After selecting the common poles, the NP zero positions are optimized to define N(P/2) second-order common-pole filters able to minimize, for each N(P/2) input/output pair, the sum of the quadratic errors with respect to the target characteristics.
The use of P/2 banks with N common-pole second-order filters allows for considerably reducing the computational load of the module (700) compared to the cost in case of realization of N(P/2) second-order filters having distinct poles. In fact, such a reduction allows for avoiding the calculation of N(P/2)−N=(N−1)(P/2) all-pole second-order equivalent filters, without appreciable loss of model accuracy.
Number | Date | Country | Kind |
---|---|---|---|
AN2012A0023 | Mar 2012 | IT | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2013/054874 | 3/11/2013 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2013/135627 | 9/19/2013 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
7285718 | Sasaki | Oct 2007 | B2 |
7435895 | Muramatsu | Oct 2008 | B2 |
7598448 | Ohba | Oct 2009 | B2 |
8013233 | Komatsu | Sep 2011 | B2 |
Number | Date | Country |
---|---|---|
2261891 | Dec 2010 | EP |
2013135627 | Sep 2013 | WO |
Entry |
---|
International Search Report for corresponding International Application No. PCT/EP2013/054874. |
Balazs Bank et al., “A Modal-Based Real-Time Piano Synthesizer”, IEEE Transactions on Audio, Speech adn Language Processing, IEEE Service Center, New York, NY USA vol. 18, No. 4, May 1, 2010, pp. 809-821. |
Number | Date | Country | |
---|---|---|---|
20150068390 A1 | Mar 2015 | US |