The present invention relates to a device and a method for analyzing an audio datum, in particular to a device which may be used, for example, in connection with a display device, an accompaniment device or another evaluation device, for example to enable a faster and simpler determination of a key, a key change, a chord or a chord change.
When making music, but also when otherwise dealing with a piece of music or an existing sequence of chords, an analysis of the existing or sounding piece of music is required in many situations, for example to enable improvising on the existing piece of music, i.e. creatively generating harmonically and consonantly sounding melodies, or accompanying the existing piece of music, i.e. creating a sequence of chords and/or a sequence of single tones which go with the melody and tend to underline the same.
This frequently requires of a person a minimum measure of experience in dealing with music, which may often only be acquired through several years of working with music and/or a musical instrument. In addition to that, a corresponding analysis frequently requires of a person a certain musical talent, which, in the case of very complex pieces of music, may even require absolute pitch. This, however, excludes many people who lack the required background knowledge of music theory, sufficient experience in dealing with music and/or a musical instrument, or the corresponding talent.
In literature, many teaching aids and means for learning and/or finding chords, harmonies and keys are known. These are often templates, discs or other objects, in particular mechanically connected, shiftable or rotatable templates on which connections regarding music theory are illustrated. Such learning aids and means are, for example, described in the following documents DE 8005260 U1, DE 8902959 U1, DE 3744255 A1, U.S. Pat. No. 5,709,552, DE 3690188 T1, US 2002/0178896 A1, DE 4002361 A1, DE 19831409 A1, DE 19859303 A1, DE 29801154 U1 and DE 20301012 U1. In general, on one of the discs or the corresponding objects a sequence of pitches is applied which in general either corresponds to the chromatic scale consisting of a sequence of twelve semitones and thus all available pitches of an equal temperament, or to the circle of fifths, wherein a pitch interval of two adjacent pitches is a fifth (for example C-G or F-C). DE 8005260 shows a device for finding chords, harmonies and keys with an arrangement in an interval of a third.
The utility model DE 29512911 U1 describes a teaching and learning aid for a synthesis and analysis of connections regarding music theory with several different templates and at least twelve gaming pieces provided with designations of pitches.
The European patent EP 0452347 B1 refers to a universal operating unit for an electronic musical instrument comprising a number of note selectors, each of which provides a note selection signal when a note is selected and a note deselection signal when a note is deselected, note turn-on devices coupled to the number of note selectors for providing note-designating information associated with each note selector and for providing a note turn-on signal triggered by the note selection signal which includes the corresponding note-designating information, a memory means for storing the note-designating information provided as triggered by the note selection signal, means coupled to the note turn-on device for changing the note-designating information and note turn-off devices coupled to the number of note selectors and to the memory means for providing a note turn-off signal triggered by the note deselection signal which includes the note-designating information stored when providing the note selection signal.
The patent DE 4216349 C2 describes an electronic musical instrument having a melody and an accompaniment keyboard. The musical instrument described has a melody keyboard whose melody keys include switches including two switching stages, wherein those pitches corresponding to the white keys are associated with the first switching stages and those pitches corresponding to the black keys of a keyboard are associated with the second switching stages, and an accompaniment keyboard comprising accompaniment keys which, when operated, may call an automatic chord accompaniment, wherein the accompaniment keys are respectively implemented as switches having at least two switching stages which have different associated accompaniment chords. An operation of the described electronic musical instrument does not require knowledge of musical notation, but requires, due to the described modeling according to a fingerboard, an operator who is educated in music theory, as in particular certain combinations of individual pitches and chords, which are needed in particular for pedagogical purposes, are obvious. In particular, the document describes a musical instrument with a one-finger accompaniment system, which a user may operate manually to generate an accompaniment chord.
The patent DE 2857808 C3 describes an electronic musical instrument combined with an electronic clock. The invention relates to an electronic musical instrument, wherein via input and storage means any pitch sequences and pieces of music may be input and retrieved again. The described electronic musical instrument thus enables only an input with a subsequent storage of a pitch sequence and a reproduction of the stored pitch sequence via a pitch generator circuit to reproduce the stored sequence of pitches in the form of a sequential acoustic presentation. It is in particular disadvantageous with regard to the musical instrument described, that the input and/or the “programming” of the pitch sequence takes place via a 10-key pad, extended by several additional keys. In particular, the electronic musical instrument described also requires a certain minimum of theoretical musical knowledge, as otherwise a programming of the musical instrument will hardly be realizable.
The European patent EP 0834167 B1 refers to a virtual musical instrument with a new input device. In particular, the above-mentioned patent application refers to a virtual musical instrument having a portable accessory of a type which is to be brought in contact with a musical instrument in order to play this instrument, wherein the mentioned portable accessory comprises a switch which generates an activation signal as a reaction to a person holding the mentioned portable accessory causing the mentioned portable accessory to hit another object. The mentioned activation signal is received by a digital processor, which in turn generates a control signal which causes a synthesizer to generate a note which is represented by a selected note data structure. In particular, the patent application describes a virtual musical instrument, wherein the mentioned portable accessory is a guitar plectrum and wherein a user may only make pitches from within a predetermined amount of pitches sound via the synthesizer.
The European patent EP 0632427 B1 relates to a method and a device for inputting musical data. More specifically, the mentioned patent relates to a musical data input device including an input recording means for recording a hand-written input on it, a position detection means for detecting a position on the input recording means where the hand-written input is performed to obtain pitch data representative of a pitch of a musical note, an input detection means for detecting the hand-written input performed on the input recording means, wherein the input detection means comprises a means for detecting the number of pushing events performed on the input recording means or for detecting a time period in which the input recording means is pushed, or for detecting the intensity of pressure which is exerted on the input recording means during the hand-written input, or comprises a number detection means to detect a number written onto the input recording means, or a line detection means to detect the length of a line which is drawn onto the input recording means, a time designation means for designating time data representative of the length of a musical pitch, on the basis of the detected number of pushing events or the detected time period or the detected intensity of pushing events or the detected number or the detected length of a line detected by the input detection device, and a musical pitch generation means for detecting musical pitch data on the basis of pitch level data obtained from the position detection means and the time data obtained from the time designation means. In particular, the mentioned patent application describes a musical data input device having an LCD unit (LCD=liquid crystal display) and a touch pad arranged on the same, via which, with the help of a pen, pitches may be inserted into a pitch system. The described musical data input device thus relates to people having a sufficiently high knowledge of connections regarding music theory.
The patent application U.S. Pat. No. 5,415,071 relates to a method and a device for generating relationships between musical pitches. Here, an arrangement of offset lines or rows of symbols is described, wherein each symbol represents a musical note. Each line includes a repeating series of twelve symbols which forms a musical series of semitones which is also known as the chromatic scale. Here, each line is offset with regard to the adjacent lines so that groups of symbols which represent the same musical relationship, i.e., for example, intervals, scales, chords, etc., form the same visually recognizable configurations, like, for example, diagonal configurations or vertical configurations at certain locations in the arrangement. In one embodiment, such a device which includes such an arrangement may be used as a learning aid, wherein the learning aid comprises two overlapping components which may be shifted against one another. Apart from that, the patent application describes an arrangement of the contact area of a keyboard and/or a claviature of a musical instrument with a claviature or a fingerboard of a musical string instrument which are arranged in accordance with the arrangement. The patent application thus describes a claviature having keys arranged in the form of concentric circles.
According to an embodiment, a device for analyzing an audio datum may have a semitone analyzer, which is implemented to analyze the audio datum with regard to a volume information distribution over an amount of semitones; and a vector calculator, which is implemented to calculate a sum vector over two-dimensional intermediate vectors for each semitone or each element of the definition amount and to output an analysis signal based on the sum vector, based on the volume information distribution or a distribution derived from the volume information distribution, which comprises a definition amount based on the amount of semitones.
According to another embodiment, an accompaniment system, may have a device for analyzing an audio datum, having a semitone analyzer, which is implemented to analyze the audio datum with regard to a volume information distribution over an amount of semitones; and a vector calculator, which is implemented to calculate a sum vector over two-dimensional intermediate vectors for each semitone or each element of the definition amount and to output an analysis signal based on the sum vector, based on the volume information distribution or a distribution derived from the volume information distribution, which comprises a definition amount based on the amount of semitones; and an accompaniment device, which is coupled to the device and implemented to receive the analysis signal and provide a note signal based on the analysis signal.
According to another embodiment, a measurement system may have a device for analyzing an audio datum which may have a semitone analyzer, which is implemented to analyze the audio datum with regard to a volume information distribution over an amount of semitones; and a vector calculator, which is implemented to calculate a sum vector over two-dimensional intermediate vectors for each semitone or each element of the definition amount and to output an analysis signal based on the sum vector, based on the volume information distribution or a distribution derived from the volume information distribution, which comprises a definition amount based on the amount of semitones; and a display device which is coupled to the device to receive the analysis signal and implemented to provide an output signal indicating an angle of the sum vector based on the analysis signal.
According to another embodiment, a detection system may have an integrator, which is implemented to integrate a time-dependent audio input signal regarding time and provide the same as an audio datum; a device for analyzing an audio datum, which may have a semitone analyzer, which is implemented to analyze the audio datum with regard to a volume information distribution over an amount of semitones; and a vector calculator, which is implemented to calculate a sum vector over two-dimensional intermediate vectors for each semitone or each element of the definition amount and to output an analysis signal based on the sum vector, based on the volume information distribution or a distribution derived from the volume information distribution, which comprises a definition amount based on the amount of semitones, wherein the device is coupled to the integrator and provides the analysis signal; and an evaluation device, which is coupled to the device and is implemented to analyze a time course of a length of the sum vector based on the analysis signal and, when the time course of the length of the sum vector comprises a maximum or a minimum, output a detection signal.
According to another embodiment, a key determination system may have a device as mentioned above; and a key determinator, which is coupled to the device and is implemented to generate a key signal indicating a key based on the analysis signal of the device and provide the same at an output.
According to another embodiment, a method for analyzing an audio datum may have the steps of analyzing the audio datum with regard to a volume information distribution over an amount of semitones; calculating a two-dimensional intermediate vector based on the volume information distribution or a distribution derived from the volume information distribution, which comprises a definition amount based on the amount of semitones, for each semitone or each element of the definition amount; calculating a sum vector based on the two-dimensional intermediate vectors; and outputting an analysis signal which is based on the sum vector.
According to another embodiment, a computer program may have a program code for performing the method for analyzing an audio datum, which may have the steps of analyzing the audio datum with regard to a volume information distribution over an amount of semitones; calculating a two-dimensional intermediate vector based on the volume information distribution or a distribution derived from the volume information distribution, which comprises a definition amount based on the amount of semitones, for each semitone or each element of the definition amount; calculating a sum vector based on the two-dimensional intermediate vectors; and outputting an analysis signal which is based on the sum vector, when the computer program runs on a computer.
The inventive device for analyzing an audio datum includes a semitone analysis means which is implemented to analyze the audio datum with regard to a volume information distribution over an amount of semitones, and a vector calculation means which is implemented, based on the volume information distribution or a distribution derived from the volume information distribution, which comprises a definition amount based on the amount of semitones, to calculate for each semitone or each element of the definition amount a sum vector over two-dimensional intermediate vectors for each semitone or each element and to output an analysis signal based on the sum vector.
The present invention is based on the finding that a faster and more efficient analysis of an audio datum, for example with regard to a determination of a key, a change of key, a chord, a change of chord and other connections regarding music theory, is enabled by the fact that the audio datum is analyzed over an amount of semitones with regard to a volume information distribution, and based on the volume information distribution or a distribution derived from the volume information distribution a sum vector is calculated and output as the analysis signal. By the calculation of the sum vector, i.e. a mapping of the volume information distribution to the two-dimensional sum vector, substantial information is gained about a piece of music, perceived to be harmonic and/or consonant by many people, which is present in the form of the audio datum. Regarding this, it is especially advantageous that, by the calculation of the two-dimensional sum vector, significant and thus relevant information may be extracted even from a very complex audio datum, and that the same may thus be analyzed. The inventive device for analyzing an audio datum is thus able to extract substantial information from the audio datum and make the same available in the form of the analysis signal.
It is a substantial advantage that the inventive device for analyzing an audio datum, given a suitable implementation, may perform the analysis in “real time” on the basis of a current value of the audio datum. Limitations to the possibility of an instantaneous and/or direct calculation of the sum vector are basically imposed by the semitone analysis means, which requires a certain time for the analysis of the volume information distribution due to the physical characteristics of sound waves, when the audio datum includes analog or digital audio signals. If, however, the audio datum includes note sequence signals, i.e., for example, analog or digital control signals for a sound generator (e.g. MIDI signals), then the semitone analysis means may perform a corresponding analysis quasi instantaneously.
It is a further advantage, that the vector calculation means may be implemented to perform the calculation of the two-dimensional intermediate vectors by a weighting of the unit vectors, which are associated with the respective semitones and/or the respective elements of the definition amount, with the volume information distribution or the distribution derived from the same. By this, the calculation may be significantly accelerated. In addition to that, as a further advantage, the semitone analysis means may analyze the audio datum with regard to the volume information distribution under consideration of a frequency-dependent weighting function, so that a difference of the perception of consonance and/or harmony regarding the frequency, in particular regarding an octave position, is considered. By this, it is possible to consider hearing specific characteristics, for example to consider that a C major chord is perceived to be differently pleasant in different octavings and/or octave positions.
It is a further advantage that the calculation may be further accelerated by the inventive device for analyzing an audio datum further comprising a pitch class analysis means which forms a pitch class volume information distribution based on the volume information distribution and simultaneously maps the amount of semitones to an amount of pitch classes as the definition amount of the pitch class volume information distribution. Here, a pitch class is referred to as the indication of a pitch neglecting the octave to which this pitch (tone) belongs. In other words, a pitch may be identified by the fact that its pitch class (e.g. C) and the associated octaving and/or octave position are determined. Thus, for example, the pitches C, C′, C″, C′″ comprise the pitch class C.
It is a special advantage of the present invention that the vector calculation means may be implemented such that the unit vectors, which are associated with the pitch classes, the semitones or the elements of the definition amount, comprise an angle value regarding a preferential direction, so that the two-dimensional sum vector may be used within the context of an arrangement of pitch classes referred to as “circle of thirds” or an arrangement referred to as “symmetry model”, to represent connections regarding music theory in an especially efficient and simple way.
It is a further advantage of the present invention, that the semitone analysis means may analyze the audio datum with regard to a plurality of different volume information distributions. Thus, the volume information distribution may comprise information regarding an amplitude, an intensity, a volume, a hearing-adapted volume or other volume information. By this, depending on the application-specific circumstances, the inventive device for analyzing an audio datum may analyze the same regarding different pieces of volume information adapted to the application and thus enable an especially efficient analysis.
It is a further advantage that the inventive device may also output an analysis signal which comprises a time course in case the audio datum comprises a time course. By this, for example, an analysis of a piece of music in real time is possible, so that the analysis signal may be used during the course of the piece of music for controlling further devices and/or, after being displayed on a display device, may provide a person with information regarding the music theory of the piece of music.
Here, the audio datum may be provided to the inventive device in different forms. Thus, it is possible to provide the audio datum in the form of a microphone signal, a line signal, an analog audio signal, a digital audio signal, a MIDI signal, a note signal, a note sequence signal, an analog control signal for controlling a sound generator or a digital control signal for controlling a sound generator, so that the inventive device for analyzing an audio datum may be used within the scope of many applications, which represents a further substantial advantage.
As the embodiments will show, thus the inventive device may, for example, be used in an accompaniment system, which apart from the inventive device includes an accompaniment device, which is coupled to the inventive device for analyzing an audio datum and implemented such that the accompaniment device may receive the analysis signal and provide a corresponding note signal based on the analysis signal. Thus, for example, the accompaniment device of an accompaniment system may be implemented such that, based on the analysis signal, the same determines a chord and/or a diatonic scale and provides corresponding note signals based on the determined chord and/or the determined diatonic scale and/or both. The inventive device may thus be integrated into an accompaniment system which enables a very flexible, automatic and efficient provision of a note signal for the accompaniment of the piece of music underlying the audio datum. It is thus a substantial advantage of the present invention that the inventive device may be integrated into an accompaniment system which comprises the above-mentioned characteristics.
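The mapping from an analysis signal to note signals can be illustrated with a minimal sketch. It assumes that the analysis signal carries an angle of the sum vector in radians, and that the accompaniment device holds a lookup table of reference angles and chords. The table `CHORD_TABLE`, its angles, the function name `notes_for_angle` and the use of MIDI note numbers are all hypothetical placeholders chosen for illustration; they do not reproduce the circle of thirds or the symmetry model.

```python
import math

# Hypothetical lookup table: (reference angle in radians, chord name, MIDI notes).
# The angles are placeholders, not the positions used by the circle of thirds
# or the symmetry model.
CHORD_TABLE = [
    (0.0,               "C major", [60, 64, 67]),
    (math.pi / 2,       "F major", [65, 69, 72]),
    (math.pi,           "G major", [67, 71, 74]),
    (3 * math.pi / 2,   "A minor", [69, 72, 76]),
]


def circular_distance(a, b):
    """Smallest absolute difference between two angles, in radians."""
    d = abs(a - b) % (2 * math.pi)
    return min(d, 2 * math.pi - d)


def notes_for_angle(alpha, table=CHORD_TABLE):
    """Return (chord name, notes) of the table entry closest to angle alpha."""
    _, name, notes = min(table, key=lambda e: circular_distance(alpha, e[0]))
    return name, notes


chord_a = notes_for_angle(0.1)    # slightly past the first reference angle
chord_b = notes_for_angle(-0.1)   # wraps around the circle
chord_c = notes_for_angle(3.0)    # close to pi
```

A nearest-angle lookup is only one possible mapping function; an actual accompaniment device might instead use angular ranges or also take the length of the sum vector into account.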
It is a further advantage of the present invention, that the inventive device may be integrated into a measurement system which further comprises a display device, which is coupled to the inventive device to receive the analysis signal and which is implemented, based on an angle of the sum vector, to provide an output signal indicating the same. If the output device, for example, has an output field having an output field center and an output field preferential direction, then the display device may accentuate an output field radial direction based on the angle of the sum vector on the output field. From this, the advantage results, that the analysis signal representing the sum vector may be geometrically represented on the output field and that, by this, the analysis signal may be presented to a person in an especially understandable way.
This advantage is in particular increased when the output field and the device for analyzing an audio datum use a geometric arrangement of pitch classes, as they occur in the above-mentioned circle of thirds or symmetry model. By this, the meaning of the analysis signal regarding music theory may be presented to a user of the measurement system in an even more efficient way.
In addition to that, it is possible not only to represent the angle of the sum vector on the display device but also a length of the same which, for example, indicates an estimate for the tonal context and/or the definedness of the key or the consonance and/or dissonance or the present chord, which represents a substantial advantage.
In addition to that, the inventive device may also be used in a detection system which, apart from the inventive device for analyzing an audio datum, further comprises an integrator device and an evaluation device, which enables an automatic detection of a change of chord and/or a change of key.
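The evaluation step of such a detection system can be sketched as follows, assuming that the length of the sum vector is already available as a list of values sampled over time. The integrator is not modeled here, and the function name `detect_extrema` and the strict-neighbor criterion are illustrative assumptions rather than the method prescribed by the text.

```python
def detect_extrema(lengths):
    """Return (index, kind) for each strict local maximum or minimum.

    lengths: time course of the sum vector length, one value per time step.
    A value counts as an extremum if it is strictly larger (or strictly
    smaller) than both of its direct neighbors.
    """
    detections = []
    for i in range(1, len(lengths) - 1):
        prev, cur, nxt = lengths[i - 1], lengths[i], lengths[i + 1]
        if cur > prev and cur > nxt:
            detections.append((i, "max"))
        elif cur < prev and cur < nxt:
            detections.append((i, "min"))
    return detections


lengths = [0.2, 0.5, 0.9, 0.6, 0.3, 0.4, 0.8]
detections = detect_extrema(lengths)
```

In practice the time course would likely need smoothing or a threshold before extrema are reported, since small fluctuations would otherwise each trigger a detection signal.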
Other features, elements, steps, characteristics and advantages of the present invention will become more apparent from the following detailed description of preferred embodiments of the present invention with reference to the attached drawings.
In the following, preferred embodiments of the present invention are explained in more detail with reference to the accompanying drawings, in which:
With reference to
The present application is structured as follows: first, with reference to one embodiment, the basic setup and the basic functioning of an inventive device for analyzing an audio datum and of three systems, which include the inventive device, are explained. Subsequently, the synthesis and the analysis of tone combinations will be explained in more detail, before an introduction into two different positioning variants is given. Hereupon, a mathematical model description follows, which serves for a further understanding of the present invention. Subsequently, a symmetry model-based and a circle of thirds-based harmony analysis will be explained, before further embodiments are explained and discussed.
If an audio datum is provided to the semitone analysis means 110 at the input terminal 130, then the semitone analysis means 110 analyzes the audio datum with regard to a volume information distribution across an amount of semitones and makes the same available to the vector calculation means 120 or optionally a distribution derived from the same. The vector calculation means 120 now calculates a two-dimensional intermediate vector based on the volume information distribution or the distribution derived from the volume information distribution, for each semitone or each element of a definition amount, via which the derived distribution was determined. Subsequently, the vector calculation means 120 calculates a sum vector based on the two-dimensional intermediate vectors and outputs the same as an analysis signal to the output terminal 140.
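The two calculation steps described above may be written compactly. The notation is an assumption introduced here for illustration: $d_k$ denotes the value of the volume information distribution (or of the derived distribution) for element $k$ of the definition amount $M$, and $\varphi_k$ denotes the angle of the unit vector associated with that element, for the case in which the intermediate vectors are formed by weighting unit vectors.

```latex
\vec{v}_k = d_k \begin{pmatrix} \cos\varphi_k \\ \sin\varphi_k \end{pmatrix},
\qquad
\vec{s} = \sum_{k \in M} \vec{v}_k
```

Here $\vec{v}_k$ is the two-dimensional intermediate vector of element $k$ and $\vec{s}$ is the sum vector on which the analysis signal is based.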
To explain this in more detail, in
Depending on the concrete implementation of the device 100 for analyzing an audio datum, the semitone analysis means 110 may be set up in a different way. It is decisive here, in which form the audio datum is present. If the audio datum is, for example, a note sequence signal and/or a control signal, i.e. a signal which, for example, indicates to a sound generator which note and/or which pitch it has to play, the semitone analysis means 110 of the device 100 for analyzing an audio datum may store the corresponding note sequence signals in a memory. The semitone analysis means 110 may then, for example on the basis of the note sequence signals stored in the memory, combine or “sum up” all note sequence signals which belong to a certain semitone, to subsequently provide the same as a volume information distribution to the vector calculation means 120. Here, depending on the concrete implementation of the semitone analysis means 110, the volume information distribution may be weighted according to a number of note sequence signals which belong to a certain semitone. If the note sequence signals comprise volume information, for example in the form of attack and/or touch values or other data indicating the volume, then the semitone analysis means 110 may gain the volume information distribution over the amount of semitones by putting together the corresponding note sequence signals. Examples of note sequence signals are MIDI signals (MIDI=musical instrument digital interface) or other digital or analog control signals for sound generators.
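The summing of note sequence signals per semitone can be sketched as follows. The input format, a list of (semitone number, volume value) pairs in the style of MIDI note numbers and velocities, and the function name `semitone_distribution` are assumptions made for illustration only.

```python
from collections import defaultdict


def semitone_distribution(events, use_velocity=True):
    """Sum stored note events per semitone.

    events: iterable of (semitone_number, velocity) pairs, for example
    MIDI-style note numbers with velocities (hypothetical input format).
    use_velocity: if True, each event contributes its volume value;
    if False, each event counts once, so the result is weighted by the
    number of events per semitone.
    """
    distribution = defaultdict(float)
    for semitone, velocity in events:
        distribution[semitone] += velocity if use_velocity else 1.0
    return dict(distribution)


events = [(60, 100), (64, 80), (67, 90), (72, 50), (60, 20)]
by_volume = semitone_distribution(events)
by_count = semitone_distribution(events, use_velocity=False)
```

The two variants correspond to the two cases mentioned above: a distribution built from volume data carried by the signals, and a distribution weighted only by how many signals belong to each semitone.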
If, however, an analog or a digital audio signal is provided to the inventive device 100 for analyzing an audio datum, it may be required for the semitone analysis means 110 to analyze the audio signal with regard to its frequency composition, in order to obtain the volume information distribution over the amount of semitones. In the case of digital audio signals being the audio datum, such an analysis may, for example, take place with the help of a so-called constant-Q transformation. In a constant-Q transformation, the incoming audio signal is analyzed by a plurality of bandpass filters respectively characterized by a central frequency and a bandwidth. Here, the central frequency of a bandpass filter is chosen according to the frequency and/or basic frequency of a pitch. The basic frequency of a pitch (e.g. 440 Hz for the pitch A′) in this case corresponds to the central frequency of the bandpass filter which is responsible for an analysis of the audio datum with regard to this pitch and/or semitone. The bandwidth of the filters here corresponds to the distance of two pitches and/or tones in the frequency domain, so that the quotient of the central frequency and the bandwidth of every filter is constant. This also accounts for the term constant-Q transformation, as the letter Q here stands for quotient. Examples of digital audio signals are PCM signals (PCM=pulse code modulation), as they are, for example, used in connection with CDs. Depending on which digital audio signals are used, a further conversion into PCM signals or other digital audio signals may be required. One example of this is an MP3-encoded audio signal.
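The relation between central frequency and bandwidth can be illustrated by computing the filter parameters alone, without performing the transformation itself. The sketch assumes equal temperament, takes the bandwidth as the distance to the next higher semitone, and uses a lowest frequency of 27.5 Hz and 88 semitones as arbitrary example values; the function name `constant_q_filters` is hypothetical.

```python
def constant_q_filters(f_min=27.5, n_semitones=88):
    """Return a list of (central_frequency, bandwidth) pairs in Hz.

    Assumes equal temperament: neighboring semitones differ by a factor
    of 2**(1/12). The bandwidth of each filter is taken as the distance
    to the next higher semitone (an assumption made for this sketch).
    """
    ratio = 2.0 ** (1.0 / 12.0)
    filters = []
    for k in range(n_semitones):
        center = f_min * ratio ** k
        bandwidth = center * (ratio - 1.0)
        filters.append((center, bandwidth))
    return filters


filters = constant_q_filters()
# The quotient of central frequency and bandwidth is the same for every filter.
q_values = [center / bandwidth for center, bandwidth in filters]
```

With these assumptions the quotient is 1 / (2**(1/12) - 1), roughly 16.8, for every filter, and the filter at index 48 is centered at about 440 Hz.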
In the case of analog audio signals being the audio datum, for example a conversion and/or sampling of the analog audio signals into a digital audio signal may be required before a corresponding constant-Q transformation can be performed. This sampling of such an analog audio signal may, for example, be performed with the help of an analog/digital converter (ADC). Examples for analog audio signals are analog microphone signals, analog headset signals or line signals, as they are used, for example, in the field of stereo systems.
Optionally, a pitch class analysis means may be coupled between the semitone analysis means 110 and the vector calculation means 120, which calculates, on the basis of the volume information distribution over the amount of semitones, a pitch class volume information distribution over the amount of pitch classes as a definition amount. As already explained above, a pitch class is here information regarding a pitch disregarding the octave to which the pitch belongs. In other words, a pitch is determined by indicating the pitch class and the octaving, i.e. the indication to which octave the pitch belongs. Thus, the pitches C, C′, C″, C′″, . . . have the pitch class C. On the piano, twelve pitch classes are thus defined: D, D sharp, E, F, F sharp, G, G sharp, A, A sharp, B (also referred to as H), C and C sharp.
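Folding a semitone distribution into a pitch class distribution can be sketched as follows. The sketch assumes MIDI-style semitone numbering, in which semitone 0 is a C and twelve consecutive numbers span one octave; the names `PITCH_CLASSES` and `pitch_class_distribution` are illustrative.

```python
PITCH_CLASSES = ["C", "C sharp", "D", "D sharp", "E", "F",
                 "F sharp", "G", "G sharp", "A", "A sharp", "B"]


def pitch_class_distribution(semitone_distribution):
    """Fold a semitone distribution into twelve pitch classes.

    semitone_distribution: dict mapping a semitone number to a volume value.
    Assumes semitone 0 is a C, so the pitch class index is the semitone
    number modulo 12 and the octave is discarded.
    """
    folded = [0.0] * 12
    for semitone, volume in semitone_distribution.items():
        folded[semitone % 12] += volume
    return folded


semitones = {60: 120.0, 64: 80.0, 67: 90.0, 72: 50.0}
folded = pitch_class_distribution(semitones)
by_name = dict(zip(PITCH_CLASSES, folded))
```

In this example the two C pitches at semitones 60 and 72 are combined into a single value for the pitch class C.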
The semitone analysis means 110 may further consider a frequency-dependent weighting function g(f) in the determination of the volume information distribution, which weights the analyzed semitones depending on their pitch level and/or their frequency and/or their basic frequency f. By considering the weighting function g(f) it is possible to consider how different the influence of two pitches and/or semitones of the same pitch class but of a different frequency, and thus of different octaves, is on the perception of harmony in the case of a chord and/or harmony.
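The text leaves the form of g(f) open. Purely for illustration, the following sketch assumes a bell-shaped weighting over the octave distance from a reference frequency; the shape, the reference frequency of 440 Hz, the width of two octaves and the helper `weighted_distribution` are all hypothetical choices, not values taken from the description.

```python
import math


def g(f, f_center=440.0, width_octaves=2.0):
    """Hypothetical weighting function over frequency f in Hz.

    Returns 1.0 at f_center and falls off symmetrically with the distance
    in octaves from f_center (Gaussian shape, chosen only as an example).
    """
    octaves_off = math.log2(f / f_center)
    return math.exp(-0.5 * (octaves_off / width_octaves) ** 2)


def weighted_distribution(volumes_by_frequency):
    """Weight each volume value with g evaluated at its basic frequency."""
    return {f: g(f) * v for f, v in volumes_by_frequency.items()}


# Three pitches of the same pitch class A in different octaves, equal volume.
weighted = weighted_distribution({220.0: 1.0, 440.0: 1.0, 3520.0: 1.0})
```

With such a weighting, pitches of the same pitch class but in different octaves contribute differently to the distribution, which is the effect the weighting function is meant to capture.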
The vector calculation means 120 may, for example, be implemented such that to each semitone or each pitch class a two-dimensional unit vector is assigned, which is weighted and/or multiplied with the associated component of the volume information distribution and/or the distribution derived from the volume information distribution. The vector calculation means 120 may do this, for example, on the basis of Cartesian coordinates with the help of a corresponding arithmetic logic unit. Likewise, the subsequent calculation of the sum vector 160 may take place on the basis of the intermediate vectors with the help of a (digital) arithmetic logic unit on the basis of Cartesian coordinates. Depending on the implementation of the inventive device 100 for analyzing an audio datum, the analysis signal may include the length r and the angle α of the sum vector with regard to a differential direction in the form of a digital data package.
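The calculation of the sum vector in Cartesian coordinates may be sketched as follows. The angle convention (measured counterclockwise from the x-axis) and the name `sum_vector` are assumptions for this illustration; the angles assigned to the individual pitch classes are passed in by the caller.

```python
import math

def sum_vector(volumes, angles_deg):
    """Return (r, alpha_deg) of the sum of volume-weighted unit vectors.

    volumes:    one volume value per semitone or pitch class
    angles_deg: the angle assigned to the same semitone or pitch class
    """
    x = 0.0
    y = 0.0
    for volume, angle in zip(volumes, angles_deg):
        # intermediate vector = unit vector scaled by the volume
        x += volume * math.cos(math.radians(angle))
        y += volume * math.sin(math.radians(angle))
    r = math.hypot(x, y)
    alpha = math.degrees(math.atan2(y, x)) % 360.0
    return r, alpha
```

Two equally loud components at 0° and 90° give a sum vector pointing at 45°, while two equally loud components on opposite sides of the circle cancel, so that the length r becomes (numerically almost) zero.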
The accompaniment device 180 may, for example, be implemented such that a mapping function links the angle α of the sum vector 160 with an amount of note signals, which are output at the accompaniment system output 185. One example for the determination of a chord and/or the tonal context is explained in more detail in connection with
Optionally, in addition to that, the accompaniment system 170 may be extended by a melody detection means and a melody generation means, which are coupled to each other. The melody detection means detects a melody signal which is, for example, the audio datum, which is supplied to the device 100, which may, however, also be another audio signal, analyzes the same with regard to a volume information distribution over an amount of semitones and provides the same to the melody generation means as the melody detection signal. The melody generation means in turn generates a melody note signal on the basis of the melody detection signal, which may, for example, be supplied to an optional sound generator.
Thus, a melody audio datum, for example singing, may be provided to the melody detection means, for example via a suitable input, via a microphone input or another digital or analog audio signal, which is analyzed by the melody detection means. On the basis of the result of the melody detection means, the melody generation means may generate a melody note signal which may, for example, be provided to a sound generator, so that the same may replay the sung melody. By this, the accompaniment system 170 is able to simultaneously replay, for example, a sung melody and accompany the same.
The output field 210 may, for example, include an LCD display (LCD=liquid crystal display), a screen or another optical display area, like in the form of a field of light emitting diodes arranged in a matrix (LED=light emitting diode). Likewise, the output field 210 may include a TFT display (thin film transistor), a screen or another pixel-oriented display field. Depending on the concrete implementation of the output field 210, the display control means 205 may control the output field 210 such that, based on a central point 215, any output field radial direction may be optically accentuated. In the case of a field of light emitting diodes arranged in a matrix, this may, for example, be realized by the fact that, starting from a light emitting diode associated with the central point 215, a plurality of light emitting diodes is controlled by the display control means 205, which originate from the central point 215 in a straight line.
In the case of an output field which enables a more complex illustration, like, for example, a TFT display or an LCD display, the display control means 205 may be implemented to represent more complex patterns. In this case, not only an output field radial direction may be accentuated, but more complicated patterns may be represented. Thus it would in this case be obvious to represent an arrangement of pitch classes and/or pitches on the display 210, in connection with which the sum vector, which is provided by the inventive device 100 in the form of the analysis signal, is to be made clearer for a viewer of the measurement system 190.
Independent of the concrete implementation of the output field 210, the display control means 205 controls the output field 210 such that, starting from the central point 215, the sum vector is illustrated in the form of an output field radial direction or a more complicated pattern. In
Optionally, via the length of the illustrated arrow 220 also the length of the sum vector may be illustrated. The length of the arrow 220 and the length of the sum vector may here be linked to each other via a function which may, for example, be implemented within the context of the display control means 205. Here, virtually any functions are possible. Thus, a simple linear assignment may take place just like, for example, a logarithmic, a quadratic, or another, perhaps more complicated, mapping of the length of the sum vector to the length of the illustrated arrow 220.
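The linear, quadratic and logarithmic assignments mentioned above may be sketched as follows. The function name `arrow_length`, the normalization to a maximum length `r_max`, the pixel scale `max_pixels` and the particular logarithmic curve are all assumptions chosen for the illustration.

```python
import math

def arrow_length(r, mode="linear", r_max=1.0, max_pixels=100.0):
    """Map the length r of the sum vector to the length of the arrow."""
    # clamp and normalise the length to the range 0..1
    normalised = min(max(r, 0.0), r_max) / r_max
    if mode == "linear":
        scaled = normalised
    elif mode == "quadratic":
        scaled = normalised * normalised
    elif mode == "logarithmic":
        # log curve through (0, 0) and (1, 1); the factor 9 is arbitrary
        scaled = math.log1p(9.0 * normalised) / math.log1p(9.0)
    else:
        raise ValueError("unknown mode: " + mode)
    return scaled * max_pixels
```

A logarithmic mapping of this kind stretches short sum vectors, whereas the quadratic mapping suppresses them.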
Within the scope of the present application, in the notation of the pitch classes, there is a difference between upper-case and lower-case pitch classes. If a pitch class is designated by an upper-case letter, like, for example, C or F, the corresponding major triad sounds when the corresponding pitch class and the two pitch classes which are adjacent to the corresponding pitch class in a clockwise direction are selected. In the case of C, this means that the pitch classes C-e-G for example represent a C major triad. Accordingly, the three pitch classes F, a and C together represent an F major triad. Pitch classes which are designated by lower-case letters correspondingly represent minor triads. An example of this is the D minor triad which includes the pitch classes d, F and a. The triad designated by h0 has a special status, which is the diminished triad h0 when, based on the pitch class h0, the two clockwise adjacent pitch classes also sound. Here, this is the triad h/b-d-F which consists of a sequence of two minor thirds.
Basically, it is also possible that the output field 210 is not a screen or a screen-like output field which passes on information to a viewer optically, but that it is here, for example, a mechanical output field, wherein individual output field radial directions, output field areas or parts of the output field may be accentuated mechanically. It is in this connection also possible that such an accentuation may take place by a mechanical vibration or by a lifting or lowering of a certain area. By this, it is possible to offer a corresponding representation also to visually handicapped people.
Optionally, the display control means 205 may additionally also be implemented to accentuate an output field radial direction of the output field 210 or an area of the output field 210 which is associated with a pitch class of the symmetry model 217 or the circle of thirds 217′, when a corresponding signal is transmitted to the display control means 205.
Of course, on the output field 210 also other arrangements of pitch classes or semitones may be illustrated. Arrangements of pitch classes are especially sensible in this context, in which pitch classes are associated with adjacent angles, which are based on special connections regarding music theory. The selection of a concrete output field preferential direction here represents no limitation regarding the term “adjacent angle” or “directly adjacent angle”. Thus, for example, an angle to which a pitch class is associated and which is located at an angle value of 359° may be directly adjacent to another angle to which a pitch class is associated and which is located at an angle value of 1°.
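That adjacency of angles does not depend on where the preferential direction is placed may be illustrated by a distance measure which wraps around the full circle. The function name `angular_distance_deg` is hypothetical.

```python
def angular_distance_deg(a, b):
    """Smallest angle in degrees between two directions on a circle."""
    diff = abs(a - b) % 360.0
    return min(diff, 360.0 - diff)
```

With this measure the angles 359° and 1° are 2° apart, although their numerical values differ by 358.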
If the time-dependent audio input signal is a note sequence signal, like, for example, a midi signal, the integrator means 240 may be implemented such that the number of parts of the note sequence signal referring to one pitch are added up. Here, a weighting of the volume information, which the note sequence signal includes, may be considered just like other weighting factors. Further, for example, the integrator means 240 may consider the “age” of a note sequence signal, i.e. a time difference between the arrival of a note sequence signal and a current time index. The integrator means 240 may, in this case, provide the audio datum to the inventive device 100 in the form of a further note sequence signal.
If the time-dependent audio input signal is an analog or a digital audio signal, like, for example, an analog microphone signal, it may be advisable to integrate a semitone analysis means into the integrator means 240, as it was already explained in connection with
Downstream of the inventive device 100 for analyzing an audio datum, the evaluation device 250 is connected, which receives the analysis signal from the device 100. The analysis signal of the device 100 in this case includes the length of the sum vector.
If the integrator means 240 is implemented such that it provides the time-dependent audio input signal in a time-integrated way as an audio signal to the device 100, for example in regular intervals, and if in addition to that the device 100, for example, performs the analysis in regular time intervals with a predetermined frequency and correspondingly respectively outputs the analysis signal, then the evaluation means 250 may determine a time course of the length of the sum vector on the basis of the incoming analysis signal, analyze the same and, if the time course of the length of the sum vector comprises a maximum or a minimum, output a detection signal at an output of the detection system 230. By this, the detection system 230 is able, for example, to detect a change of chords or a change of key. More details about this topic are explained in the further course of the present application.
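The evaluation of the time course of the length of the sum vector may be sketched as follows. The sketch assumes that the lengths arrive as a list sampled at regular intervals and reports strict local maxima and minima; the name `detect_extrema` and the simple three-point comparison are assumptions, and a practical implementation would presumably smooth the time course first.

```python
def detect_extrema(lengths):
    """Return (index, kind) for every strict local maximum or minimum.

    lengths: sum vector lengths sampled at regular time intervals
    kind:    "max" or "min"
    """
    events = []
    for i in range(1, len(lengths) - 1):
        prev, cur, nxt = lengths[i - 1], lengths[i], lengths[i + 1]
        if cur > prev and cur > nxt:
            events.append((i, "max"))
        elif cur < prev and cur < nxt:
            events.append((i, "min"))
    return events

events = detect_extrema([0.9, 0.8, 0.3, 0.7, 0.95, 0.9])
```

In this example a minimum is reported at index 2 and a maximum at index 4; each reported event would correspond to a detection signal.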
Optionally, also the detection signal of the evaluation device 250 may be supplied to the integrator means 240, as the connection in dashed lines in
Alternatively, the detection system may further be realized such that the integrator means 240 is connected between the semitone analysis means 110 and the vector calculation means 120. In other words, the detection system may further be implemented such that the integrator means 240 is implemented as an optional component of the inventive device 100. In this case, the integrator means 240 may be implemented such that the same, on the basis of the volume information distribution, provides a distribution derived from the same to the vector calculation means or a downstream pitch class calculation means.
A further embodiment of the present invention represents a key determination system, which, apart from an inventive device for analyzing an audio datum, comprises a key determination means, which is coupled to the inventive device. The key determination means receives the analysis signal from the inventive device and analyzes the current key or alternatively the current chord based on the angle of the sum vector included in the analysis signal. The key determination means may perform this, for example, on the basis of a key assignment function, which assigns the angle of the sum vector to a key or a chord. More detailed explanations regarding this are given in the further course of the present application in connection with the “symmetry model”, the “third circle” and their mathematical description. Optionally, in addition to that, the key determination means may provide an estimate for the reliability of the analysis also on the basis of the analysis signal. Here, the length of the sum vector, which is also included in the analysis signal, may be used as a basis. Here, the estimate may be determined on the basis of a further functional assignment which assigns a certain estimate value to a length value of the sum vector. This further functional assignment may include a simple linear mapping, a step function or a more complicated function. The key determination means outputs the key and, if applicable, the estimate as the key signal at an output which may, for example, be output at an optional display device.
The chromatic scale consists of a sequence of twelve semitones which respectively have a pitch interval of a minor second. In other words, the chromatic scale includes twelve semitones which belong to an octave. To each pitch and semitone thus a frequency of a sound wave or another mechanical vibration is assigned. Due to the conventional division of the audible spectrum into octaves with respectively exactly twelve semitones in western music, each pitch and semitone of a certain octave and within an octave may thus be associated with a certain pitch class. In other words, this means that a semitone is clearly determined by the octave and its pitch class.
In other words, this means that a pitch class is referred to when, with regard to a pitch, it is disregarded to which octave it belongs. In western music and its instruments, i.e., for example, the piano, twelve pitch classes D, D sharp, E, F, F sharp, G, G sharp, A, A sharp, B and/or H, C and C sharp are defined, wherein, for reasons of clarity, enharmonic mix-ups are not mentioned here.
In music, a prime or a prime interval designates an interval of one semitone, wherein the starting pitch and the ending pitch are both counted. In other words, two pitches with a prime interval have the same frequency and/or basic frequency (frequency ratio of the pitches 1:1), so that it is the same pitch. A minor second or an interval of a minor second in music is a pitch interval of two semitones, wherein also here the two pitches forming the interval are counted. Accordingly, a minor third and/or an interval of a minor third is a pitch interval of four semitones, a major third or a major third interval is an interval of five semitones and a fifth and/or a fifth interval is an interval of eight semitones, wherein the two pitches forming the interval are respectively counted. Counted as semitone steps between the two pitches, these intervals thus correspond to zero, one, three, four and seven steps, respectively.
In the notation of pitch classes, within the scope of the present application there is often a difference between upper-case and lower-case pitch classes. If a pitch class is designated by an upper-case letter, like, for example, C or F, this indicates that the corresponding pitch class is the base pitch (keynote) of a corresponding major triad, i.e. in the above case a C major triad or an F major triad. Correspondingly, pitch classes within the scope of the present invention representing a base pitch of a minor triad are designated by lower-case letters. An example of this is the a minor triad.
To enable a better understanding of the embodiments discussed in the further course of the present invention, first of all the synthesis of sensibly sounding pitch combinations will now be examined before the analysis of pitch combinations, the positioning variants of base pitches in the pitch space, the mathematical model description and the harmony analysis based on the symmetry model and on the circle of thirds are described in further sections.
The basic principle behind all embodiments proposed in this document is the following: in a so-called pitch space, base pitches and/or pitch classes are placed so that adjacent pitches and/or pitch classes make sensibly sounding pitch combinations. Here, within the scope of the present application, in general an oval/circular arrangement of the base pitches is taken as a basis. Due to this placement, it is possible to create harmonically sounding music by the selection of a suitable level section or space section. Based on the arrangement of the base pitches in an oval/circular arrangement, the level section and/or range/space section includes at least one input angle or one input angle range, as far as an input angle or input angle range was selected by the user at all. The selected space section may be varied infinitely or in leaps regarding its extension and its center of mass, i.e. its position. Apart from that, it is possible to occupy the selected space section with a selection weighting function. The selection weighting function makes it possible to define the relative volume at which the base pitches and/or pitch classes detected by the space section are to be played. Base pitches are thus placed at discrete positions of the pitch space.
But what happens with the positions in between? Which pitches sound when a space section was selected which lies in between two discrete base pitches? In order to solve this problem, in addition to the selection weighting function, a spatial pitch distribution function is defined. Each base pitch and/or pitch class placed in the pitch space has a function, which is in this case called a spatial single pitch distribution function. By introducing the spatial pitch distribution function and/or the spatial single pitch distribution function, wherein a corresponding spatial single pitch distribution function is associated with each pitch class and/or each base pitch, the spatial pitch distribution function results as an overlay (e.g. by addition, considering the pitch classes) of the spatial single pitch distribution function. The spatial pitch distribution function thus ensures that a pitch not only occupies an infinitely small discrete point and/or in case of an oval/circular pitch space an individual angle, but a section of a range and/or a finite angle range. The space sections occupied by two base pitches may here overlap. Thus, an angle may have more than one associated pitch class, in particular two pitch classes. The principles presented here thus offer completely new possibilities in the design of polyphonic audio signals, as it will become clear regarding the description of the embodiments in the further course of the present application.
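How a spatial single pitch distribution function may let an input angle between two base pitches activate both of them may be sketched as follows. The triangular shape of the function, its half width and the positions of the two base pitches are assumptions chosen for the illustration, as are all function names.

```python
def angular_distance_deg(a, b):
    """Smallest angle in degrees between two directions on a circle."""
    diff = abs(a - b) % 360.0
    return min(diff, 360.0 - diff)

def single_pitch_distribution(angle_deg, center_deg, half_width_deg):
    """Triangular distribution (assumed shape): 1 at the base pitch,
    falling linearly to 0 at half_width_deg away from it."""
    distance = angular_distance_deg(angle_deg, center_deg)
    return max(0.0, 1.0 - distance / half_width_deg)

def sounding_pitches(input_angle_deg, base_pitch_angles, half_width_deg=60.0):
    """Return the relative volume of every base pitch hit by the input angle."""
    result = {}
    for name, center in base_pitch_angles.items():
        level = single_pitch_distribution(input_angle_deg, center, half_width_deg)
        if level > 0.0:
            result[name] = level
    return result

# Hypothetical positions of two neighbouring base pitches.
positions = {"C": 0.0, "E": 60.0}
between = sounding_pitches(30.0, positions)
on_c = sounding_pitches(0.0, positions)
```

With these assumed values, an input angle halfway between the two base pitches yields both at half volume, whereas an input angle exactly on C yields C alone.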
Possibilities offered by this arrangement of basic pitches in the pitch space are explained in more detail in the further course regarding
The middle partial illustration of
If now a sound generator is connected, and if a very narrow space section or also an individual input angle is shifted as an input angle range respectively starting from the base pitch C from left to right to the base pitch E, the following will happen regarding the sound: in the case illustrated on the left in
In connection with
The opening angle of the symmetry circle and/or the selected space section may also be interpreted as the “jazz factor”. The greater the angle, the more jazz-typical pitches (tones) sound and/or are added. Among those are 7th chords, 7th-9th chords and 7th-9th-13th chords.
In the following, the basic principle for the analysis of a pitch combination is explained in more detail. The principle for the synthesis of sensible sound combinations described in the above paragraphs may be reversed to analyze existing sound combinations. Just like in the synthesis, in a first step base pitches have to be positioned in the pitch space in such a way that adjacent base pitches result in sensible sound combinations. The thus generated pitch space is, however, not used to determine pitches to be generated but, if applicable, to represent and analyze already existing pitches. By this it is possible to examine whether an existing pitch combination is “sensible” or not with regard to the definition existing in the form of the pitch space. If a pitch combination is sensible, then the base pitches of this pitch combination are represented in spatially adjacent areas. If a pitch combination is less sensible, the base pitches are illustrated in remote areas. The advantage of this principle is that the term “sensible pitch combination” and the term “senseless pitch combination” are not rigid, but may be redefined by a reorganization of the base pitches in the pitch space.
In each of
If pitches resulting in a less sensible pitch combination sound simultaneously, then the corresponding base pitches in the pitch space and thus on the output field which spatially models the pitch space are very far apart. From this it may be deduced that the spatial extension of active base pitches in the pitch space is a measure for senselessness, i.e. for the perceived dissonance. In the example illustrated in
With several sounding pitches it is possible not only to mark the associated base pitches, but also to calculate a corresponding area on the output field 210 which includes the sounding pitches, and a center of mass (focus; gravity) of all sounding pitches in the pitch space and represent the same by a corresponding marking. Such a calculation is possible with the help of the sum vector, which is explained mathematically further below, which is included in the analysis signal. The center of gravity again enables to assess the sound color of complicated pitch combinations as it is explained in more detail in the further course of the invention.
What is a “sensible pitch combination” and what is a “senseless pitch combination”? There is no general answer to this question. What we think to be sensible and what we think to be senseless or what we think to be consonant and/or to be dissonant strongly depends on subjective factors like taste, culture, education, etc. and may differ from person to person. Just as no global answer can be given to the above question, it is not possible to find an arrangement of base pitches in the pitch space which provides valid statements for all people and all musical styles. It is, however, possible to find positioning variants, with the help of which statements about tonal connections and perceived sound perceptions may be made which hold true for a great number of persons. The circle of thirds and the symmetry model, which are explained in the following paragraphs, are two systems which enable exactly this.
The symmetry model enables defining and/or analyzing many tonal connections for pieces of music which follow the classical major cadence. The technical use of the symmetry model is new. The explanations in this section are based on the example of the C major scale and may be applied to all other major scales. In summary, it may be said that the key differentiation features of the symmetry model are
A second, very critical feature is the symmetrical arrangement of the pitches around an imaginary symmetry axis 360. The symmetry axis 360 runs exactly through the location 350-D of the second pitch of the scale (D), which is why the same is referred to as symmetry pitch. The remaining and/or further pitches of the scale are positioned symmetrically left and right around the symmetry pitch 350-D.
If the order and the symmetry of the pitches is maintained, different possibilities remain to determine the exact position of the base pitches. One possibility which is used within the scope of the symmetry model is to position the pitches on the circle according to their pitch interval. For this purpose, the circle is divided into 24 segments 370, with an opening angle of the segment of 360°/24=15°. Each segment 370 corresponds to a semitone interval, as it is indicated in
The pitches (tones) and/or pitch classes 350-E to 350-C are thus not distributed equidistantly on a circle with regard to the angle. Rather, they are spaced apart according to the respectively smallest pitch distance to the neighbor pitch and/or to the neighboring pitch class. Because, as it was explained above, the symmetry model is based on a division of the circle into 24 segments 370, the angle which is assigned to a certain pitch class and/or a certain pitch may be indicated by introducing a designator n′. The designator n′ is an integer number from the set of numbers {2, 5, 9, 12, 15, 19, 22} and designates the angle under which a certain pitch class appears, according to the linear mapping
$$\alpha_T = n' \cdot \frac{2\pi}{24} \bmod 2\pi$$
wherein αT represents the angle of a pitch class T in radian measure depending on the designator n′ of the pitch class and π is the circle constant. An exact assignment of the pitch classes T, the designators n′, the angles in degree and the angles in radian measure is listed in the following table.
By a simple extension, the designator n′ may represent the angle αT not only of the pitch classes within one octave, but of all pitches of the corresponding major scale. Here, for each octave the designator n′ has to be increased or decreased by 24. If, for example, by definition, the pitch C′ has a designator n′=22, then in this case the pitch C″ would have a designator n′=46 and the pitch C would have a designator n′=−2.
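The linear mapping from the designator n′ to the angle, including its extension over several octaves, may be sketched as follows. The assignment of designators to the pitch classes of C major used here is an assumption: it is inferred from the designator set {2, 5, 9, 12, 15, 19, 22}, from n′=22 for the pitch C′ and from the sequence of thirds, and is not taken from the table itself.

```python
import math

# Assumed assignment for C major (inferred, see the lead-in above).
DESIGNATORS = {"E": 2, "G": 5, "B": 9, "D": 12, "F": 15, "A": 19, "C": 22}

def angle_rad(designator):
    """Angle in radian measure: n' * 2*pi/24 mod 2*pi."""
    return (designator * 2.0 * math.pi / 24.0) % (2.0 * math.pi)

def angle_deg(designator):
    """The same angle in degrees: n' * 15 mod 360."""
    return (designator * 15) % 360

# Adding or subtracting 24 per octave leaves the angle unchanged.
c_angles = [angle_deg(n) for n in (-2, 22, 46)]
```

All three designators of the pitches C, C′ and C″ map to the same angle, since the octave is disregarded by the modulo operation.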
Here, a tonic area is an area of the symmetry model illustrated in
From the symmetry model, many sensible tonal connections result which may, on the one hand, be used for the synthesis and, on the other hand, for the analysis of audio and pitch information. In the following, some of these connections are listed:
The symmetry model and/or the cadence circle is described, explained and discussed regarding music theory in more detail in the above-cited dissertation by David Gatzsche.
In other words, the symmetry model, compared to the diatonic scale, enables a playful and thus pedagogically more valuable introduction to principles of music theory, which are summarized and explained again in the following. Here, the focus is on conveying knowledge about music theory to children. Principles of pedagogy and music theory are generally very obscure. As the description of this embodiment will show, the musical instrument described here presents an input method which is so simple that even infants or highly handicapped persons may be musically creative.
The question now is, why there are exactly seven pitch classes? The answer is as follows: the most common scale in western latitudes is the so-called diatonic scale. This scale has seven pitches. On the piano, seven adjacent white keys exactly correspond to the diatonic scale for C major and/or a minor. A substantial innovation of the symmetry model is the arrangement of pitch classes:
on the piano, the pitch keys are arranged in semitone steps and whole steps. From this, the pitch sequence and/or pitch class sequence C-D-E-F-G-A-(B and/or H)-C results. In the symmetry model, however, the keys are arranged in intervals of thirds: starting with the pitch D, minor and major thirds alternate. Thus, the following pitch sequence and/or pitch class sequence results: D-F-A-C-E-G-(B and/or H)-D.
The pitch classes are not arranged on a line like on a piano, but on a circle, i.e. the symmetry circle of the symmetry model. Basically, also other oval/circular arrangements, as defined in the introductory sections of the present application, are possible here. The circle comprises a circle center. A vertical imaginary axis runs through the circle center and is referred to in the following as the symmetry axis 360. With the help of the symmetry axis 360, every pitch class 350-C to 350-A may be represented by an angle α between the symmetry axis 360 and a connecting line between the corresponding pitch class and the circle center.
The white keys on the piano are of equal width, no matter whether two neighboring keys represent a whole step or a semitone step. In the symmetry model, the pitch classes are not arranged at equal intervals and/or angles, due to the oval/circular arrangement, but at an (angle) interval (distance) which corresponds to the pitch interval and/or pitch step between the two pitch classes. This means that two adjacent pitch classes which correspond to a (smallest) pitch interval of a major third are arranged further apart on the circle and/or the symmetry circle 915 than two pitch classes which have an associated (smallest) pitch interval which corresponds to a minor third. Thus, the distances of the individual pitch classes with regard to each other represent the (smallest) pitch interval of the associated pitches and/or pitch classes.
The exact arrangement and/or positioning of the pitch classes is calculated as follows: first of all, the symmetry circle is divided into 24 segments, which thus all in all correspond to two octaves. Each of these segments represents a semitone step. The opening angle of such a semitone segment is thus 360°/24=15°. A major third corresponds to four semitone steps, a minor third accordingly to three semitone steps. Thus, the following intervals on the circle result: if the tonal interval, i.e. the (smallest) pitch interval between two adjacent pitch classes is a major third, then the angle spanned by the two pitch classes is 4×15°=60°. If the tonal interval between two adjacent pitch classes is a minor third, then the interval/distance is 3×15°=45°.
The pitch classes are subsequently positioned and/or arranged on the circle as follows: the pitch class 350-D, which corresponds to the pitch class D, is arranged at the bottom center of the circle, i.e. under an angle α=180° with regard to the circle center point and a zero direction which runs vertically upwards in
To illustrate the arrangement of the pitch classes 350-C to 350-A in a better way, a plurality of dotted orientation lines are plotted starting from the circle center in
The pitch D (350-D) is referred to as the symmetry pitch as it is the only pitch which lies exactly on the symmetry axis 360 and because all other pitches of the scale are arranged mirror-symmetrically around this pitch. Opposite the symmetry pitch, the tonal center 930 is located (α=0°). It is referred to as the tonal center because common melodies in western latitudes usually start with pitches and end with pitches which are close to the tonal center.
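The construction described above (24 segments of 15°, the pitch class D at 180°, alternating minor and major thirds) and the resulting mirror symmetry may be checked with the following minimal sketch. The direction in which the angle increases and all names are assumptions made for the illustration.

```python
SEGMENT_DEG = 360.0 / 24.0  # opening angle of one semitone segment

# Pitch class and number of semitone steps to the next pitch class,
# following the sequence D-F-A-C-E-G-B-D of alternating thirds.
THIRDS_FROM_D = [("D", 3), ("F", 4), ("A", 3), ("C", 4),
                 ("E", 3), ("G", 4), ("B", 3)]

def symmetry_circle_positions(start_deg=180.0):
    """Place the pitch classes on the circle, starting with D at start_deg."""
    positions = {}
    angle = start_deg
    for name, steps in THIRDS_FROM_D:
        positions[name] = angle % 360.0
        angle += steps * SEGMENT_DEG
    return positions

def is_mirror_symmetric(positions):
    """Check symmetry about the vertical axis through 0 and 180 degrees."""
    angles = sorted(positions.values())
    mirrored = sorted((360.0 - a) % 360.0 for a in positions.values())
    return angles == mirrored

positions = symmetry_circle_positions()
```

Under these assumptions the pitch classes C and E come to lie at 330° and 30°, i.e. at equal distances on both sides of the tonal center at 0°.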
From the above-described arrangement of the pitch classes 350-C to 350-A, implicitly a number of connections regarding music theory open up, which currently still have to be learned with much effort. The symmetry model is especially also suitable for infants, as it allows a linking between geometrical positions and tonal connections. By this, it is a lot easier for the infant to learn connections regarding music theory later on.
In the following sections, an illustration of tonal connections and/or connections regarding music theory are summarized and/or repeated, which are conveyed by the symmetry model.
As children generally do not know names of pitches yet and cannot read the labeling of the pitch classes 350-C to 350-A, it would be obvious to optionally provide the pitch classes with a coloring and/or with symbols. One possible coloring is explained in the above-mentioned dissertation by David Gatzsche. Here, the color yellow is assigned to the tonic area which includes the pitch classes C and E. Red or orange are assigned to the dominant area which includes the pitch classes G and B. Blue is assigned to the subdominant area which includes the pitch classes A and F, while the color violet is assigned to the area which includes the pitch class D.
This coloring was chosen with regard to a “thermal feeling”, wherein bluish colors are assigned to the subdominant area, as the same implies “cold”. The dominant area has associated reddish colors here, as “warmth” is associated with the same. The tonic area has the associated color yellow being the “neutral area”, while violet is associated with the area in which the subdominant area and the dominant area abut. In areas between the tonic area and the subdominant area, between the tonic area and the dominant area and the area between the subdominant area and the dominant area, here the resulting mixed colors are assigned. In addition to that, the pitch classes, deviating from the illustration in
In the same way as the symmetry model maps connections within a diatonic key, the circle of thirds illustrates connections across keys, as is illustrated in
With regard to a section of the circle of thirds,
The circle of thirds is further perfectly suitable for illustrating relationships between keys. Related keys, i.e. keys which have many common pitches, are illustrated adjacently in the circle of thirds. Keys which have little to do with each other are positioned remotely in the circle of thirds. Based on the symmetry axis 405 of the key C major and/or a minor, thus also the type and the number of key signatures belonging to a key may easily be determined. Thus, for example in
This consideration may also be used for all further keys, as it is also illustrated in
A further positioning alternative for the circle of thirds and the symmetry model (symmetry circle) is to mirror the circle of thirds and/or the symmetry model about an axis which runs horizontally in the figures, so that in the case of the symmetry model the tonic area of a certain (major) key lies at the bottom, while the diminished area lies at the top. This offers different didactic advantages. In particular, it allows a pendulum analogy to be drawn between a (western) piece of music and its description, for example in the symmetry model. A (damped) pendulum is deflected in one direction, then swings for a while and comes to rest. The more strongly the pendulum is deflected to one side, the more strongly it will also swing in the other direction.
A pendulum which, for example, is hung up at a central point of the symmetry model, as it is, for example, illustrated in
Even if, within the scope of the present application, the circle of thirds, as it is, for example, illustrated in
As already described in the introductory paragraphs of the present application, the term pitch class is used when the octave to which a pitch belongs may be disregarded. On the piano, the twelve pitch classes D, D sharp, E, F, F sharp, G, G sharp, A, A sharp, B, C and C sharp are defined; enharmonic equivalents are omitted from this enumeration for clarity. Each pitch class t has an associated basic index $m_t$ and an extended index $n_t$. Both indices are integers, where $\mathbb{Z}$ denotes the set of integers. The following applies:
$0 \le m_t \le 11, \quad m_t \in \mathbb{Z}$ (1)
$-\infty < n_t < +\infty, \quad n_t \in \mathbb{Z}$ (2)
The basic index $m_t$ is a unique numbering of all 12 pitch classes. The extended index $n_t$ accounts for the fact that the pitch classes logically form a circle and/or may be arranged periodically on it, so that the last pitch class is again followed by the first. For this reason it is desirable that the extended index $n_t$ can be continued indefinitely. Each pitch class thus has many extended indices. Using the following calculation rules, the basic index and the extended index may be converted into each other:
$n_t = m_t + k \cdot 12, \quad k \in \mathbb{Z}$ (3)
$m_t = [(n_t \bmod 12) + 12] \bmod 12$ (4)
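The conversion rules of equations 3 and 4 can be sketched in a few lines of Python. The list `PITCH_CLASSES` and the function names are hypothetical; the sketch assumes that the basic indices follow the enumeration given above, with D at index 0 and the remaining pitch classes ascending in semitones.

```python
# Assumed index order: D = 0, ascending in semitones (enharmonic names omitted).
PITCH_CLASSES = ["D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B", "C", "C#"]


def basic_index(n_t: int) -> int:
    """Equation (4): fold an extended index n_t into the range 0..11."""
    return ((n_t % 12) + 12) % 12


def extended_indices(m_t: int, ks=range(-1, 2)) -> list[int]:
    """Equation (3): some extended indices n_t = m_t + 12*k of one pitch class."""
    return [m_t + 12 * k for k in ks]


def pitch_class_name(n_t: int) -> str:
    """Name of the pitch class that belongs to an extended index."""
    return PITCH_CLASSES[basic_index(n_t)]
```

In Python the `%` operator already returns a non-negative result for a positive modulus, so the second modulo is redundant here; it is kept to mirror equation 4, which also works in languages where the remainder of a negative number is negative.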
An important question is which pitch class t is given which basic index $m_t$. According to the prior art, the pitch and/or pitch class C is given the basic index $m_t = 0$ to indicate that this pitch is the base pitch of the simplest key, C major, which has no signature. Within the scope of the present application, however, a different definition is used, which simplifies the following calculations: the basic index $m_t = 0$ is not assigned to the pitch C, but to the pitch D, because the pitch D is the symmetry pitch of the key C major, which has no signature, and thus also forms the geometric center of mass of the key in the circle of thirds and in the symmetry circle. This results in the assignment of basic indices $m_t$ to the pitch classes t which is illustrated in the following Table 1. The following applies:
The circle of thirds consists of 24 pitches spaced at intervals of major and minor thirds. These pitches are referred to as real pitches r because they represent actually sounding pitches. To be able to place the real pitches r geometrically on the circle of thirds, auxiliary pitches h have to be added. Two adjacent auxiliary pitches are a semitone (minor second) apart and, similar to the pitch classes, they have a basic index $m_h$ and an extended index $n_h$. Two adjacent auxiliary pitches thus have the extended indices $n_h$ and $(n_h + 1)$. Similar to the above paragraph, the following applies:
$-42 \le m_h < +42$ (5)
$-\infty < n_h < +\infty$ (6)
The auxiliary pitches h are used to define the semitone raster of 84 elements which underlies the circle of thirds: the basic index $m_h$ of the auxiliary pitches h does not run from 0 to 11 as with the pitch classes, but from −42 to +41, as equation 5 shows. Auxiliary pitches which contribute to the definition of keys having a negative signature (flat keys) thus receive a negative index. Auxiliary pitches which contribute to the definition of keys with a positive signature (sharp keys and/or # keys) receive a positive index. The basic index $m_h$ and the extended index $n_h$ may be converted into each other according to the following rule:
To each auxiliary pitch h having the extended index $n_h$, a pitch class t having the extended index $n_t$ is assigned. Owing to the definition of Table 1, no conversion of the indices $n_h$ and $n_t$ into each other is required. Rather, the extended index $n_t$ of the pitch class t of an auxiliary pitch h corresponds to the extended index $n_h$ of the auxiliary pitch. Thus, the following equation applies:
$n_t(n_h) = n_h$ (8a)
The conversion of the extended index $n_t$ into the basic index $m_t$ of the pitch classes t is then performed according to equation 4. The following Table 2 shows, by way of example, the assignment of pitch classes t having the extended index $n_t$ to auxiliary pitches h having the extended index $n_h$ and/or vice versa:
Geometrically, each auxiliary pitch h having the extended index $n_h$ may also be represented as the vector $\vec{h}_{n_h}$
Apart from the angle α, a length and/or a magnitude (absolute value) is also associated with each auxiliary pitch, which is in the following also referred to as the energy s of the auxiliary pitch. In other words, the energy s of the auxiliary pitch h appears as the absolute value of the vector $\vec{h}_{n_h}$
wherein the formula symbol j is the imaginary unit. The following applies:
$j = \sqrt{-1}, \quad j^2 = -1$ (9a)
Apart from the auxiliary pitches h, there are also the real pitches r. The real pitches are the 24 pitches actually present on the circle of thirds and form a subset of the set of auxiliary pitches $M_h$. Each real pitch r is either the base pitch of a major chord (+) or the keynote/base pitch of a minor chord (−). For this reason, the set of real pitches $M_r$ may be divided into the two subsets $M_{r+}$ and $M_{r-}$. The following applies:
$M_{r\pm} := \{\, h_{n_h} \mid n_h = 7k \pm 2,\ k \in \mathbb{Z} \,\}$ (10)
With the help of the mathematical fundamentals set out so far it is also possible to represent pitch combinations in the circle of thirds. Here, a vector $\vec{r}$ is associated with each real pitch r. A sum of two real pitches $r_a$ and $r_b$ in the circle of thirds may thus be realized by the sum of the vectors $\vec{r}_a$ and $\vec{r}_b$ belonging to the two real pitches $r_a$ and $r_b$. The result of such a summation is the so-called sum vector $\vec{r}_{sum}$, which points to the geometric center of mass of the two pitches:
$\vec{r}_{sum} = \vec{r}_a + \vec{r}_b$ (11)
Each pitch class t appears on the circle of thirds in the form of two real pitches r, i.e. once as the base pitch of a major chord, $r_{n_{r+}}$, and once as the base pitch of a minor chord, $r_{n_{r-}}$. Equation 12 gives a calculation rule with which the real pitches $r_{n_{r-}}$ and $r_{n_{r+}}$ of the circle of thirds belonging to a given pitch class t with extended index $n_t$ may be found.
$n_{r\pm} = f(n_t) = 7^2\, n_t \pm 12$ (12)
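As an illustration, the following Python sketch enumerates the real pitches of the circle of thirds and looks up the two real pitches of a pitch class. It reads the coefficient of equation 12 as 7² = 49, which is an interpretation of the printed formula. The folding of an extended index into the range −42..+41 in `wrap_raster` is likewise an assumed form of the conversion rule between basic and extended index, and all function names are hypothetical.

```python
def wrap_raster(n_h: int, size: int = 84) -> int:
    """Fold an extended auxiliary index into [-size/2, size/2) (assumed rule)."""
    half = size // 2
    return ((n_h + half) % size) - half


def is_real_pitch(n_h: int) -> bool:
    """Equation (10): real pitches lie at n_h = 7k + 2 or n_h = 7k - 2."""
    return n_h % 7 in (2, 5)


def real_pitch_indices(n_t: int) -> tuple[int, int]:
    """Equation (12), read as 49*n_t -/+ 12, folded into the 84-step raster."""
    return wrap_raster(49 * n_t - 12), wrap_raster(49 * n_t + 12)


# All real pitches within one turn of the 84-step semitone raster.
REAL_PITCHES = [n for n in range(-42, 42) if is_real_pitch(n)]
```

With D at index 0, `real_pitch_indices(0)` gives the pair (−12, +12), which lies symmetrically around the auxiliary pitch with index 0, and `len(REAL_PITCHES)` is 24, matching the 24 pitches of the circle of thirds.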
It was noted above that a set of real pitches in the circle of thirds may be described by a sum vector $\vec{r}_{sum}$. It was further determined that each pitch class t appears in the form of two real pitches $r_{n_{r-}}$ and $r_{n_{r+}}$ in the circle of thirds. Thus, it is possible to represent a pitch class t with an extended index $n_t$ by a sum vector
$\vec{r}_{sum} = \vec{r}_{n_{r-}} + \vec{r}_{n_{r+}}$ (12a)
in the circle of thirds. The following applies:
The factor 1.25 results for all pitch classes and may thus be disregarded. Using the relationships of equation 13 it is possible to represent a set of pitch classes $M_t$ by a circle of thirds sum vector $\vec{r}_{sum}$. The following applies:
From the circle of thirds sum vector in turn the key and/or signature number v and the type of signatures may be derived. The circle of thirds sum vector has an angle α which fulfils the relationship
wherein $n_{hsum}$ represents the “extended index” of the circle of thirds auxiliary pitch to which the sum vector $\vec{r}_{sum}$ points. The following applies:
so that for the number of signatures v the following applies:
It is also worth noting that the circle of thirds sum vector $\vec{r}_{sum}$ belonging to a pitch class t is identical to the symmetry vector of the key represented by the pitch class. Thus, for example for the pitch class D, the following applies:
$\vec{r}_{sum}(t = D) = \vec{h}_0$ (15d)
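The factor 1.25 and equation 15d can be checked with a short calculation. This is a sketch under two assumptions that are not spelled out in the text as reproduced here: each auxiliary vector has the energy s and the angle $2\pi n_h / 84$, and the two real pitches of a pitch class lie at $49\,n_t \pm 12$ (equation 12 read with the coefficient 7²).

```latex
\begin{align*}
\vec{r}_{sum}
  &= s\,e^{\,j 2\pi (49 n_t - 12)/84} + s\,e^{\,j 2\pi (49 n_t + 12)/84} \\
  &= s\,e^{\,j 2\pi \cdot 49 n_t / 84}
     \left( e^{-j 2\pi/7} + e^{+j 2\pi/7} \right) \\
  &= 2\cos\!\left(\tfrac{2\pi}{7}\right) s\,e^{\,j 2\pi \cdot 7 n_t / 12}
   \approx 1.25\, s\, e^{\,j 2\pi \cdot 7 n_t / 12}
\end{align*}
```

Under these assumptions the factor is $2\cos(2\pi/7) \approx 1.247$ for every pitch class. For the pitch class D, with $n_t = 0$, the sum vector has the angle 0 and thus points in the direction of $\vec{h}_0$.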
The mathematical description of the symmetry circle is similar to the description of the circle of thirds. The following explanations only hold true for diatonic keys without signatures, such as C major or a minor. To extend the following embodiments to transposed versions, a so-called transposition factor τ has to be introduced to account for the fact that the symmetry circle relates to a certain diatonic key. The symmetry circle and/or the cadence circle of the symmetry model contains seven real pitches $r_m$ spaced at intervals of minor and major thirds. These are placed on a semitone raster consisting of 24 auxiliary pitches h. Each of the auxiliary pitches also has a basic index $m_h$ and an extended index $n_h$, with the help of which an auxiliary pitch h may be uniquely identified on the symmetry circle. The following applies:
$-12 \le m_h < +12$ (16)
$-\infty < n_h < +\infty$ (17)
The indexing of the auxiliary pitches h in the symmetry circle is selected such that auxiliary pitches h having a negative index, in particular a negative basic index $m_h$, belong to the subdominant area, and auxiliary pitches h having a positive index, in particular a positive basic index $m_h$, belong to the dominant area. The absolute value of the index $|m_h|$ is a measure of how far a pitch is from the tonic area and/or the tonal center: a very small value $|m_h|$ indicates that the real pitch r is close to the tonic area and/or the tonal center. The basic index $m_h$ and the extended index $n_h$ may be converted into each other according to the following rule:
The assignment of a pitch class t with an extended index $n_t$ to an auxiliary pitch h with an extended index $n_h$ is made in the same way as with the circle of thirds: owing to the selected indexing of the pitch classes according to Table 1, a conversion of the pitch class indices $n_t$ into the indices $n_h$ of the auxiliary pitches of the symmetry circle is not required. The following applies:
$n_h = n_t$ (20)
The real pitches r of the symmetry circle are a subset of the auxiliary pitches. They may be divided into three groups, namely real pitches forming the base pitch of
1. a major chord ($r_{n+}$),
2. a minor chord ($r_{n-}$) or
3. a diminished chord ($r_{n0}$)
The set of real pitches $M_r$ is set up as follows:
$M_r = M_{r\pm} \cup M_{r0}$
$M_{r\pm} = \{\, h_n \mid n = 7k \pm 2,\ |k| \le 1 \,\}$ (21)
$M_{r0} = \{\, h_{12} \,\}$
Each auxiliary pitch h with the extended index $n_h$ may also be represented as a vector $\vec{h}_{n_h}$. This vector $\vec{h}_{n_h}$ likewise has an angle α, which is here selected such that the symmetry pitch $h_0$ of the key represented by the symmetry circle has the angle 0. The vector $\vec{h}_0$ is therefore also called the zero vector. Here too, the absolute value and/or the length of the vector is referred to as energy s. In other words, the energy of the pitch is denoted by the symbol s:
A set of given pitch classes $M_t$ may also be described by a sum vector $\vec{r}_{sum}$ in the symmetry circle. The symmetry circle does not contain all pitches, but only the pitches of the selected diatonic key. To represent a set of given pitch classes $M_t$ on the symmetry circle, first the intersection $M_t \cap M_r$ of the given pitch classes $M_t$ and the set of real pitches $M_r$ present on the symmetry circle has to be formed. For this intersection, the sum vector $\vec{r}_{sum}$ may subsequently be formed.
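The formation of the intersection and of the sum vector can be sketched as follows for the symmetry circle of C major and/or a minor. The sketch assumes that an auxiliary pitch with index $n_h$ has the angle 2π·n_h/24 on the 24-step raster and that D has the pitch class index 0; the table `SYMMETRY_REAL` and the function name are hypothetical.

```python
import cmath
import math

# Real pitches of the C major / a minor symmetry circle:
# basic pitch class index (D = 0, ascending in semitones) -> auxiliary index n_h.
# F = 3, A = 7, C = 10, E = 2, G = 5, B = 9, D = 0 (real pitch h_12).
SYMMETRY_REAL = {3: -9, 7: -5, 10: -2, 2: 2, 5: 5, 9: 9, 0: 12}


def symmetry_sum_vector(energies: dict[int, float]) -> complex:
    """Sum vector over the intersection of the given pitch classes with M_r.

    `energies` maps extended pitch class indices n_t to energies s.
    Pitch classes outside the diatonic key are skipped (the intersection).
    """
    total = 0j
    for n_t, s in energies.items():
        n_h = SYMMETRY_REAL.get(n_t % 12)
        if n_h is None:
            continue
        total += s * cmath.exp(2j * math.pi * n_h / 24)  # assumed angle convention
    return total
```

If the energies are scaled so that they sum to 1, a single pitch class yields a vector of length 1, while all seven pitch classes of the key with equal energy yield a vector of length close to 0.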
On the basis of the fundamentals laid out so far, i.e. the synthesis and analysis of sensibly sounding pitch combinations, the introduction to different pitch spaces (e.g. symmetry model and circle of thirds) and the mathematical basics for describing the pitch spaces and the resulting sum vectors, the following sections describe possible scenarios of use for the sum vector. The main focus is on the possibilities offered by the sum vector as provided by the inventive device 100 for analyzing an audio datum in the form of the analysis signal.
With the help of a circle of thirds-based key analysis, as explained in more detail in the following section, valuable information about content features of an audio and/or pitch signal may be obtained. In particular, according to equation 13, any set of pitch classes may be summarized and described in the form of a sum vector $\vec{r}_{sum}$, which allows valuable conclusions on content features of the underlying audio and/or pitch signal.
As already explained in connection with equations 15a-15c, the angle α of the sum vector $\vec{r}_{sum}$ indicates in which key a piece of music is at a certain point of time. Thus, for example, the sum vector has the angle α=0 for the pitch classes of the C major scale. This is exactly the location on the circle of thirds where the symmetry pitch, and thus the representation of the key C major, is located.
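A circle of thirds-based key estimate along these lines might look as follows in Python. The sketch rests on several assumptions: D has the pitch class index 0, each pitch class contributes its two real pitches at 49·n_t ± 12 on the 84-step raster (equation 12 read with the coefficient 7²), an auxiliary pitch has the angle 2π·n_h/84, and seven raster steps correspond to one accidental. The function names are hypothetical.

```python
import cmath
import math


def circle_of_thirds_sum_vector(energies: dict[int, float]) -> complex:
    """Sum vector of a set of pitch classes (index n_t -> energy s)."""
    total = 0j
    for n_t, s in energies.items():
        for n_h in (49 * n_t - 12, 49 * n_t + 12):  # the two real pitches
            total += s * cmath.exp(2j * math.pi * n_h / 84)
    return total


def signature_count(vec: complex) -> int:
    """Signed number of accidentals: positive for sharps, negative for flats.

    Assumes 7 raster steps per accidental on the 84-step raster.
    """
    n_h_sum = cmath.phase(vec) * 84 / (2 * math.pi)
    return round(n_h_sum / 7)


C_MAJOR = {t: 1.0 for t in (10, 0, 2, 3, 5, 7, 9)}   # C D E F G A B
G_MAJOR = {t: 1.0 for t in (5, 7, 9, 10, 0, 2, 4)}   # G A B C D E F#
```

With these inputs the sum vector of `C_MAJOR` has an angle of approximately 0, and `signature_count` yields 0 for `C_MAJOR` and 1 for `G_MAJOR`. Near an angle of ±π the sign of the result is ambiguous, which a real implementation would have to handle.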
The absolute value of the sum vector $|\vec{r}_{sum}|$ is, in addition, an estimate of how certain it is that a particular diatonic key is present and/or how well defined the tonal context is. With an increasing absolute value of the sum vector $|\vec{r}_{sum}|$, the probability increases that the pitch classes belong to a certain key. If the absolute value is very small, however, either only very few different pitch classes are present, so that the key may not be reliably determined, or the pitch classes belong to completely different keys.
Apart from that, the sum vector provides information about changes of key and/or modulations: a key occupies an area of 24 semitone steps on the circle of thirds. With 84 semitone steps for the full circle, this corresponds to an angle of $\tfrac{4}{7}\pi$. If a piece of music remains within the limits of a diatonic key, then the sum vector $\vec{r}_{sum}$ moves within a circle segment which does not exceed this opening angle. If the sum vector $\vec{r}_{sum}$ leaves such a circle segment, however, a change of key has probably occurred.
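The segment criterion can be sketched as a simple angle test. The sketch assumes that the circle segment of a key is centered on the angle of that key's symmetry pitch, which is not stated explicitly above; the function names are hypothetical.

```python
import math

# Half of the opening angle 4/7*pi which a key occupies on the circle of thirds.
HALF_OPENING = (4 * math.pi / 7) / 2


def angular_distance(a: float, b: float) -> float:
    """Smallest absolute difference between two angles in radians."""
    d = (a - b + math.pi) % (2 * math.pi) - math.pi
    return abs(d)


def leaves_key_segment(angle: float, key_angle: float) -> bool:
    """True if the sum vector angle lies outside the segment of the key.

    Assumes the segment is centered on key_angle.
    """
    return angular_distance(angle, key_angle) > HALF_OPENING
```

A result of `True` only suggests a change of key; as noted above, the sum vector leaving the segment makes a modulation probable, not certain.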
The sum vector $\vec{r}_{sum}$ additionally enables correcting analysis errors in the harmony analysis and the key analysis. Modulations into adjacent keys are more probable than modulations into non-adjacent keys. Rare, temporary outliers of the angle of the circle of thirds sum vector therefore indicate with high probability that an analysis error is present.
It is further possible to differentiate between tonal and non-tonal music with the help of the sum vector $\vec{r}_{sum}$. With non-tonal music, the absolute value of the sum vector is very small. With tonal music, however, it grows as a function of time when an integration and/or summation across the complete already elapsed time of the piece of music is performed.
If, in addition to that, the audio signal underlying the analysis is integrated over time until the absolute value of the resulting sum vector reaches a maximum, a change of key may be inferred from this. It may be necessary here to make the criterion for the presence of a maximum a “soft” one. In other words, short-term deviations of the absolute value or the length of the sum vector may well occur, which are to be attributed to statistical fluctuations of the occurring semitones, without a change of key being present. Accordingly, it may be advisable, in the case of a detection system, as illustrated in
As explained in the last section, the circle of thirds and/or the circle of thirds-based harmony analysis is used for the analysis of connections across keys. With the help of the circle of thirds, for example, the key used at a certain time may be determined from a pitch signal and/or audio signal and/or audio data. If the key is determined and/or given, then the symmetry model may be determined and/or used. This, in turn, is very suitable for determining connections within a key. The symmetry model-based harmony analysis also uses the sum vector $\vec{r}_{sum}$ introduced in the section on the mathematical model description of the symmetry model.
From the angle of the sum vector $\vec{r}_{sum}$, the current chord may be estimated, as the vector points to the geometrical center of mass and/or the tonal center of the pitch classes played at a certain point of time. In addition to that, changes of chord may be determined and/or analyzed from the angle of the sum vector $\vec{r}_{sum}$. A sudden change of the angle of the sum vector suggests a change of chord.
The angle of the symmetry circle sum vector also gives an indication of whether a pitch combination tends to be associated with the subdominant area, the tonic area or the dominant area.
From the absolute value of the symmetry circle sum vector $|\vec{r}_{sum}|$, the perceived consonance and/or dissonance, i.e. the pleasantness of a given combination of pitch classes, may be estimated. The longer the vector, the more pleasant and/or consonant the analyzed pitch combination is perceived to be. Accordingly, the shorter the symmetry model sum vector, the more dissonant and/or unpleasant the pitch combination is perceived to be.
the sum of all energies, i.e. the components of the semitone vector, has the value 0.5. By multiplying all components of the semitone vector by a factor of 2 (=1/0.5), the following semitone vector results, whose energies sum to the value of 1.
The sum of all energies now has the value of 1.
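The normalization step described here amounts to dividing every component by the sum of all components. A minimal sketch in Python follows; the example vector is hypothetical and merely chosen so that its energies sum to 0.5.

```python
def normalize_energies(energies: list[float]) -> list[float]:
    """Scale a semitone vector so that its energies sum to 1."""
    total = sum(energies)
    if total == 0:
        return list(energies)  # nothing sounds: leave the vector unchanged
    return [e / total for e in energies]


# Hypothetical semitone vector whose energies sum to 0.5.
example = [0.25, 0.0, 0.0, 0.0, 0.125, 0.0, 0.0, 0.125, 0.0, 0.0, 0.0, 0.0]
normalized = normalize_energies(example)
```

For this example, every component is multiplied by 2 (=1/0.5), so that the first component becomes 0.5 and the two others 0.25 each.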
Apart from that,
Similar to the courses 480, 485 indicated in
The course 490 shows, with decreasing consonance and/or pleasantness of the respective chord variants, a monotonically decreasing absolute value of the symmetry circle sum vector. Starting from a value of 1 in the case of a prime, the course 490 continuously drops to a value of approximately 0 when all pitch classes are considered. Accordingly, the course 490 illustrates the suitability of the absolute value of the symmetry circle sum vector as an estimate for assessing the consonance and/or pleasantness of different pitch combinations. The course 490 clearly shows that a pitch combination and/or pitch class combination is perceived and/or sensed to be more consonant and/or pleasant the higher the absolute value of the corresponding symmetry circle sum vector is. In contrast to the course 490, the course 495 shows, similar to the course 485 of
Apart from the course 500, in
Apart from that, in
The inventive device for analyzing an audio datum thus provides an analysis signal based on the sum vector to further components. As the embodiments explained in the following will show, the analysis signal provided by the inventive device for analyzing audio data may be supplied to a display device 195 which graphically, in text form, mechanically or in another way represents the information which the sum vector includes based on the analysis signal. In addition to that, the analysis signal may also be provided to an automatic accompaniment device as an input signal, which generates an accompaniment which goes with the audio data based on the analysis signal.
In the following sections, further embodiments of the inventive device for analyzing an audio datum are described. The embodiments of the inventive device for generating a note signal described in the following among others include symmetry model-based and circle of thirds-based musical instruments which may be integrated into an inventive device, be coupled or couplable to the same.
The fundamentals set so far and explained in the above sections represent the starting point to describe new musical instruments in the form of embodiments of the present invention. In other words, the laid fundamentals are perfectly suitable for developing the new musical instruments described in the further process.
First of all, in the following sections, in the form of a block diagram, a principle setup for a musical instrument is introduced which works on the basis of the hitherto presented fundamentals. This instrument principle realized by a block diagram implements the concepts summarized in the introductory sections regarding the topics of the synthesis of sensibly sounding pitch combinations and the analysis of present pitch combinations. The basic features and/or characteristics of the inventive musical instruments are summarized in the following.
The concept for musical instruments (instrument concept) is based on a logic basic system which allows the geometrical positioning of base pitches in a pitch space. Optionally, the instrument concept additionally allows the definition of a spatial pitch distribution function and/or the definition of a spatial single pitch distribution function. As a further option, a selection weighting function may be introduced within the scope of the inventive instrument concept. Further, the instrument offers an operating means and/or a user interface which enables selecting and/or defining an input angle or an input angle range and/or a spatial section of the logical pitch space (range) in the form of an input signal. The selection of the spatial section may then be optionally indirectly supplied to a sound generator.
The arrangement of the base pitches and/or the pitch classes in the pitch space follows an arrangement in which the smallest pitch intervals correspond to a major or a minor third. Following the layout of the circle of thirds and/or the symmetry model and/or the symmetry circle and/or the cadence circle has proven especially sensible in this context. This makes it possible to generate sensible pitch combinations with an extremely low number of base pitches and a correspondingly low number of operating elements and/or input means. For this reason, this instrument concept is especially suitable for the pedagogic field. Apart from that, it is also suitable for quickly and efficiently generating note signals which may be used via a connected sound generator for generating harmonically and/or consonantly sounding accompaniments or improvisations. This very fast and very simple input, together with the pedagogic suitability of the inventive instrument concept, makes it possible to playfully introduce people with little prior musical education to music.
This instrument concept may thus, for example, enable the stepless cross-fading of sound combinations into other sound combinations without unwanted dissonances resulting. This essentially takes place on the basis of the geometrically adjacent arrangement of sensible base pitches and the input of a user in the form of an input angle or an input angle range. Optionally, the instrument concept may be further refined here by introducing the spatial distribution function and/or the spatial single pitch distribution function, which is assigned to individual base pitches, as well as the optional possibility of steplessly changing/varying the selected section in the pitch space regarding its position, extension and spatial weighting.
The instrument concept optionally provides an analysis part which is able to analyze audio information, audio data and pitch information of other instruments and map the same into its own pitch space. The active pitches of other instruments may then be marked and/or accentuated on a display device 195. By the geometric arrangement of the output field radial directions and/or the output areas of coherent base pitches in the pitch space and on the operating surface of the instrument, it is possible with a minimum of musical knowledge to generate a suitable accompaniment music to a given pitch signal.
Apart from the synthesis branch 630, the musical instrument 600 also comprises a device for analyzing an audio datum as an analysis branch 660. The same includes a base pitch analysis device and/or semitone analysis device 670 and an interpretation device 680 and/or vector calculation means 680, which are coupled to each other. In addition to that, the base pitch analysis device 670 receives a pitch signal as an audio datum via an input, which may assign no, one or several pitch classes to each angle. The interpretation device 680 is coupled to the display device 610 and may also access the memory 650 and the basic pitch distribution stored in the memory via a corresponding coupling. This coupling, i.e. the coupling of the interpretation device 680 and the memory 650, is optional. Also the coupling between the operating device 620 and the memory 650 is optional. In addition to that, the memory 650 may optionally also be connected to the display device 610 so that the same may also access the base pitch distribution stored in the memory 650.
Apart from the connections of the memory 650 to the interpretation device 680, the display device 610 and the operating device 620 already described above, the memory 650 may optionally also be connected to a base pitch definition input device 690, so that a user may influence, change or reprogram the base pitch distribution in the memory 650 via the base pitch definition input device 690. The display device 610, the operating device 620 and the base pitch definition input device 690 thus represent user interfaces. The base pitch analysis device 670, the interpretation device 680 and the sound generator 640 thus represent processing blocks.
In the case of the musical instrument illustrated in
For an exact description of the functioning of the analysis branch 660, i.e. for the inventive device for analyzing an audio datum, reference is made to
While synthesizers today are specialized in particular on two things, i.e. modeling the amplitude courses and the frequency courses of single pitches, and thus only offer insufficient methods to generate, merge or otherwise process complex harmonies, the musical instrument 600 indicated in
If a pitch signal is provided to the musical instrument 600 via its input terminal, and thus to the base pitch analysis device 670, the semitone analysis device of the base pitch analysis device 670 first analyzes the pitch signal with regard to a volume information distribution over a set of semitones. Subsequently, the pitch class analysis means of the base pitch analysis device 670 determines a pitch class volume information distribution over the set of pitch classes on the basis of the volume information distribution. This pitch class volume information distribution is then supplied to the interpretation device 680, i.e. the vector calculation means. The latter determines a two-dimensional intermediate vector for each semitone or for each pitch class and calculates a sum vector from these intermediate vectors, the lengths of the individual intermediate vectors being weighted based on the volume information distribution or the pitch class volume information distribution. Finally, the interpretation device 680 outputs an analysis signal to the display device 610 which is based on the sum vector. Alternatively or additionally, the interpretation device 680 may provide a display signal to the display device 610 which comprises information regarding the volume information distribution or the pitch class volume information distribution.
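The processing chain described above can be sketched in Python as follows. The sketch assumes that the volume information distribution is given over MIDI note numbers (60 being middle C), that D has the pitch class index 0, and that the intermediate vector of a pitch class points in the direction 2π·7·n_t/12, which is the circle of thirds direction obtained when equation 12 is read with the coefficient 7². The function names are hypothetical and do not describe a specific implementation of the devices 670 and 680.

```python
import cmath
import math


def pitch_class_volumes(semitone_volumes: dict[int, float]) -> list[float]:
    """Fold a volume distribution over MIDI notes into 12 pitch classes (D = 0)."""
    volumes = [0.0] * 12
    for midi_note, volume in semitone_volumes.items():
        volumes[(midi_note - 2) % 12] += volume  # MIDI 62 is a D
    return volumes


def sum_vector(volumes: list[float]) -> complex:
    """Add one intermediate vector per pitch class, weighted by its volume."""
    total = 0j
    for n_t, volume in enumerate(volumes):
        direction = cmath.exp(2j * math.pi * 7 * n_t / 12)  # assumed angle
        total += volume * direction
    return total


# Usage: the C major scale starting at middle C, all notes equally loud.
scale = {note: 1.0 for note in (60, 62, 64, 65, 67, 69, 71)}
analysis = sum_vector(pitch_class_volumes(scale))
```

In this sketch the complex number `analysis` plays the role of the sum vector on which the analysis signal is based; its angle is approximately 0 for the C major scale.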
The display device 610 may then, on the basis of the analysis signal and/or the display signal, indicate the pitch classes, corresponding to the incoming pitch signal, to the user on an output field of the display device 610 by accentuating output field radial directions or by accentuating output areas. Here, the display device 610 may perform the illustration on the output field based on the base pitch distribution stored in the memory 650.
The user of the musical instrument 600 may then define an input angle or an input angle range via the operating device 620, so that the operating device 620, with the help of its control means and optionally based on the base pitch distribution stored in the memory 650 in the form of the assignment function, then generates note signals from this and provides the same to the sound generator 640. The sound generator 640 then in turn generates pitch signals based on the note signals of the operating device 620 which are then output at the output of the musical instrument 600.
In other words, the optional memory 650 with the basic pitch distribution stored in it, together with the possibility of changing that distribution via the base pitch definition input device 690, represents a central component of the inventive musical instrument 600. A further important component is the display device 610. It represents the pitch space and the base pitches contained therein, marks selected or analyzed pitches, and may also map the spatial pitch distribution function and/or the spatial single pitch distribution function and/or the selection weighting function. Further, the concept of the musical instrument 600 provides the analysis branch 660 and the synthesis branch 630. The analysis branch 660 is able to analyze the base pitches transported within pitch signals (for example audio signals or MIDI signals), interpret them according to the base pitch distribution, mark them in the pitch space and display them via the display device 610. This functionality may, e.g., be used so that a musician B may generate a suitable accompaniment to an audio signal provided by a musician A. Apart from the analysis branch 660, there is also the synthesis branch 630. The same contains an interface for selecting base pitches, i.e. the operating device 620 also referred to as the base pitch selection in
Basically it is possible that the interpretation device 680, the display device 610 and the operating device 620 access different base pitch distributions which are stored in the memory 650. Thus, it is, for example, possible that the display device 610 uses a representation which exactly models the symmetry model and/or the cadence circle, which means that with regard to the angle the distance of two adjacent pitch classes depends on whether the smallest pitch interval is a minor third or a major third. Simultaneously, the operating device 620 may work on the basis of an assignment function, wherein the seven pitch classes of the symmetry circle and/or the cadence circle are equidistantly distributed with regard to the angle.
In the form of a block diagram,
In the following sections, the selection of the active spatial section by the user, i.e. the definition of the input angle or the input angle range, is considered in more detail. In this connection, some embodiments of the operating means are given and explained in more detail. Here, the following explanations are made using a base pitch arrangement following the symmetry model. Without limitations, the same may, however, also be applied to the circle of thirds or another arrangement of the base pitches and/or pitch classes.
Here, the active spatial section in the symmetry model, in the circle of thirds and in other arrangements of the base pitches is defined via one single input angle or via one circle segment. A circle segment may, for example, be specified via a starting angle and an opening angle and, optionally, also via a radius. The term “active spatial section” here also includes the case in which the opening angle of the circle segment vanishes, i.e. is 0°, so that the active spatial section may also consist of only one single input angle. In this case, the starting angle and the input angle are consequently the same.
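The selection of pitch classes by a circle segment may be sketched as follows. This is a minimal sketch under stated assumptions: the angle table is a hypothetical symmetry model layout for C major, the function names are illustrative, and the optional radius is not considered.

```python
# Hypothetical angles (degrees) of the pitch classes in a symmetry model of C major.
PITCH_CLASS_ANGLES = {"C": 0.0, "e": 60.0, "G": 105.0, "b": 165.0,
                      "d": 210.0, "F": 255.0, "a": 315.0}


def offset_in_section(angle, start, opening):
    """Return the angular offset from the starting angle if `angle` lies in the
    circle segment [start, start + opening] (degrees, modulo 360), else None."""
    opening = min(max(opening, 0.0), 360.0)
    offset = (angle - start) % 360.0
    return offset if offset <= opening else None


def active_pitch_classes(start, opening, angles=PITCH_CLASS_ANGLES):
    """List the pitch classes inside the active spatial section, ordered by
    their angular distance from the starting angle."""
    hits = []
    for name, angle in angles.items():
        offset = offset_in_section(angle, start, opening)
        if offset is not None:
            hits.append((offset, name))
    return [name for _, name in sorted(hits)]
```

An opening angle of 0° reduces the segment to the single input angle, so that only a pitch class lying exactly at the starting angle is selected.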
If a fixed arrangement of keys has already been predefined, a sensible assignment of the base pitches to individual keys may be performed. One example for this is given in
Also using a simple rotary switch 740 the starting angle α and/or the input angle may be defined, as illustrated by
In this connection, it is important to note, that it is absolutely possible to let the musical instrument 600 for example operate in a mode which is based on the symmetry model of a certain scale, so that, for example, the display device 610 optically reflects the respective symmetry model, while the operating device 620 includes a rotary switch like the one illustrated in
A combined input of starting angle α and opening angle β may also take place using a joystick. Thus, for example, the starting angle α may be derived from the inclination direction of the joystick, and the opening angle β or the radius r of the circle segment may be derived from the inclination degree. Instead of the inclination axis of the joystick, also the inclination angle and the inclination degree of the head may be used. This is, for example, interesting for accompaniment instruments for paraplegics, as will be explained in more detail in the further course of the present application.
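The derivation of the starting angle α and the opening angle β from a joystick may be sketched as follows. The normalized deflection coordinates, the linear scaling of the inclination degree and the parameter `max_opening` are assumptions made for illustration only.

```python
import math


def joystick_to_segment(x, y, max_opening=180.0):
    """Map a joystick deflection (x, y each in [-1, 1]) to a starting angle
    alpha and an opening angle beta, both in degrees.

    alpha follows the inclination direction; beta grows linearly with the
    inclination degree (hypothetical scaling up to `max_opening`)."""
    alpha = math.degrees(math.atan2(y, x)) % 360.0
    deflection = min(math.hypot(x, y), 1.0)  # clamp diagonal deflections to 1
    beta = deflection * max_opening
    return alpha, beta
```

The same mapping could be applied to the inclination of the head if a suitable sensor delivers comparable x and y values.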
Very complex possibilities for the definition of the active circle segment are offered by screen-based input methods. In this case, the symmetry model or the circle of thirds may be mapped to a screen or a touch screen. The active circle segment may be selected using a mouse or by touching the touch screen or another type of touch-sensitive surface. Here, interactions such as drag and drop, dragging, clicking, tapping or other gestures may be used.
Such an application and embodiment example is illustrated by the so-called HarmonyPad. The HarmonyPad is a special operating means or also instrument for generating, changing and cross-fading chords, on which the symmetry vector may be represented advantageously. The surface of the HarmonyPad may also be used to program the synthesizers and sound generators contained in circle of thirds-based and symmetry circle-based musical instruments and to configure their operating surface. In more detail, the HarmonyPad thus represents a system, which includes both a device for generating a note signal upon a manual input and a device for outputting an output signal indicating a pitch class, which may advantageously be coupled to an inventive device for analyzing an audio datum.
As explained in the application filed concurrently with the present application, with the title “Device and method for generating a note signal and device and method for outputting an output signal indicating a pitch class”, the HarmonyPad comprises an output field and a touch-sensitive field, which are arranged relative to each other so that the touch-sensitive field is located between a user of the HarmonyPad and the output field. The touch-sensitive field is here implemented transparently and/or semi-transparently, so that the user may look through the touch-sensitive field. By this, the user may perform an input “quasi directly” on the screen, i.e. the output field; a detection means coupled to the touch-sensitive field detects this input and passes it on to an input control means.
First of all, the possible operating surface and/or surface comprises a harmony area 800, which includes a circle of thirds 805 and the symmetry model 810. The symmetry model 810 is here arranged and/or mapped concentrically in the center of the circle of thirds. The circle of thirds 805 and the symmetry model 810 thus comprise a common center point 812. The center point 812 simultaneously represents a center of the output field and the touch sensitive field. Starting from this center 812, one or several output field radial directions may be accentuated, i.e. optically accentuated and/or illuminated here.
On the right next to the harmony area 800 four input fields and/or input possibilities (e.g. buttons) 815, 820, 825 and 830 are arranged one below the other. Here, the input field 815 enables editing, changing, determining or defining the spatial single pitch distribution function and thus also the spatial pitch distribution function. Using the button 820 a user of the HarmonyPad may define, edit or influence an inversion weighting function, using the button 825 correspondingly the selection distribution function and using the button 830 the opening angle β of the active spatial section and/or the selected area.
The surface of the HarmonyPad illustrated in
Selection of key: The current key is selected by touching the circle of thirds 805. In
Alternatively, it is, for example, also possible that the illuminated area 835 is shifted corresponding to the newly selected key, so that a new orientation of the circle of thirds may be omitted. The circle of thirds 805 in this embodiment thus represents an embodiment of an additional operating means, with the help of which a selection of different assignment functions may be performed by the user between angles and pitch classes. By this, the HarmonyPad may be switched to and fro between different keys.
Selection of the chord to be played: To make a certain chord and/or a certain pitch combination sound/play, first of all the opening angle β of the circle segment to be selected and/or the active spatial section has to be determined. This may, for example, take place graphically via the input field 830 and/or the associated window. Alternatively or additionally, this may, of course, also be done via a connected hardware interface or via an input means, as it was described in connection with
Fading between chords: In
Fading between conversions: Optionally, the HarmonyPad offers the possibility of using and/or interpreting the radius of the selected circle segment for the selection of different chord conversions. By this it is possible, by a change of the radius r, to obtain a desired octaving of individual base pitches. Here, within the scope of the present application, the octaving of a pitch or a pitch class is a determination and/or definition of an octave position. The indication of an octaving thus, for example, defines to which octave a pitch with a certain pitch class belongs. With the help of octaving, it is thus defined which of the pitches C, C′, C″, C′″, . . . are played/sound and/or are to be associated with the pitch class C. In other words, the octaving determines a basic frequency of a pitch in the form of a factor 2^o with an integer number o, which is also referred to as the octaving parameter.
Thus, for example, the standard pitch A has a basic frequency of 440 Hz. If now, for example, instead of the standard pitch A, a pitch of the pitch class A is to sound one octave higher, then the octaving parameter has to be set to o=1, so that the new basic frequency of the pitch is 880 Hz. Accordingly, a pitch of the pitch class A one octave below the standard pitch A (o=−1) has a basic frequency of 220 Hz.
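The relationship between the octaving parameter o and the basic frequency may be written as a short sketch. The function name and the choice of 440 Hz as default reference are illustrative assumptions.

```python
def octaved_frequency(o, reference_hz=440.0):
    """Basic frequency of a pitch that lies `o` octaves above (o > 0) or
    below (o < 0) a reference pitch of the same pitch class.

    The octaving enters as the factor 2**o with an integer octaving parameter o."""
    return reference_hz * 2.0 ** o
```

For the standard pitch A, the octaving parameters o=1 and o=−1 thus yield the frequencies named in the example above.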
If, on the HarmonyPad, for example the basic setting of the C major chord is selected, then, for example, the first conversion of this chord may be achieved by the user drawing and/or moving a finger along a radially directed C line 850 which leads from the center of the symmetry circle radially outward under an angle which is associated with the pitch class C, in the direction of the circle center point and/or the center. By this, the radius r of the selected circle segment is reduced and the basic setting of the C major chord is slowly converted into the first conversion. Via a connected sound generator, the user may then hear the first conversion of the C major chord.
A conversion of a chord (commonly called an inversion) is here an arrangement of the pitches of a chord such that the sounding pitch having the lowest basic frequency is not necessarily also the base pitch, which in the case of a C major chord is the pitch C and/or the pitch class C. In the case of a C major chord, an arrangement of the sounding pitches with increasing frequency in the order E-G-C, for example, represents the first conversion. Apart from that, other assignments of the radius r to a certain octaving of a pitch and/or a pitch class, or to a certain conversion of a chord, are of course also possible.
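The forming of a conversion (inversion) of a chord may be sketched as follows. The sketch assumes that the pitches are given as midi note numbers and that the chord spans less than one octave; the function name is illustrative, and the assignment of the radius r to the number of conversion steps is deliberately left open.

```python
def convert_chord(notes, steps=1):
    """Return a conversion (inversion) of a chord.

    `notes` are midi note numbers (assumption); each step moves the currently
    lowest pitch up by one octave (12 semitones)."""
    result = sorted(notes)
    for _ in range(steps):
        result = result[1:] + [result[0] + 12]
    return result
```

For a C major chord given as the note numbers 60, 64 and 67 (C-E-G), one step produces 64, 67 and 72, i.e. the order E-G-C.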
Just like the spatial single pitch distribution function may be edited and/or defined via the input field 815, by introducing an optional conversion distribution function which may be edited and/or defined via the input field 820, an octaving of the sounding pitches may be influenced. Thus it is possible, based on the selected conversion distribution function, to assign volume information values to single pitches regarding a certain pitch class, so that, for example in the selection of the pitch class C via the active spatial section, more than one pitch of the corresponding pitch class sounds. Likewise, it is possible that the conversion distribution function is used, based on the input of the radius r by the user, to make different conversions of the corresponding pitch combination and/or the corresponding chord sound via a connected sound generator. In order to enable this, the surface of the HarmonyPad offers the corresponding window and/or input field 820.
Fading between single pitches and chords: The HarmonyPad may, for example, be equipped with a midi interface or another control interface, to receive or also to transmit note sequence signals. Using this midi interface or the control interface, now optionally a controller, for example a foot controller, a momentary foot switch, a joystick or another input means may be connected. It is now possible to route the data of this input means (foot controller) to the opening angle β and/or interpret the same influenced by the input via the foot controller. This means that the opening angle may be controlled as an angle parameter by the user using the foot controller. Advantageously, the foot controller enables making a quasi continuous input of data possible which are, for example, associated with the foot position of the user. Hereby, the user may influence the opening angle β using the foot controller within predetermined or variable limits. If the user touches the foot controller so that it is at the bottom stop, this foot position may, for example, be associated with an opening angle of 0°. If the user now touches the HarmonyPad in the area of the symmetry model 810 at the location of the pitch and/or the pitch class C, via the connected sound generator, only the pitch C will sound and/or may be heard, as the opening angle is β=0°. If the user now slowly moves the foot controller in the direction of the top stop, it is possible to correspondingly increase the opening angle β so that the additional pitches and/or pitch classes E minor, G major and B/H minor are added and faded in one after the other in the case illustrated in
Finding pitches which match existing pitches (improvisation): Optionally, the HarmonyPad (just like the musical instrument 600) may be equipped with an analysis functionality which analyzes pitch signals and/or audio data present in the form of audio signals or midi signals and marks the corresponding base pitches on the surface of the HarmonyPad (pad surface) by a corresponding accentuation.
In addition to that, it is further optionally possible with the help of the HarmonyPad to graphically represent the result of an analysis of an audio datum which may be provided to the HarmonyPad in the form of an analysis signal. The inventive device for analyzing an audio datum may here be both implemented as a component of the HarmonyPad and also as an external component to the HarmonyPad. In the first case, the HarmonyPad thus represents a system which comprises a display device and a device for generating a note signal upon a manual input apart from the inventive device for analyzing an audio datum. In the second case, the analysis signal may be transferred to the HarmonyPad, for example via an external interface, for example a plug, a radio connection, an infrared connection, or another data connection.
Apart from a marking and/or accentuation of the pitch classes included in the audio signal by an accentuation of individual output field radial directions of the symmetry model 810 or larger coherent areas on the symmetry model 810, thus also the sum vector provided in the form of the analysis signal may be illustrated on the output field 810. Here, the angle of the sum vector may be indicated starting from the output field center and/or the center of the symmetry model 810 by an accentuation (e.g. in the shape of an arrow) of an output field radial direction, as it is shown in
Optionally, it is also possible to accentuate the output field radial direction accentuated on the basis of the angle of the sum vector not as a whole, but to accentuate, based on the length of the sum vector starting from the output field center, only a part of the corresponding output field radial direction, as it is shown by the accentuated, arrow-shaped output field radial direction 857 in
Optionally it is also well possible to temporally integrate the incoming audio signals with the help of an input value integrator for so long until the absolute value and/or the length of the resulting sum vector reaches a (temporally local) maximum, as it was already explained in connection with
In the following paragraphs, an inventive measurement device and an inventive analysis device for tonal-harmonic correlations are explained and described in more detail. In other words, in the following sections a further embodiment of a measurement system is explained, as it was already described in connection with
The semitone analysis means 1010 then analyzes the audio datum provided at its input 1010e regarding a volume information distribution across a set of semitones. The semitone analysis means 1010 thus implements (among others) equation 4. On the basis of this volume information distribution, the pitch class analysis means 1020 determines a pitch class volume information distribution over the set of pitch classes as the underlying set. The vector calculation means 1030 is then provided with the pitch class volume information distribution; on the basis of the same, it forms a two-dimensional and/or complex intermediate vector for each pitch class, calculates a sum vector based on the two-dimensional intermediate vectors and outputs the analysis signal at the analysis signal output 1030a on the basis of the sum vector. The downstream (optional) display device 1040 may then, based on the analysis signal, for example output the sum vector, the angle of the sum vector and/or also the absolute value and/or the length of the sum vector.
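The chain from the volume information distribution over semitones to the sum vector may be sketched as follows. This is a sketch under stated assumptions, not the implementation of the means 1010 to 1030: the semitone volumes are taken as already given (the semitone analysis itself and equation 4 are not reproduced), semitones are identified by midi note numbers, and the angle table is a hypothetical symmetry model layout for C major in which only the seven diatonic pitch classes contribute.

```python
import cmath
import math

# Hypothetical angles (degrees) per pitch class number (C=0, D=2, E=4, ...).
ANGLES_DEG = {0: 0.0, 4: 60.0, 7: 105.0, 11: 165.0, 2: 210.0, 5: 255.0, 9: 315.0}


def pitch_class_distribution(semitone_volumes):
    """Fold a volume distribution over semitones (midi note -> volume)
    into a distribution over the twelve pitch classes."""
    distribution = [0.0] * 12
    for note, volume in semitone_volumes.items():
        distribution[note % 12] += volume
    return distribution


def sum_vector(distribution, angles_deg=ANGLES_DEG):
    """Form one complex intermediate vector per pitch class (length = volume,
    direction = assigned angle) and add them up."""
    total = 0j
    for pitch_class, angle in angles_deg.items():
        total += distribution[pitch_class] * cmath.exp(1j * math.radians(angle))
    return total


def analysis_signal(semitone_volumes):
    """Return (angle in degrees, length) of the sum vector."""
    vector = sum_vector(pitch_class_distribution(semitone_volumes))
    return math.degrees(cmath.phase(vector)) % 360.0, abs(vector)
```

In this sketch, a C major chord with equal volumes yields a sum vector that points between the angles assigned to C and G, while its length indicates how closely the sounding pitch classes lie together on the circle.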
In other words, the measurement device 1000 is fed with an audio signal, i.e., for example, an (analog) line signal or a digital audio signal, from which the semitone analysis means 1010 analyzes the semitones. This may, for example, take place by the constant-Q transformation already explained in connection with
The angle and/or the absolute value of the corresponding sum vector may then be represented by the display device 1040.
The input terminal 1010e of the measurement device 1000 and/or the semitone analysis means 1010 may be a microphone input, an analog audio input or also directly a digital input, so that the measurement and display device, if the display device 1040 is also implemented, may in principle analyze both analog and also digital audio data. Depending on the implementation, also note sequence signals, i.e. also control signals like, e.g., midi control signals may be provided to the measurement device 1000. In the case of an analog input, depending on the implementation of the system, an analog/digital converter (ADC) may also be implemented, if it seems advisable.
The optional display device 1040 may, for example, comprise an output field, similar to the HarmonyPad illustrated in
Basically, it is possible, within the context of the calculation of the pitch classes by the pitch class analysis means 1020, to weight the analyzed semitones depending on their pitch level and/or their frequency f by introducing a weighting function g(f). The weighting function describes how strongly two pitches of the same pitch class which belong to different octaves differ in their influence on the perception of harmony. This makes it possible not only to analyze the semitones with regard to a volume information distribution based on a hearing-adapted variable, but also to consider the human perception of harmonies at different frequencies, which goes beyond a mere hearing-dependent variable. The weighting function g(f) thus makes it possible to further refine the analysis with regard to human perception.
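The use of a weighting function g(f) in the calculation of the pitch classes may be sketched as follows. The concrete shape of g(f) is not specified here; the bell-shaped curve over the logarithmic frequency, its center of 440 Hz and its width are purely hypothetical placeholders, as is the identification of semitones by midi note numbers.

```python
import math


def midi_to_hz(note):
    """Frequency of a midi note number in equal temperament (A4 = 69 = 440 Hz)."""
    return 440.0 * 2.0 ** ((note - 69) / 12.0)


def g(f, center_hz=440.0, width_octaves=2.0):
    """Hypothetical weighting function: a Gaussian over the distance in octaves
    from a center frequency. Any other curve could be substituted."""
    distance = math.log2(f / center_hz)
    return math.exp(-0.5 * (distance / width_octaves) ** 2)


def weighted_pitch_class_distribution(semitone_volumes, weight=g):
    """Fold semitone volumes (midi note -> volume) into twelve pitch classes,
    weighting each semitone by g(f) of its frequency."""
    distribution = [0.0] * 12
    for note, volume in semitone_volumes.items():
        distribution[note % 12] += weight(midi_to_hz(note)) * volume
    return distribution
```

With such a weighting, two equally loud pitches of the same pitch class contribute differently to the pitch class volume information distribution if they lie in different octaves.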
Apart from that, it is possible, additionally or alternatively, to integrate and/or include an input value integrator into the measurement device 1000, which temporally integrates the audio signal or a signal derived from the same until the absolute value of the resulting sum vector shows a maximum. By this, a detection system results, as it was already explained in connection with
In the following sections, some further embodiments of the present inventive device are explained and outlined.
In the patent application filed on the same day with the title “Device and method for generating a note signal and device and method for outputting an output signal indicating a pitch class” it is described, how a mobile phone may also be used as a musical instrument, by a user interface, which is similar to the HarmonyPad illustrated in
Further, in the above-cited patent application a so-called DJ tool is explained. The same is an input and output device, i.e., for example, the HarmonyPad explained in
A further embodiment of the present invention is an extension of a keyboard or another electronic sound generator by an accompaniment system 170, described in connection with
In the above-mentioned patent application which was filed on the same day, an integration of the HarmonyPad, also cited in
The current iPod® comprises a circular touch-sensitive area for operating the device. This circular area may be used as an input medium for the HarmonyPad. In addition to that, it is possible to extend the iPod® by a harmony analysis function and/or a harmony analysis device which operates on the basis of the sum vectors. This function analyzes the key and the starting angle and opening angle present at a certain point of time and makes the corresponding circle segment on the iPod® light up. In addition to that, optionally the iPod® may now also be equipped with a sound generator, so that bright kids may enhance their music with trendy accompaniment harmonies. It is to be noted, that this function may need suitable music. Also here, an inventive device for analyzing an audio datum in the form of an accompaniment system, a measurement system or a detection system, as it was explained in connection with
A further embodiment of the present invention represents an automatic accompaniment system which includes an inventive device for analyzing an audio datum and an automatic accompaniment device, which are coupled to each other, as it was already described in connection with
Further embodiments of the present invention represent systems in which an inventive device for analyzing an audio datum or a device for generating a note signal is coupled to a space sound generator to enable a linking with a space sound, a space sound event or other sound parameters. By the symmetry model and the circle of thirds, tonal information, for example in the form of the selected spatial section and/or the input angle and/or the input angle range, and the analysis signal based on the sum vector are geometrically represented very efficiently. Today's reproduction systems and/or space sound systems make it possible to reproduce sound at certain spatial positions. There is thus the possibility, in the case of a coupling of a device for generating a note signal with a space sound system, for example to route the (starting) angle, the opening angle and/or the radius of the currently selected circle segment to spatial parameters like direction, diffuseness, expansion of the sound in space, etc. and/or to perform a corresponding assignment. It is just as well possible, in the case of a coupling of an inventive device for analyzing an audio datum to a space sound system, to perform a corresponding assignment to the parameters of the space sound system based on the analysis signal, i.e. in particular on the basis of the information contained within the same regarding the angle and/or the length of the sum vector. In addition to that, it is possible to route these parameters to a frequency-dependent transmission function or to the time course, for example by means of ADSR envelopes (attack-decay-sustain-release), and thus link harmony, sound color and/or sound position with each other.
Another embodiment for an inventive device for analyzing an audio datum within the context of a measurement system, as it was already described and explained in more detail with reference to
Smaller implementations which may be held in the hand are also possible. Such systems, which may, for example, be implemented in the form of the already described HarmonyPad or the DJ tool, enable people without absolute hearing to quickly detect the played pitches of a piece of music and the tonal context.
Depending on the target group, one of the systems described within the scope of the present invention, i.e. in particular an accompaniment system, a measurement system, a detection system or the inventive method for analyzing an audio datum, may be realized in software and/or in the form of a computer program product for a computer, a PDA (personal digital assistant), a notebook, a Gameboy®, a mobile phone or another computer system and/or another processor means. The same may optionally be implemented together with the method for generating a note signal upon a manual input and/or the method for outputting an output signal indicating a pitch class, as they were described within the scope of the above-cited patent application which was filed on the same day.
Optionally, here a networking of different systems is further possible, which may also run on physically separated computer systems and/or processor means. By this, individual components of the different systems may be networked to enable a data exchange, wherein the components run on separate processor means. Thus, it is, for example, possible, to network different Gameboys® of several children to enable the latter to play together within the context of a “Gameboy band”. The children may in this case be supported by the inventive method for analyzing an audio datum, which runs on the Gameboys® in the form of software, by the software offering proposals to the children for accompanying the other children based on the analysis signal generated within the scope of the inventive method. Concretely, this may be done, for example, by the sum vector being represented on the display of the Gameboy®.
Another possibility is to couple a musical instrument with a melody analysis device and/or a device for analyzing an audio datum, which may be implemented as an external component or as part of the musical instrument. In the case of an external melody analysis device, the same may, for example, be coupled to the musical instrument via midi signals. In this case, the possibility results that a child or another person plays a simple melody, for example on a flute. The melody of the flute may be detected by a microphone or another sound reception means with the help of the melody analysis device and, for example, be converted into midi signals and provided to the musical instrument. If the melody analysis device is not an external component, a conversion into (midi) signals may not be needed. The signals are mapped and/or transmitted to the musical instrument and represented there. By this, the person playing the musical instrument, for example another child, may now generate a suitable accompaniment to the melody of the flute.
A special advantage of the inventive device for analyzing an audio datum here comes to the fore, when more than one child is playing a flute. Should in this case even several children “not hit the right note”, then the inventive device nevertheless enables a determination of the currently played chord and/or the currently played key with a very high reliability, as, due to the weighting of the intermediate vectors within the context of the vector calculation means with the volume information distribution and/or a distribution derived from the volume information distribution, also individual pitches which are not too loud do not strongly disturb the result of the analysis in the form of the sum vector and/or the analysis signal based on the sum vector. It is, rather, to be expected that only the length of the sum vector is slightly reduced and a slight inaccuracy with regard to the sum vector occurs. The inventive device for analyzing an audio datum and/or the inventive method thus also enables an analysis of an audio datum when “interfering components” are mixed among the audio datum (for example in the form of a child playing “wrong tones”).
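The limited influence of a quiet interfering pitch on the sum vector may be illustrated by a small numerical sketch. The angle table is a hypothetical symmetry model layout for C major, and the volumes are freely chosen example values; the sketch illustrates the principle and does not reproduce a measurement.

```python
import cmath
import math

# Hypothetical angles (degrees) of the pitch classes in a symmetry model of C major.
ANGLES_DEG = {"C": 0.0, "e": 60.0, "G": 105.0, "b": 165.0,
              "d": 210.0, "F": 255.0, "a": 315.0}


def angle_and_length(volumes, angles_deg=ANGLES_DEG):
    """Weight one unit vector per pitch class with its volume, add the
    resulting intermediate vectors and return (angle in degrees, length)."""
    vector = sum(volume * cmath.exp(1j * math.radians(angles_deg[name]))
                 for name, volume in volumes.items())
    return math.degrees(cmath.phase(vector)) % 360.0, abs(vector)


# A C major chord played at full volume ...
clean = {"C": 1.0, "e": 1.0, "G": 1.0}
# ... and the same chord with a quiet pitch F that does not belong to it.
disturbed = dict(clean, F=0.2)

clean_angle, clean_length = angle_and_length(clean)
disturbed_angle, disturbed_length = angle_and_length(disturbed)
```

Under these assumptions, the sum vector of the disturbed chord is somewhat shorter than that of the clean chord and its angle shifts by only a few degrees, so that the chord remains recognizable.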
Depending on the circumstances, the inventive method for analyzing an audio datum may be implemented in hardware or in software. The implementation may take place on a digital storage medium, in particular a floppy disc, CD or DVD having electronically readable control signals, which may cooperate with a programmable computer system so that the inventive method for analyzing an audio datum is performed. In general, the invention thus also consists in a computer program product having a program code stored on a machine-readable carrier for performing the inventive method, when the computer program product runs on a computer. In other words, the invention may also be realized as a computer program having a program code for performing the method, when the computer program runs on a computer or another processor means.
While this invention has been described in terms of several embodiments, there are alterations, permutations, and equivalents which fall within the scope of this invention. It should also be noted that there are many alternative ways of implementing the methods and compositions of the present invention. It is therefore intended that the following appended claims be interpreted as including all such alterations, permutations and equivalents as fall within the true spirit and scope of the present invention.
While preferred embodiments of the present invention have been described above, it is to be understood that variations and modifications will be apparent to those skilled in the art without departing the scope and spirit of the present invention. The scope of the present invention, therefore, is to be determined solely by the following claims.
Number | Date | Country | Kind |
---|---|---|---|
10 2006 008 260.5 | Feb 2006 | DE | national |

Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP07/00560 | 1/23/2007 | WO | 00 | 9/12/2008 |