The software programs on the enclosed CD-ROM Appendix, attached to the file of this patent application, with identical CD-ROM Copy 1 and Copy 2, are incorporated by reference herein. The software programs are: File name: CiliaASCII.txt; Created: Mar. 27, 2006; Size (bytes): 201,000; and File name: CiliaMicroprocessorASCII.txt; Created: Mar. 27, 2006; Size (bytes): 42,000.
This relates to an electronic musical instrument. Keyboard and percussion electronic musical instruments are widely known. There is a need, however, for electronic musical instruments that are based on other types of musical instruments such as wind instruments.
The present invention is an electronic musical instrument that in appearance and playing characteristics closely resembles flute-like instruments such as a conventional flute or a shakuhachi. The instrument comprises an electronic controller that has operating characteristics that resemble a flute and computer software executable on a computer for converting signals from the controller into data suitable for generating complex sound from conventional speakers. Thus, the instrument provides the complexity and nuance of control of an acoustic instrument while being capable of generating sounds that an acoustic instrument cannot make.
In a preferred embodiment the controller comprises a housing, a mouthpiece mounted on the housing, and a plurality of finger track pads mounted on the housing and positioned so that a player's fingers can engage the track pads while the mouthpiece is held to his or her mouth. In a preferred embodiment, the mouthpiece comprises a wind separator having first and second major surfaces and a microphone mounted on each of the first and second surfaces. The wind separator splits the player's air column using an open lip technique while the microphones function as amplitude sensors. Preferably, there are five track pads positioned to be engaged by two fingers of each hand and one of the player's thumbs. Preferably, the controller also comprises a power amplifier for amplifying the signals from the microphones and a microprocessor for processing the signals from the track pads.
The computer software processes breathing events detected by the microphones and fingering events detected by the track pads and uses the resulting signals to control a plurality of signal synthesizers and envelope generators. In a preferred embodiment, the signals from the track pads are processed by a microprocessor and forwarded via a USB MIDI interface to the computer while the signals from the microphones are forwarded via a Firewire audio interface to the computer. The signals output from the computer are supplied via the Firewire audio interface to a mixer, a power amplifier and finally to a speaker system.
These and other objects, features and advantages of the invention will be more readily apparent from the following Detailed Description in which:
The present invention is a flute controller driven dynamic synthesis system 100 schematically depicted in
A standard ¼ inch audio cable 109 connects to first interface 115; and a cable 110 connects microprocessor 108 to second interface 120. A USB cable 125 connects second interface 120 to first computer 135. Cables 130 and 155 connect first interface 115 to first computer 135 and back. An Ethernet cable connection 145 and an audio signal cable 150 extend from first computer 135 to second computer 140; and an audio signal cable 160 extends from second computer 140 to first interface 115. Stereo and audio cables 165 and 170 extend from first interface 115 to audio mixer 175 and from the mixer to power amplifier 185 and then to the left and right speakers 185 and 190.
Preferably, microphone amplifier 106 is connected to a Firewire audio interface 115. Firewire is a recording industry standard protocol for transmission of audio data, such as, for instance, a Metric Halo Mobile I/O, or a comparable 8-channel in and out interface. Preferably, microprocessor 108 implements the MIDI protocol; and as a non-limiting example, the second interface 120 is a MOTU MIDI Express XT. Like all comparable commercial products, it enables many routing options for large amounts of data. It is capable of handling far greater amounts of data transmission than is generally needed for the present invention.
The use of second computer 140 is optional. Three types of control data are provided at the output of first computer 135: basic note data, volume data, and preset changes. In the absence of the second computer, this data is passed back by Firewire cable 155 into the Firewire interface 115 where it controls the signals provided to the sound system via cables 165 and 170. In the alternative, the control data from first computer 135 is provided to second computer 140 where it undergoes additional processing. In that case, the output from second computer 140 is routed back into the Firewire interface 115, where it controls the signals provided to the sound system.
While the use of a second computer 140 is not needed for fully functional performance, it is generally useful to accomplish more dynamic musical objectives in terms of categories of timbre or sonic color, and the way in which multiple simultaneous voices are brought into relation with one another, call it voicing or layering. There are four categories of timbre: instrument timbre, harmonic timbre, timbre density, and texture. Of voicing and layering, there are likewise four: monophony, homophony, heterophony and polyphony. Together, these concepts enable description of the inner horizon of sound. The accomplishment of dynamic musical objectives entails complex synthesis, which in turn requires a large amount of CPU expenditure. All of the synthesis could be packed into one application, but only at the expense of slower response to the controller.
Referring to
Referring to the side view of
In one embodiment, the present invention uses microphones unconventionally as signal amplitude sensors. Whereas microphones conventionally act as converters from acoustic sound to an electronic audio signal, in the present invention the microphones perform the unconventional function of signal amplitude sensors, and do so by responding to the friction noise from the blowing of air directly onto the microphone surface. Friction noise is a by-product of strong fluctuations of air molecules moving in and out of the microphone which causes the microphone to overload and to generate noise instead of passively representing sound in the acoustic environment. The present invention uses this phenomenon at very low gain levels of operability of the microphone, where the noise does not produce distortion in the signal. At the higher gain levels normally needed to record acoustic sound, the noise causes microphone overload and distortion in the signal. Overload and distortion is what recording engineers especially attempt to avoid in the conventional use of the microphones.
A non-limiting example of frequency tracking techniques in generating control data is as follows. The breath sound is routed through a filter on the computer. The filter routes the breath sound through specified bandwidth channels (i.e. Low, Middle and High). The breath will either be complex enough, or not, in its frequency spectrum, such that sound will pass through any or all three channels. Typically there will be some signal at all three bandwidths, but the amplitudes of those signals can be quite different. The amplitude can be measured and calculated. Threshold triggers can be introduced so that a toggle is turned on when the amplitude exceeds a specified value.
The alternative embodiment in
The alternative embodiment in
It will be apparent to those skilled in the art that various changes and modifications can be made without departing from the spirit and scope of the present invention. Without being limiting, such modifications can include: a variation in the array, number, type, detection input and amplifying of microphones or other signal amplitude sensors; and a variation in the number and placement of wind separators.
Referring to FIGS. 4 A-C, it will be further appreciated by those of skill in the art that a number of characteristics interplay in the design of the microphones or other signal amplitude sensors. Without being limiting, such design concepts center on: housing for the microphones (open or closed); ergonomic microphone principles; maximization of performance efficiency; and comfort of the player. As a non-limiting examples, performance efficiency considerations in the development of one embodiment of the invention include: 1) mounting and proximity positioning of microphones 205 and 215 in relation to each other and to the mouth; 2) placement of a wind separator 210 such as to control the splitting of the air column in terms of distance; 3) designing a lip plate 295 capable of providing a stable physical reference point for the player, such that consistent movements and performance practices can be developed.
As shown in
Each of the five finger track pads produce three continuous controls: X, Y and Z parameters. The positions of the finger on the finger track pad are: X—up and down, Y—sideways, left to right. Both X and Y controls have high resolution, producing a stream of numbers from 0 to 6000 depending on the X and Y position of the thumb or finger on the track pad. The Z parameter measures the percentage of the finger track pad area covered. It is effectively a pressure sensor because the player needs to press harder to cover greater area. The Z control has a lower resolution producing a stream of numbers in the range from 0 to 256 depending on the percentage of the pad that is covered. The finger track pads are set so that the tendency is to use the index and ring fingers. The thumb pad is normally used by the left hand. There is no thumb pad for the right hand. The right thumb and little finger are used to hold and stabilize the instrument.
Finger track pad mounts 222, 223, 224 enable the player to access the entirety of the finger track pad. The mounts are customized milled mounts that are cut to allow the edges and sides of the track pads to be completely available to touch. The milled mounts are aluminum pads custom shaped to secure the entire surface of the finger track pad and make it available for the largest range of possible finger actions. Specialty cables (not shown) connect with the finger trace pad at a 90 degree angle allowing the cable to be routed directly into the body of the instrument.
It is a further object of the present invention to provide an ergonomic design for flute controller driven dynamic synthesis system 100. Among others, several preliminary guiding principles include the need: to exert as little physical effort as possible; to optimize the efficacy of the physical gestures involved in performing, and to provide a look that is aesthetically pleasing to the senses.
In this example, the flute controller's performance gestures are modeled on the shakuhachi flute. These gestures are distinguished by breath technique and fingering technique. The breath technique on the shakuhachi directs the wind forward and backwards, and to either side as well. It thereby introduces a wide range of timbre differences into the tone production. The technique of the transverse silver flute by contrast is inspired by a “bel canto” (beautiful voice) model of tone production, and the technique aspires to keep the wind direction very stable, thereby not introducing sudden timbre shifts into the tone production. The flute controller of the present invention is conceived as a timbre oriented instrument for which the shakuhachi model provides a greater appeal.
In this example, the flute controller's body is also modeled on the shakuhachi flute. The single most important feature of the shakuhachi body for ergonomic considerations is that it is a vertical flute, not a transverse flute. The body symmetry demonstrated in holding a vertical flute is less fatiguing than the left-to-right asymmetry demonstrated in holding a transverse flute. Verticality is the first principle.
It can be appreciated by those of skill in the art that even ostensibly small differences in the physical requirements in holding and manipulating an instrument can become very significant fatigue factors when one considers the hours of activity the musician devotes to practicing. A condition for virtuosity on an instrument is facility at a micro-gestural level (e.g., the single finger shadings over the finger hole that a shakuhachi player executes all the time are invisible to the audience member, but sonically very important to the “vitality” of the sound). In a sense, the musical player is like an engineer, constantly finding ways to ease and disperse load requirements, often by dynamically shifting and transferring the burden of that load.
Like most acoustic models, the shakuhachi has some ergonomic drawbacks as well as assets. Even though on average the shakuhachi is not very heavy (1 to 1.5 lbs), a part of the technical problem is holding the instrument. The right hand can never loose its grip or else the instrument would fall. Ideally, the fingers which are operating the finger track pads should be entirely free from any such structural task. It detracts from what the finger can do on a finger track pad if it has to share in the task of carrying the instrument weight as well. Ostensibly, the index and ring fingers operate the finger track pads, but there are circumstances where it is optimal to extend the technique so that the middle and little fingers can operate the finger track pads as well. The left thumb is occupied always with its own finger track pad. By default that means that the right thumb is the remaining digit whose primary task is carrying the weight of the instrument. However, with this ergonomic design, when the thumb is overworked, the fatigue has negative consequences for other parts of the hand, and performance is compromised.
It is an object of the present invention to provide options for carrying the instrument weight, including, but not limited to, stress release options, and means for distributing and transferring the load. This invention considers the use of the little fingers for the task of holding the instrument. They are the least dexterous on the finger track pads, and are almost always available. Therefore, in one embodiment three digit holes are present: for the left little finger, right thumb and right little finger. As can be appreciated by one of skill in the art, other comparable digit and digit hole positions are also within the spirit and scope of the present invention. The present paradigm allows the player to shift the load, to “address” the finger track pads with the fingers from different angles, and to create additional musical performance options. For example, when taking the instrument weight with the right thumb, it is easier to roll the fingers onto the finger track pads, especially from the right side. When taking the instrument weight with the right little finger, it is easier for the other fingers to come down directly on top of the finger track pads. Staccato (short and sudden) type gestures are easier with this type of support.
As a non-limiting example, the present invention includes the use of neck straps, such as those used by saxophone players, as a means for bearing weight, for setting the proper relationship of control of the instrument to the body, and for introducing simplicity into the design concept.
The shakuhachi may serve as a weight solution paradigm. Few instruments are as ostensibly as simple as the shakuhachi—a single un-mechanized bamboo tube. At the same time, few instruments are as subtle and complex in their crafting as the shakuhachi. Furthermore, it is possible to solve the weight problems with the right choice of light materials, then the neck strap loses its advantage as a solution for bearing weight. Weight is only one of the criteria for selecting the material for the body. The body material would also have to be capable of housing wiring and electronic circuitry in a way that remains invisible and thus unobtrusive as far as the player is concerned. The material would also have to be malleable. It would also have to answer aesthetic requirements, i.e. “invisibility” within the sense of discerning the musical apparatus primarily through the micro-physical gestures of the player.
Non-limiting examples of materials that have been explored include cast resin, plexiglass and plastic assemblages. These materials generally fit the need for malleability, while at the same time equating “invisibility” with transparency. However, they are generally also negatively associated with certain structural defects. Resins tend to be brittle, especially for use on heavy loads. Plastic assemblages do not lend themselves easily to designs with complex curves, unless they are cast, in which case they present the above load issue.
By redefining “invisibility” as minimum volume visible from the front (the predominant playing position relative to an audience), the invention disclosed herein opens up the possibilities for use of other materials. In a preferred embodiment of the invention, rosewood and aluminum tubing are used. Rosewood is easily milled in three dimensions, which adds simplicity to making the housing for wiring and other electronics. It is also very light and robust. It can bear significant load when cut and shaped strategically with respect to load. Aluminum is very light: also, aluminum tubing offers a useful cable transporting function. Together, the rosewood and aluminum tubing materials have a well-crafted look which combines traditional with high tech appearance.
A first attempt to mount the finger track pads set the left hand finger track pads at a left tilted angle, and the right hand finger track pads at a right tilted angle, and situated this in a foam board body. This set-up turned out to be an over determination which does not account for how adaptable and flexible the wrist is. If the finger track pads are mounted at one angle only, both wrists can easily accommodate the change and adapt. This experiment clarifies that the solution to many ergonomic problems rests with the player and his ability to quickly adapt his body to unpredictable performance situations. A working hypothesis is premised on the idea that if there is no perfect posture for the elbows-wrist-hand-finger combination, then a player would expect to develop a performance practice most easily when the “mechanics” of the instrument are simplified. Accordingly, in one embodiment the finger track pads were mounted uniformly such that each finger track pad would be addressed by a finger in the same way. However, while the foam board embodiment enabled assembly of the components in preliminary ways, it was not sufficiently robust and quickly deteriorated. Furthermore, a limitation arose from the inset. On the foam board version the finger track pads had been inset, such that the edges of the finger track pad were slightly covered, and such that the finger track pad was slightly depressed. The transition for the finger from the side of the finger track pad lacked smoothness, and created jumps in data as a result, whenever an action on the edge of the finger track pad was executed.
Another embodiment employs the use of plastic hardware. The first impetus behind the plastic embodiment was to create an instrument that was robust enough for performance. The plastic embodiment positioned the finger track pads top mounted flush with the body surface, and therefore enabled smooth performance actions from the edges of the finger track pads. As a downside, this version was much heavier than the previously described versions.
The accomplishments of the two early embodiments pertained largely to the finger track pads. Another embodiment of the invention further optimized the finger action of the finger track pads. Aluminum milled finger track pads mounts 222, 223, 224 were made that suspended the finger track pad slightly above the body (
In one embodiment, ergonomic developments of the mouthpiece are considered. With respect to the mouthpiece, some problems to be solved related to: the mounting of the mouthpiece at the top of the instrument; the shape of the neck; and the type of tubing material. As a non-limiting example, aluminum tubing is preferred because this metal is very light and allows a hidden passage for the microphone cables. Advantageously, the neck is also made adjustable. This serves a dual purpose: folding to facilitate transportation and packing; and allowing some minute adjustments in how the player holds the instrument. The latter is determined by the posture habits of the player, and by their comfort level with angling the instrument body towards or parallel with their own.
Important ergonomic considerations further relate to the outer appearance of the flute controller. The look of present-day electronic musical instrument systems tends to be either dominated by racks of gear, or by indefinite complexities of nuts, bolts, cables and boxes. In contrast, the present invention sought a look which is highly compact and simple in appearance. In a most preferred embodiment, this requirement is accomplished by the use of wireless technology. This simultaneously satisfies the criteria of aesthetics, ergonomics, and higher degree of mobility of the player in performance space. However, limitations as to transmission range and proximity to loudspeakers may result. In this respect, playing directly in front of a loudspeaker tends to create data feedback. If the room sound is loud enough, the microphone tends to detect sound even at low gains. In this context, data feedback is undesirable, as it takes control away from the player.
Illustratively, the microprocessor is programmed in BASIC or C++ to convert track pad data into MIDI protocol. The microprocessor 106 sends data using the MIDI protocol through port 307 by way of a standard 5-pin MIDI cable. More specifically, microprocessor 106 converts the electromagnetic data generated by moving the fingers over the surface of the finger track pads into high resolution data that can be transmitted using the MIDI protocol. Illustratively, parameter or axis X and parameter or axis Y each has a resolution in terms of a range of 0-6000 and parameter or surface percentage Z has a resolution in terms of a range of 0-256. Each finger track pad generates these three data streams. Therefore the microprocessor 106 sends continuous control signal data for three continuous controllers for each of the five finger track pads 225, 230, 235, 240, 245, resulting in fifteen continuous streams of control signal data in all. Without being limited, the processing of the control data also includes: monitoring for when only zeroes are being produced (when no finger is on the finger track pad) and not sending redundant values; and enabling diagnostics on the finger track pads; and enabling a visual report to be used in such diagnostics.
Amplifier 108 is a first stage of amplification of the microphone transducer signal. It supplies the minimal amount of voltage needed to push the signal to its destination in the Firewire interface 115. The amplifier output is provided to audio signal port 308. Audio signal port 308 is a standard mini cable plug at the controller contact point, and a standard ¼ inch plug at the Firewire interface 115 point of contact.
Serial I/O port 305 may be used for example as a diagnostic and development tool to help locate the source of malfunctioning of a finger track pad (i.e., the chip, the cable connections or the finger track pad itself). Visual output 306 is used by the same application as a diagnostic and development tool, such as for instance to provide a report for diagnostic purposes.
The embodiment of
Wireless technology can be implemented, without any limitation, by using Bluetooth or other comparable wireless technology for control data, and where applicable, other wireless transmission technology for audio data. Without being limited, criteria for choice of transmitter 312 center on the ability to program the transmitter 312 with respect to transmission frequency.
It is another object of the present invention to provide a means of dynamic control which achieves standards bearing the sound complexity of acoustic flutes. To this end, fingering events detected by the track pads 225, 230, 235, 240, 245 and blowing events detected by the microphones 205, 215 are used to control a plurality of signal synthesizers that are used to generate sound.
The processing of the breath events received from microphones 205, 215 is depicted in the flowchart of
There are several techniques that can be used to ‘massage’ the individual microphone data such that it becomes manageable musically including averaging, scaling, compression and ramping. They all have advantages and disadvantages, and so the solution to musical ends has to come through a combination and careful negotiation between such individual strategies. Averaging the data reduces the resolution and slows and reduces the bumpiness, but depending on the averaging sample size, possibly at the expense of quickness of response. Scaling contracts, expands or transposes the control data. Depending on where the data is being sent, different types of numbers may be used (natural integers or floats). Compression assures that there will be no numbers higher or lower than a desired bandwidth and protects the routine from being overloaded with an excessive value. Ramping is enormously useful in filling in the spaces (the larger intervals) of jittery data. However if the data is being received at a rate that is faster than the ramping rate, it does not help. Averaging in conjunction with ramping is very useful in achieving the aims of smoothness but not at the expense of a slow response. In addition to this, interval gating is another effective technique. Such a routine specifies an interval threshold. Any registered interval (jump in the data) greater than the specified threshold interval, results in a filtering out of the values that produce the jump. This technique has the one disadvantage in that one extreme value always makes it through the filter before the filter is activated. In other words, it is still a statistical technique and as such always falls a little behind the fact. But again, when used in conjunction with averaging and ramping, the danger of sudden large peaks in the received data is removed and the smaller peaks that find their way into the control stream are not large enough to be a problem; they are tolerable.
The control destination is important in determining what type of manipulation the original data needs. As a general rule, if the control destination directly affects an audio signal, it is important to achieve both smoothness and quick response.
Another consideration is the amount of delay that inevitably results from such routines. Delays of up to 100 milliseconds are tolerable from a musical time standpoint, and musical time is the criterion here.
Accordingly, the processing of the digital signals from the two microphones includes the steps of averaging 490-491, scaling 492-493, compression 494-495 and ramping 496-497 to generate tolerable basic amplitude streams. These streams are provided to outputs 447 or 448, interval gates 451 or 452, and to outputs 449 and 450. The two digital signals are also analyzed at step 446 to determine the maximum value of the raw microphone data streams. Averaging, scaling, compression or ramping is not needed in this case because the output of step 446 is only used to control a gate. If the output is above a threshold, a gate is opened, and if below, the gate is closed. It can be appreciated by those of skill in the art that sometimes the individual microphone data is pertinent as in outputs 447 and 448; and sometimes only the average of the two streams or the maximum of the two streams is of interest as at outputs 449 and 450. Interval gates 451 and 452 are employed to aid in stabilizing the routine which determines at step 323 the ratio of amplitude between the two microphones. This routine needs to achieve as much stability as possible because it is used in changing the microphone ratio zone 330, which in turn changes the basic fingering values 331 described in
As schematically depicted in
The flowchart of
Ideally, an on/off reading from only one of the three parameters (X, Y or Z) would be sufficient to determine whether the finger is on or off the finger track pad. But as the finger track pads have response idiosyncrasies, it is an object of the present invention to present a routine where all three parameters are combined to make this on/off determination. There are several reasons why relying on only one parameter may not indicate that the finger has left the finger track pad. Depending on how the microprocessor on the flute controller is programmed, there may occasionally be “hanging” values which persist after the finger has left the finger track pad. This may also be due to idiosyncrasies of the finger track pads themselves. The finger track pad's sensitivity differs towards the edge of the finger track pad; and there is less predictability at the numerical limits of all three controllers. A solution is found in the player adopting the appropriate performance practice sensitivity. There are instances when the finger track pads demonstrate proximity sensitivity, such that they generate data when the finger hovers close to them, but does not make direct contact. The flute controller player may, following practice, become flexible and capable of quick adjustment in order to take advantage of this sensitivity approach. As a further non-limiting solution, redundancy is introduced into the event detection routine to guarantee that none of these other factors influence the on/off toggle function.
The data from the four finger track pads is provided to a four finger track pad synchronizer 327. Synchronizer 327 provides discrete combinatorial control, which is possible on the basis of such rudimentary event detection, and through combination and synchronization of the four finger track pads. The combination of the event states of the four finger track pads yields a fingering output that specifies a configuration of the finger track pad states. This is a new control level based on the simple event detections of the individual finger track pads. It is discrete (step wise or incremental) as opposed to continuous (no discernable steps or increments between states). In one embodiment of the invention the thumb is not included in the fingerings as it serves several other specialized functions. The fingering output includes vented fingerings 436, non-vented fingerings 437, numeric fingerings 438, fingering patterns 439, and basic fingerings 331.
It is conventional to differentiate between “vented” and “non-vented” fingerings on a woodwind instrument. Vented fingerings 436 introduce “gaps” in the length of the fingered tube. On the flute controller there are 11 such vented fingerings. When implemented they have the specific function of changing specific waveforms that are used in the complex FM synthesizer 359 described below in conjunction with
Fingering patterns 439 is a discrete control derived from non-vented fingerings 437. The fingering pattern routine simply tracks sequences of non-vented fingering iterations. It is optionally implemented in selecting and implementing presets, which belong to a set of pre-determined signal routing configurations of what is “mixed” (
Numeric fingerings 438 (the determination of how many fingers [1, 2, 3 or 4] are on keys, whether vented or not) are available on the flute controller, but are redundant on an acoustic woodwind instrument. A feature of control data relied upon in one embodiment of this invention relies upon abstracting from the redundancy and assigning a specific functionality. In this application, the four possible values of numeric fingerings 438 are combined with the three possible values derived from the microphone ratio zone 330 of
The “basic fingerings” output 331 is used in the re-synthesizer 415 of
Frequency 332 indicates the assigning of frequency values to note designations, much like determining the pitch frequency of solfage (do, re, mi, etc.) designations, e.g., to determine that ‘la’ is 440 Hz. Control recipients of this data usually require only a note designation (1-12). Synthesis recipients require frequency values in order to generate audio signals.
Upon positive detection of an event by the software routine, four actions follow. First, the finger track pad data (digital data converted from analog) is processed at step 464 with regard to its on/off status, and its X, Y and Z parameter values are forwarded at step 468. Second, the microphone signal amplitude data (digital data converted from analog) is processed at step 465 with regard to two amplitude stream values, as well as derivative data (namely, mean, maximum, and ratio) and this data is forwarded at step 469. Third, any audio signal (breath noise, in digital format converted from analog) is processed at step 466 with regard to bandwidth amplitudes. Bandwidth resolution is variable, and upon its determination, bandwidth amplitude configurations are forwarded at step 470. This process is likewise in effect in other embodiments of the invention where a microphone array is used and where conventional use of the microphones is employed (
The sensor control data forwarded at steps 468, 469, 470 is processed at step 471 and output to networks 472, 477. Network 472 includes Control Network and Synthesis Routines (C.S.R.) that are used to control the synthesis of sound. In a preferred embodiment, there are three such routines, a noise generator, a complex synthesizer and an additive synthesizer described more fully in conjunction with
Particular C.S.R.s or combinations thereof are selected at step 478. Upon such selection, particular C.S.R.P.s or combinations thereof are selected at step 479. Since such selections affect the entirety of the system, they are handled with presets, data sets which enable large numbers of decisions to be made at once. The presets can be selected by control data generated at step 471, or through manual selection from the keyboard of the computer, or from predetermined timed sequences. For example, a player can scroll through presets at will using preset timings, or basing the clocking on more ‘subjective’ clocks such as the number of completed phrases (e.g., complete two complete phrases before scrolling to the next present in the predetermined sequence of presets). It is also possible to set ‘interval’ triggers and frequency pattern triggers. For example, if a basic note sequence 1, 2, 3 and 4 is played, then preset #5 is played; and if a basic note sequence 2, 4, 2 and 4 is played, then preset #10 is played.
For each of several channels of sound so far generated, amplitude envelope selection is then made at step 480. Amplitude envelopes can be shaped directly by the player's breath, or through a process independent of the player's breath, or through some combination thereof. Such decisions are also handled by presets. After the selection is made, the resulting sound is output to a conventional sound amplification system at step 481.
The computer software program for the flute control driven dynamic synthesis system (File name: CiliaASCII.txt; Created: Mar. 27, 2006; Size (bytes): 201,000) is attached to the file of this patent application on a CD-ROM, with identical Copy 1 and Copy 2, and is incorporated by reference herein.
The synthesizer functions include: a complex FM synthesizer 345 where “FM” indicates frequency modulation; an additive synthesizer 360; and a broadband white noise generator 340. The processing functions include: a “brick wall” filter 385; a two source cross synthesizer 390; an amplitude envelope generator 395; a re-synthesizer 415, a granular synthesizer 420 and a direct out 425. A term designation of “mix” on an item indicates a designation that any source connected to an item can pass through in any combination in the course of the designated process.
Control data from the finger track pads and the microphone, are routed to every part described in
Complex FM synthesizer 345 implements routines for cascading frequency modulation. It is characterized as complex because it is one of four parts of the synthesis path. It implements two waveform synthesis routines: a cascading FM routine, and a ring modulation routine. Synthesizer 345 is described in more detail in conjunction with
Additive synthesizer 360 is a sinusoidal generator that is capable of both sinusoidal addition and of waveform transformation. Synthesizer 365 is described in more detail in conjunction with
The “brick wall” filter 385 blocks any frequency not specified within a defined bandwidth. The “brick wall” filter 385 is a “spectral” filter, wherein this term implies a functional designation of filtering done in the digital domain, not the signal domain. The conversion into the data domain requires a Fast Fourier Transform (FFT) of the signal data numbers.
In an alternative embodiment of the invention, which employs conventional microphone use (
In an alternative embodiment of the invention, which employs unconventional microphone use, a broadband white noise generator 340 is used and dynamically controlled with “brick wall” filter 385. In this embodiment, the sound generated by the microphones is not utilized for the purpose of detecting direct audio input, primarily because its frequency character shows insignificant change over time, and further because it occupies a small mid-range bandwidth.
The two-source cross synthesizer 390 takes two original signal sources and recombines only certain aspects of those two sources into one new source, creating an audio morphing. This is a spectral procedure—that is, one performed on the digital data representing the frequency and amplitude spectra of the audio signal. Because it is a two source synthesizer, it needs two mixers. Typically, such a synthesizer takes the amplitude spectral data of one source and recombines it with the frequency spectral data of a second source.
The amplitude envelope generator 395 is operable to give the sound coming from the speaker (the very end of the sound generating process) an intuitive connection with the breath of the player. When breath from the player is registered on the instrument, this module insures that sound will follow which is commensurate in scope with the effort of blowing that the player demonstrates. To accomplish this, it resolves technical problems, such as: it enables quick response to breath contours; it resolves “jitters” or sudden large jumps in the breath signal data; and it smoothes the data at breath amplitude thresholds and thereby removes “glitches” or registrations of amplitude that are not intended musically. Further details of envelope generator 395 are set forth in
The re-synthesizer 415, also a spectral processor, takes the audio signal thus far processed, reproduces the frequency spectrum as a signal, but only with some specified original frequency content. The result in the sound is subtractive: frequencies are removed.
The granular synthesizer 420 functions to break up the source into samples whose size, separation, and pitch can be controlled. Finger track pad data is hardwired directly into this module. The granular synthesizer 420 enables both textural as well as timbre modifications of the source material.
The Y parameters from track pads 230, 235, 240, 245 are scaled and ramped at steps 348, 349, 350, 351, respectively. As noted above, the maximum scaling values of the Y parameters are controlled by the X parameters from the same track pad. The outputs of steps 348, 349, 350, 351 and input frequency in 346 are supplied to first waveform oscillator 352, second waveform oscillator 353, FM oscillator 354 and ring modulating oscillator 357 as follows. Frequency in 346 is derived from basic fingerings 331 of
The output of waveform oscillator 1 and waveform oscillator 2 are combined at 355 to produce cross-multiplied signals 1. The cross-multiplied signals 1 are combined at step 356 with the output of FM oscillator 354 to produce cross-multiplied signals 2. The cross-multiplied signals 2 are combined with the input frequency by ring modulating oscillator 357. Finally, the output of waveform oscillator 1 and the output of ring modulating oscillator 357 are combined by mixer 358.
It can be appreciated by those of skill in the art that the arithmetical variations of this synthesis engine are almost infinite. In one embodiment, one of the arithmetic configurations is to have clearly identifiable sonic results associable with every distinctive control gesture and combination of control gestures.
It can be appreciated by those of skill in the art that a number of ways for synthesis of control data can be implemented without departing from the spirit and scope of the present invention. Without being limiting, examples include variations in dynamic control configurations. Some synthesis implementations of the control data are more effective than others. There are two general criteria for evaluating the efficacy of dynamic control configurations. First, when considering the control combinations abstractly (without reference to their control destination) one can eliminate from scrupulous scrutiny complex combinations where one controller negates or compromises the effect of another. Two controllers inversely affecting the amplitude of a synthesis procedure will either average the amplitude with a single value (in the case where the mean is being produced), or create a constant jitter between disparate values (in the case where the control data is routed through the same ramping procedure). Second, the generated results should not involve undue self-cancellations when considering the control combinations with reference to their control destination. The player will be able to sense when there is an inappropriate degree of sonic response to an executed physical gesture. These variations appeal to a principle of efficiency: physical effort should not be wasted and routines should not be excessive. A player should be able to perform complex idiosyncratic synthesis routines and to catch such moments of waste by practice, playing and listening. It can be appreciated by the person of skill in the art, as is self-evident from the development history of any instrument, that the instrument maker anticipates results through science and calculation, but corrects, adjusts and modifies only after playing and listening.
The data is directly derived from the mouthpiece 200 through signal amplitude sensing provided by the microphones 205 and 215, and from finger track pads 225, 230, 235, 240, 245 through finger shading sensing. The raw microphone data is identified as data 315, 316. The raw thumb track pad data 430 is delivered to the application as X-data 317, Y-data 318, and Z-data 319. The left index track pad data is delivered to the application as X-data 320 Y-data 321 and Z-data 322. In similar fashion but not shown, the left ring finger pad X-data, Y-data and Z data are combined in the same way and routed to the second of the four type A sound generators. The right index finger pad X-data, Y-data and Z data are combined in the same way and routed to the third of the four type A sound generators. The right ring finger pad X-data, Y-data and Z data X-data, Y-data and Z data are combined in the same way and routed to the fourth of the four type A sound generators. As indicated by the filled in circle, all the raw data is continuous data meaning that there are no discernable steps. The raw microphone data undergoes preliminary processing which is identical for each of the two microphones. From the processed data from the first and second microphones, a microphone amplitude ratio 323 is obtained as described in more detail in conjunction with
As indicated in conjunction with
The complexity of the three final stages of control data is achieved through indirect control networking. It draws from several factors, including: generating and combining data streams from both breath and finger actions either alone or in combination; generating both continuous control and discrete control data (represented as filled or outlined circles, respectively), and the inherent complexity of the sensors themselves, where either a breath or a finger action immediately is capable of producing complex streams of data. Although the second controller stream overtone structure 336 is a direct feed from a finger pad Z parameter, it is still complex by virtue of being produced simultaneously with an X and a Y parameter and is not combined with other data from the network, and is accordingly also a dynamic form of control on the sound.
The multiplication value of second signal multiplication 411 is more complex. Truth value monitors 400 (envelope 2 reset), 403 (envelope 1 off), 404 (mean gate opened), 405 (maximum amplitude off detector), 407 (mean gate closed), and 408 (envelope 2 off) determine collectively whether the mean amplitude gate 409 allows mean amplitude control data 396 adjusted by the mean scaler 397 to determine a second stage of signal multiplication 411. If mean amplitude control data 396 is allowed through the mean amplitude gate 409, then the output signal amplitude 411 will be variable, but always in the audible range as the mean amplitude values have been scaled by scaler 397 from 0.5 (which is ½ of the original signal amplitude) to 1 which is the full original signal volume assuming that first signal multiplication 406 is set at multiplier value 1. If the mean amplitude gate 409 is closed, then automatic ramping procedures go into effect. Truth value monitor 408 (envelope 2 off) looks to maximum amplitude off detector 405 to determine if second signal multiplication 411 should be ramped down to multiplier value 0, effectively turning it off. The effect in sound is that the breath of the player has stopped and the synthesized sound lingers before ramping down.
Truth value monitor 400 (envelope 2 reset) looks to detector 401 (maximum amplitude on) to determine if second signal multiplication 411 should be ramped up to multiplier value 0.5, effectively setting it in a ready position to receive the signal from first signal multiplication 406. In this case, second signal multiplication 411 is again subject to mean amplitude 396 control because the mean amplitude gate 409 is opened by truth value monitor 404 (mean gate opened) which is responding to a positive value from detector 401 (maximum amplitude on detector).
Amplitude data received at this stage in the program still demonstrates jitter at the threshold of silence. A player may think that he is playing a rest, but some little transient jitter such as the accidental smacking of the lips causes a little amplitude bump. Again the acoustic flute paradigm is instructive in shaping a program solution.
The interior acoustics of the shakuhachi tube (resonances, reflections and resistances) enables ramping of the volume into silence easily. Strictly speaking, reflected sound continues after the player stops blowing. It is certainly true with a room, but also at a micro-level within the space of the shakuhachi tube. Reflected sound is simulated not by using a conventional effect such as reverb, but by using delayed ramping.
The maximum volume 398 activates an attack portion of the ramped envelope 402 which freezes at that level 406 until it receives a ‘0’ value from the maximum amplitude of the two microphones. When the breath stops, the maximum amplitude reads zero, triggering the fixed first envelope 406 down to zero. And upon this zero, the modifying amplitude envelope 410 also slopes down to zero. There is always a controlled ramping down after the breath has stopped.
The second problem happens when controllers are inflexibly stable. The mean amplitude 396 is used to modify the first amplitude envelope as when the mean amplitude gate 409 is opened.
The first signal multiplication 406 holds the amplitude at one level as long as the player blows at whatever volume. There are micro-inconsistencies, moments of indecision or decision in the breath technique of wind players which make for nuance and vitality. To retain this vitality, the second signal multiplication 411 introduces micro variation in the amplitude, but with a stability provided by first signal multiplication 406.
It will be apparent to those of skill in the art that other signal amplitude sensor and microphone models and arrays can also function—within the spirit and scope of the present invention—to capture alternative variations in the quantity of calculation and the amount of control.
Variations in discrete control can be based on detecting and amplifying input data streams, including, but not limited to, the following control parameters: volume of each microphone individually, mean volume, maximum rough volume, maximum volume, continuous ratio and ratio threshold.
In one embodiment of the invention, tubes 298 and 296, as depicted in
In a wireless embodiment of the invention, the flute controller may be heavier and less ergonomic due to the need for battery power. In an alternative ergonomic light design embodiment of the flute controller, a design solution to the heavier weight may be found, without any limitation, by tethering the transmitter and battery to an external unit fastened to the player's belt or clothing.
It can be appreciated by those of skill in the art that embodiments of the invention that require use of more than two microphones may, without limitation, require audio transmission re-engineering due to an increase of the weight of the instrument when the controller is outfitted with the additional components needed for multi-channel (greater than stereo) wireless audio transmission.
In an alternative light-weight embodiment of the invention, additional microprocessors may be introduced such as to allow for the basic analog-to-digital conversion of the microphone signal to be done on the flute controller itself.
In one alternative light-weight embodiment of the invention, a second microprocessor may be implemented, particularly in association with the use of low resolution (8-bit) analog-to-digital conversion processing. It is an object of the present invention to provide a means for simplification of the data conversion process. It can be appreciated by those of skill in the art that where the instrument utilizes an unconventional use of the microphones as amplitude sensors, the application of low (8-bit) resolution data may serve to both convert the control data as well as simplify the data manipulation process involved in such a conversion. This engineering advantage resides with the ability to transmit control data with greater ease than audio signals, as less control data is required to be transmitted at lower resolutions.
In one embodiment of the invention, the Bluetooth wireless technology may be utilized. It can be appreciated by the person of skill in the art that there are numerous available technologies for wireless transmission of control data.
In an alternative embodiment of the invention, which uses the additional microphone in a conventional way (as in
This application claims the benefit of provisional application No. 60/787,148, filed Mar. 28, 2006, which is incorporated herein in its entirety.
Number | Date | Country | |
---|---|---|---|
60787148 | Mar 2006 | US |