A portion of the disclosure of this patent document may contain material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure as it appears in the patent and trademark office patent file or records, but otherwise reserves all copyright rights whatsoever.
One or more embodiments relate generally to filterbanks (or filter banks) for multiband sound equalization, and in particular, to obtaining a target frequency response gain from a filterbank using a trained artificial intelligence model.
Multiband graphic equalizer uses frequency adjacent filters to obtain a desired frequency response. The gains of each filter is manually adjusted by trial and error until the target response is obtained. Adjacent filters interact with each other and iterative tuning by an expert is needed to obtain the desired equalization with good precision.
Interactions between filters make it difficult to obtain a given target equalization, i.e. specific gains at specific frequencies. It is an iterative and tedious task that requires expertise. Conventional algorithms based on linear approximation exist, but these have limited precision.
One embodiment provides a computer-implemented method that includes accessing an artificial intelligence (AI) model trained for a filterbank based on a control gain of the filterbank and a resulting frequency response gain. Based on a target frequency response gain inputted into the trained AI model, a control gain applicable to a filter in the filterbank (for example, a respective control gain applicable to each filter in the filterbank) is outputted. The target frequency response gain is obtained at a center frequency of the filter in the filterbank.
Another embodiment includes a non-transitory processor-readable medium that includes a program that when executed by a processor performs obtaining a target frequency response gain from a filterbank using a trained AI model, including accessing, by the processor, an artificial intelligence model trained for a filterbank based on a control gain of the filterbank and a resulting frequency response gain. The processor further provides outputting, based on a target frequency response gain inputted into the trained AI model, a control gain applicable to a filter in the filterbank (for example, a respective control gain applicable to each filter in the filterbank). The processor additionally obtains the target frequency response gain at a center frequency of the filter in the filterbank.
Still another embodiment provides an apparatus that includes a memory storing instructions, and at least one processor executes the instructions including a process configured to access an AI model trained for a filterbank based on a control gain of the filterbank and a resulting frequency response gain. The process further outputs, based on a target frequency response gain inputted into the trained AI model, a control gain applicable to a filter in the filterbank (for example, a respective control gain applicable to each filter in the filterbank). Additionally, the process obtains the target frequency response gain at a center frequency of the filter in the filterbank.
These and other features, aspects and advantages of the one or more embodiments will become understood with reference to the following description, appended claims and accompanying figures.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
For a fuller understanding of the nature and advantages of the embodiments, as well as a preferred mode of use, reference should be made to the following detailed description read in conjunction with the accompanying drawings, in which:
The following description is made for the purpose of illustrating the general principles of one or more embodiments and is not meant to limit the inventive concepts claimed herein. Further, particular features described herein can be used in combination with other described features in each of the various possible combinations and permutations. Unless otherwise specifically defined herein, all terms are to be given their broadest possible interpretation including meanings implied from the specification as well as meanings understood by those skilled in the art and/or as defined in dictionaries, treatises, etc.
A description of example embodiments is provided on the following pages. The text and figures are provided solely as examples to aid the reader in understanding the disclosed technology. They are not intended and are not to be construed as limiting the scope of this disclosed technology in any manner. Although certain embodiments and examples have been provided, it will be apparent to those skilled in the art based on the disclosures herein that changes in the embodiments and examples shown may be made without departing from the scope of this disclosed technology.
One or more embodiments relate generally to a computer-implemented method that includes accessing an artificial intelligence (AI) model trained for a filterbank (or filter bank) based on a control gain of the filterbank and a resulting frequency response gain. Based on a target frequency response gain inputted into the trained AI model, a control gain applicable to a filter in the filterbank (for example, a respective control gain applicable to each filter in the filterbank) is outputted. The target frequency response gain is obtained at a center frequency of the filter in the filterbank.
AI models may include a trained machine learning (ML) model (e.g., models, such as a neural network (NN), a convolutional NN (CNN), a deep NN (DNN), a recurrent NN (RNN), a Long short-term memory (LSTM) based NN, gate recurrent unit (GRU) based RNN, tree-based CNN, self-attention network (e.g., an NN that utilizes the attention mechanism as the basic building block; self-attention networks have been shown to be effective for sequence modeling tasks, while having no recurrence or convolutions), BiLSTM (bi-directional LSTM), etc.). An artificial NN is an interconnected group of nodes. A NN is interconnected layers of small units referred to as nodes that perform operations to detect patterns in data. Neurons are a basic building block of a NN that takes weighted values, performs a calculation and produces output. The input to the NN is the data/values that are passed to the neurons. A NN is made of several neurons stacked into layers. All intermediate layers are referred to as hidden layers, and the number of layers in a network determines the depth of the model.
In some embodiments, training of an AI model, for the filterbank 120, is performed to develop a relationship between control gains 117 of the filterbank 120 and a resulting FR (Final FR 125) gain. Based on a target FR 110 gain inputted into the trained machine learning model (NN model 115), a control gain 117 applicable to a filter in the filterbank 120 (for example, a respective control gain applicable to each filter in the filterbank) is outputted. The control gain 117 applied to a filter in the filterbank 120 produces an output FR (Final FR 125 gain) that matches the target FR 110 gain within an allowable deviation. Obtaining the target FR 110 gain at a center frequency of a filter in the filterbank 120.
In some embodiments, for a given filterbank 120 composed of N filters (e.g. a graphic equalizer) centered at N specific frequencies Fk, k=1 . . . N, the NN model 115 is trained to learn the relationship between control gains 117 (inputs) and resulting FR (final FR 125 gain) gains (outputs) at Fk. For the NN model 115, the training: Inputs=M vectors of N scalars (dB control gain applied to each filter). The outputs=M vectors of N scalars (dB gains obtained each frequencies Fk). The NN model 115 inference provides that at inference time, the target FR 110 gains are input in the NN model 115 that calculates the corresponding control gains 117. These are applied in turn to the filterbank 120. The final FR 125 gain obtained from the filterbank 120 matches the target gains at the specified frequencies Fk.
at ⅓ octave frequencies from 20 to 20 kHz. The Gains=dB vector∈
for a filterbank 365 with ten (10) filters. The NN model 330 operates on the filterbank 365 to result in the Final FR 370 gain.
. The (second order section (SOS)) coefficients 620=dB vector∈
for a PEQ's 625. The NN model 615 operates on the PEQ's 625 to result in the Final FR 630.
. The control gain(s) 720=dB vector∈
for a filterbank 725 with six (6) filters. The NN model 715 operates on the filterbank 725 to result in the Final FR 730
In some embodiments, process 800 further provides that the trained machine learning model (e.g., NN 115 (
In one or more embodiments, process 800 additionally provides that the control gain applied to the filter produces an output frequency response gain that matches the target frequency response gain within an allowable deviation.
In some embodiments, process 800 further includes that the machine learning model comprises an NN (e.g., NN 115 (
In one or more embodiments, process 800 includes the feature that the NN provides that target frequency response gains are obtained at center frequencies of each filter in the filterbank.
In some embodiments, process 800 additionally provides the NN adjusts control gains of N filters to control target frequency response gains at M points (see, e.g., pipeline 600 (
In one or more embodiments, process 800 further provides the feature that the NN adjusts all coefficients of a given set of biquad filters to obtain the target frequency response gain (see, e.g.,
Information transferred via communications interface 907 may be in the form of signals such as electronic, electromagnetic, optical, or other signals capable of being received by communications interface 907, via a communication link that carries signals and may be implemented using wire or cable, fiber optics, a phone line, a cellular phone link, a radio frequency (RF) link, and/or other communication channels. Computer program instructions representing the block diagram and/or flowcharts herein may be loaded onto a computer, programmable data processing apparatus, or processing devices to cause a series of operations performed thereon to produce a computer implemented process.
In some embodiments, processing instructions for process 800 (
Embodiments have been described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products. Each block of such illustrations/diagrams, or combinations thereof, can be implemented by computer program instructions. The computer program instructions when provided to a processor produce a machine, such that the instructions, which execute via the processor create means for implementing the functions/operations specified in the flowchart and/or block diagram. Each block in the flowchart/block diagrams may represent a hardware and/or software module or logic. In alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures, concurrently, etc.
The terms “computer program medium,” “computer usable medium,” “computer readable medium”, and “computer program product,” are used to generally refer to media such as main memory, secondary memory, removable storage drive, a hard disk installed in hard disk drive, and signals. These computer program products are means for providing software to the computer system. The computer readable medium allows the computer system to read data, instructions, messages or message packets, and other computer readable information from the computer readable medium. The computer readable medium, for example, may include non-volatile memory, such as a floppy disk, ROM, flash memory, disk drive memory, a CD-ROM, and other permanent storage. It is useful, for example, for transporting information, such as data and computer instructions, between computer systems. Computer program instructions may be stored in a computer readable medium that can direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.
As will be appreciated by one skilled in the art, aspects of the embodiments may be embodied as a system, method or computer program product. Accordingly, aspects of the embodiments may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, aspects of the embodiments may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon.
Any combination of one or more computer readable medium(s) may be utilized. The computer readable medium may be a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain or store a program for use by or in connection with an instruction execution system, apparatus, or device.
Computer program code for carrying out operations for aspects of one or more embodiments may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
Aspects of one or more embodiments are described above with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer readable medium that can direct a computer, other programmable data processing apparatus, or other devices to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.
The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.
References in the claims to an element in the singular is not intended to mean “one and only” unless explicitly so stated, but rather “one or more.” All structural and functional equivalents to the elements of the above-described exemplary embodiment that are currently known or later come to be known to those of ordinary skill in the art are intended to be encompassed by the present claims. No claim element herein is to be construed under the provisions of 35 U.S.C. section 112, sixth paragraph, unless the element is expressly recited using the phrase “means for” or “step for.”
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises” and/or “comprising,” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
The corresponding structures, materials, acts, and equivalents of all means or step plus function elements in the claims below are intended to include any structure, material, or act for performing the function in combination with other claimed elements as specifically claimed. The description of the embodiments has been presented for purposes of illustration and description, but is not intended to be exhaustive or limited to the embodiments in the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the invention.
Though the embodiments have been described with reference to certain versions thereof; however, other versions are possible. Therefore, the spirit and scope of the appended claims should not be limited to the description of the preferred versions contained herein.