The present disclosure relates to the field of communication technologies, and in particular, to an audio signal processing method and apparatus, an electronic device, and a computer-readable storage medium.
Hearing aids as professional equipment usually need face-to-face communication with audiologists in offline stores for fitting. For example, a fitting process with an audiologist may include as follows: The audiologist first needs to test hearing of a user, and then adjusts, according to a hearing test result by using a prescription formula, a parameter of a hearing aid worn by the user. As can be seen, such fitting process of the hearing aid is cumbersome, which leads to low efficiency.
Embodiments of the present disclosure provide an audio signal processing method and apparatus, an electronic device, a computer-readable storage medium, and a computer program product, which can implement efficient fitting of an audio device.
Technical solutions in the embodiments of the present disclosure are implemented as follows:
An embodiment of the present disclosure provides an audio signal processing method, including: displaying a hearing test control in a human-computer interaction interface; outputting a first test audio signal in response to a trigger operation on the hearing test control; displaying a first hearing test result of a target object in response to a feedback operation on the first test audio signal; and transmitting, to an audio device in response to a configuration operation on the audio device, a first hearing assistance policy generated according to the first hearing test result, where the first hearing assistance policy is configured to be applied to the audio device to output a first audio signal adapted to the first hearing test result.
An embodiment of the present disclosure provides an audio signal processing apparatus, including: a display module, configured to display a hearing test control in a human-computer interaction interface; an output module, configured to output a first test audio signal in response to a trigger operation on the hearing test control, the display module being further configured to display a first hearing test result of a target object in response to a feedback operation on the first test audio signal; and a transmitting module, configured to transmit, to an audio device in response to a configuration operation on the audio device, a first hearing assistance policy generated according to the first hearing test result, where the first hearing assistance policy is configured to be applied to the audio device to output a first audio signal adapted to the first hearing test result.
An embodiment of the present disclosure provides an audio signal processing method, including: obtaining a first hearing test result of a target object; determining, in descending order of frequencies of a plurality of sub-bands in an auditory frequency range, a filter parameter of each sub-band based on the first hearing test result, a filter parameter of a low-frequency sub-band being determined based on a filter parameter of a high-frequency sub-band; performing combination on the filter parameter of each sub-band, and using a combined filter bank parameter as a first hearing assistance policy for the target object; and transmitting the first hearing assistance policy to an audio device, where the first hearing assistance policy is configured to be applied to the audio device to output a first audio signal adapted to the first hearing test result.
An embodiment of the present disclosure provides an audio signal processing apparatus, including: an obtaining module, configured to obtain a first hearing test result of a target object; a determining module, configured to determine, in descending order of frequencies of each sub-band in an auditory frequency range, a filter parameter of each sub-band based on the first hearing test result, a filter parameter of a low-frequency sub-band being determined based on a filter parameter of a high-frequency sub-band; a combination module, configured to perform combination based on the filter parameter of each sub-band, and use an obtained filter bank parameter as a first hearing assistance policy for the target object; and a transmitting module, configured to transmit the first hearing assistance policy to an audio device, where the first hearing assistance policy is configured to be applied to the audio device to output a first audio signal adapted to the first hearing test result.
An embodiment of the present disclosure provides an audio signal processing method, including: receiving a first hearing assistance policy for a target object, the first hearing assistance policy including a filter bank parameter, the filter bank parameter including a filter parameter of a plurality of sub-bands in an auditory frequency range, the filter parameter of each sub-band being determined in descending order of frequencies based on a first hearing test result of the target object, and a filter parameter of a low-frequency sub-band being determined based on a filter parameter of a high-frequency sub-band; and outputting, according to the first hearing assistance policy, a first audio signal adapted to the first hearing test result.
An embodiment of the present disclosure provides an audio signal processing apparatus, including: a receiving module, configured to receive a first hearing assistance policy for a target object, the first hearing assistance policy including a filter bank parameter, the filter bank parameter including a filter parameter of each sub-band in an auditory frequency range, the filter parameter of each sub-band being determined in descending order of frequencies based on a first hearing test result of the target object, and a filter parameter of a low-frequency sub-band being determined based on a filter parameter of a high-frequency sub-band; and an output module, configured to output, according to the first hearing assistance policy, a first audio signal adapted to the first hearing test result.
An embodiment of the present disclosure provides an electronic device, including: a memory, configured to store an executable instruction; and a processor, configured to execute the executable instruction stored in the memory, to implement the audio signal processing method provided in the embodiments of the present disclosure.
An embodiment of the present disclosure provides a non-transitory computer-readable storage medium, storing an executable instruction, when executed by at least one processor, implementing the audio signal processing method provided in the embodiments of the present disclosure.
The embodiments of the present disclosure have the following beneficial effects:
A function of hearing testing and a function of configuring an audio device based on a hearing test result are integrated into a computer program, so that a user can configure the audio device through interaction with the computer program. Compared with a method in which the user needs to go to an offline store to configure the audio device, this lowers an operating threshold, improves efficiency in configuring the audio device, and improves auditory experience of the user.
To make objectives, technical solutions, and advantages of the present disclosure clearer, the following describes the present disclosure in further detail with reference to accompanying drawings. The described embodiments are not to be considered as a limitation to the present disclosure. All other embodiments obtained by a person of ordinary skill in the art without creative efforts shall fall within the protection scope of the present disclosure.
In the following description, the term “some embodiments” describes subsets of all suitable embodiments, but it can be understood that “some embodiments” may be the same subset or different subsets of all the suitable embodiments, and can be combined with each other without conflict.
It can be understood that in the embodiments of the present disclosure, for related data such as user information (such as a hearing test result of a user), when the embodiments of the present disclosure are applied to a specific product or technology, permission or consent of the user needs to be obtained, and acquisition, use, and processing of the related data need to comply with related laws, regulations, and standards of related countries and regions.
In the following description, the terms “first/second/ . . . ” are merely for distinguishing between similar objects, and do not represent a specific order of objects. It can be understood that “first/second/ . . . ” may be interchanged where permitted, so that the embodiments of the present disclosure described herein can be implemented in an order other than that illustrated or described herein.
Unless otherwise defined, meanings of all technical and scientific terms used in this specification are the same as those usually understood by a person skilled in the art to which the present disclosure belongs. The terms used in this specification are merely for describing the embodiments of the present disclosure, but are not intended to limit the present disclosure.
Before the embodiments of the present disclosure are further described in detail, nouns and terms in the embodiments of the present disclosure are described, which are explained as follows:
The embodiments of the present disclosure provide an audio signal processing method and apparatus, an electronic device, a computer-readable storage medium, and a computer program product, which can implement efficient and convenient configuration of an audio device. The following describes exemplary application of the electronic device provided in the embodiments of the present disclosure. The electronic device provided in the embodiments of the present disclosure may be implemented as various types of terminal devices such as a notebook computer, a tablet computer, a desktop computer, a set-top box, a mobile device (for example, a mobile phone, a portable music player, a personal digital assistant, a special messaging device, and a portable game device), and an in-vehicle terminal. Alternatively, the electronic device may be implemented as an audio device, or may be implemented by a terminal device and an audio device in cooperation. The audio device may be a power amplifier, a sound box, a multimedia console, a digital mixer, an audio sampling card, a synthesizer, a mid/high frequency speaker, a microphone, an audio card in a notebook computer, an earphone, a hearing aid, or other peripheral audio devices, such as a professional microphone series, a headset, and a public address system.
The following description is provided by using an example that a terminal device and an audio device cooperate to implement the audio signal processing method provided in the embodiments of the present disclosure.
In some embodiments, a client (not shown in
The terminal device 200 may implement, by running a computer program, the audio signal processing method provided in the embodiments of the present disclosure. For example, the computer program may be a native program in an operating system or a software module; may be a native APP, that is, a program that needs to be installed in the operating system to run, for example, various types of clients such as a network conference APP, an instant messaging APP, and an audio-video playing APP; or may be an applet, that is, a program that only needs to be downloaded to a browser environment to run. In short, the computer program may be in any form such as an APP, a module, or a plug-in.
The following continues to describe a structure of the terminal device 200 shown in
The processor 210 may be an integrated circuit chip with a signal processing capability, such as a general-purpose processor, a digital signal processor (DSP) or another programmable logic device, a discrete gate, a transistor logic device, or a discrete hardware component. The general-purpose processor may be a microprocessor or any processor or the like.
The user interface 230 includes one or more output apparatuses 231 that can present media content, including one or more speakers and/or one or more visual displays. The user interface 230 further includes one or more input apparatuses 232, including user interface components that facilitate user inputting, such as a keyboard, a mouse, a microphone, a touchscreen, a camera, and other input buttons and controls.
The memory 250 may be a removable memory, a non-removable memory, or a combination thereof. Exemplary hardware devices include a solid-state memory, a hard disk drive, an optical disc drive, and the like. In some embodiments, the memory 250 includes one or more storage devices physically remote from the processor 210.
The memory 250 includes a volatile memory, a non-volatile memory, or both. The non-volatile memory may be a read-only memory (ROM). The volatile memory may be a random access memory (RAM). The memory 250 described in this embodiment of the present disclosure aims to include any memory of an appropriate type.
In some embodiments, the memory 250 can store data to support various operations. Examples of the data include programs, modules, and data structures or subsets or supersets thereof, as illustrated below.
An operating system 251 includes system programs for processing various basic system services and executing hardware-related tasks, for example, a framework layer, a kernel library layer, and a driver layer, to implement various basic services and process hardware-based tasks.
A network communication module 252 is configured to reach other computing devices via one or more (wired or wireless) network interfaces 220. Exemplary network interfaces 220 include: Bluetooth, wireless compatibility certification (Wi-Fi), USB, and other interfaces.
A presentation module 253 is configured to present information via one or more output apparatuses 231 (for example, a display and a speaker) associated with the user interface 230 (for example, a user interface for operating a peripheral device and displaying content and information).
An input processing module 254 is configured to detect one or more user inputs or interactions from one or more input apparatuses 232 and translate the detected input or interaction.
In some embodiments, the audio signal processing apparatus provided in the embodiments of the present disclosure may be implemented in a software manner.
The following continues to describe a structure of the audio device 300 shown in
The audio signal processing method provided in the embodiments of the present disclosure is specifically described below from the perspective of interaction between the terminal device and the audio device.
Steps performed by the terminal device are specifically performed by various forms of computer programs running on the terminal device, which are not limited to a client, but may alternatively be an operating system, a software module, or a script described above. Therefore, the client is not to be considered as a limitation on the embodiments of the present disclosure. In addition, for ease of description, no specific distinction is made below between the terminal device and the computer program running on the terminal device.
In step 101, a terminal device displays a hearing test control in a human-computer interaction interface.
In some embodiments, a client runs on a terminal device associated with a target object (that is, an object requiring hearing testing, for example, a user A), and a hearing test control, for example, a “start test” button, is displayed in a human-computer interaction interface provided by the client.
In some other embodiments, the terminal device may further perform the following processing before displaying the hearing test control in the human-computer interaction interface: displaying a historical hearing test result of the target object in the human-computer interaction interface in response to existence of the historical hearing test result and the historical hearing test result being within a validity period (for example, 3 months); and transmitting, to an audio device in response to a configuration operation on the audio device, a fourth hearing assistance policy generated according to the historical hearing test result, the fourth hearing assistance policy being applied to the audio device to output a fourth audio signal adapted to the historical hearing test result. This can save time required for the user to test hearing, and further improve efficiency in configuring the audio device.
For example, the validity period is a maximum interval between current time and the last time of testing. The validity period may be manually preset. For example, the validity period may be set to 3 months or half a year.
For example, the historical hearing test result may be a historical hearing test result of the target object obtained from a third-party hearing detection organization, or may be a historical hearing test result that is obtained from hearing testing previously performed by the target object based on an APP and that is obtained from the terminal device locally or a server.
In step 102, the terminal device outputs a first test audio signal in response to a trigger operation on the hearing test control.
In some embodiments, when the terminal device receives a trigger operation of the target object on the hearing test control (for example, the “start test” button) displayed in the human-computer interaction interface, the terminal device may obtain a first test audio signal from a server, or invoke a computing capability of the terminal device to generate a first test audio signal locally in the terminal device based on factors such as a channel, a frequency, and a sound pressure level (SPL), or obtain a first test audio signal from a plurality of test audio signals stored in advance in the terminal device locally, and transmit the first test audio signal to an audio apparatus (for example, a speaker) disposed in the terminal device, and the audio apparatus outputs the first test audio signal. Certainly, the terminal device may alternatively transmit the first test audio signal to an external audio device, and the audio device outputs the first test audio signal.
In some other embodiments, before outputting the first test audio signal, the terminal device may further detect an SPL of an environment in which the target object is located. The terminal device turns to the operation of outputting a first test audio signal in a case that an average SPL of the environment in which the target object is located within a set duration (for example, 2 seconds) is less than an SPL threshold (for example, 40 dB). In this way, before hearing testing, an environment is first detected to ensure that the target object is in a relatively quiet environment, thereby improving accuracy of a subsequent hearing test result.
The SPL threshold may be an average SPL value of environments assessed as quiet environments by a plurality of tested persons. For example, five persons are tested. Assuming that a tested person 1 assesses an environment at an SPL less than 42 dB as a quiet environment, a tested person 2 assesses an environment at an SPL less than 38 dB as a quiet environment, a tested person 3 assesses an environment at an SPL less than 41 dB as a quiet environment, a tested person 4 assesses an environment at an SPL less than 39 dB as a quiet environment, and a tested person 5 assesses an environment at an SPL less than 40 dB as a quiet environment, an average value (that is, 40 dB) of these five SPLs may be used as the SPL threshold.
For example,
Before the three detection steps are completed, the “start test” button 1001 may be in a disabled state. For example, the “start test” button 1001 may be displayed in a gray mode, and a response to a tap operation on the “start test” button 1001 may be shielded. That is, before the detection steps are completed, the user cannot test hearing, thereby ensuring accuracy of subsequent hearing testing. Certainly, the user may alternatively directly test hearing by tapping a “direct test” button 1005 displayed in the human-computer interaction interface 1000, to save time of the user.
In step 103, the terminal device displays a first hearing test result of the target object in response to a feedback operation on the first test audio signal.
In some embodiments, the first hearing test result may include at least one of a hearing parameter and a speech recognition capability parameter, and the first test audio signal may include at least one of the following types of test audio signals: a hearing test audio signal for testing hearing of the target object, and a speech recognition capability test audio signal for testing a speech recognition capability of the target object. Then, the terminal device may perform step 103 in the following manner: generating a hearing parameter of the target object in response to a feedback operation on the hearing test audio signal; generating a speech recognition capability parameter of the target object in response to a feedback operation on the speech recognition capability test audio signal; and displaying a hearing test result including at least one of the hearing parameter and the speech recognition capability parameter.
In some other embodiments, further, the hearing parameter may include a hearing threshold of the target object in each sub-band in an auditory frequency range. For example, the auditory frequency range may be divided into six sub-bands respectively having center frequencies of 250 Hz, 500 Hz, 1000 Hz, 2000 Hz, 4000 Hz, and 8000 Hz according to response characteristics of a human ear to different frequencies. Then, the hearing parameter of the target object may be generated in response to the feedback operation on the hearing test audio signal in the following manner: performing the following processing for any sub-band in the auditory frequency range: displaying an SPL control (configured to indicate an SPL of a currently outputted hearing test audio signal) and the following feedback controls in the human-computer interaction interface: a first feedback control (for example, a “not heard” button) configured to indicate that the hearing test audio signal is not heard by the target object, and a second feedback control (for example, a “heard” button) configured to indicate that the hearing test audio signal is heard by the target object; re-outputting the hearing test audio signal (the hearing test audio signal has a specific duration) in a manner higher than an SPL of current output in response to a trigger operation on the first feedback control; re-outputting the hearing test audio signal in a manner lower than an SPL of current output in response to a trigger operation on the second feedback control; and for any SPL used for current output, determining the any SPL as a hearing threshold of the target object in the sub-band in response to receiving a second trigger operation on the second feedback control at the any SPL.
For example,
Still refer to
The sub-band with the center frequency of 1000 Hz is used as an example for description. The hearing test audio signal is outputted first at an SPL of 30 dB. If a tap operation of the user A on the “heard” button 1008 is received in this case, the SPL of the hearing test audio signal is reduced by 10 dB, that is, the hearing test audio signal is outputted at an SPL of 20 dB. If a tap operation of the user A on the “not heard” button 1007 is received at the SPL of 20 dB, the SPL of the hearing test audio signal is increased by 5 dB, that is, the hearing test audio signal is outputted at an SPL of 25 dB. If a tap operation of the user A on the “not heard” button 1007 is received at the SPL of 25 dB, the SPL of the hearing test audio signal is further increased by 5 dB, that is, the hearing test audio signal is outputted at the SPL of 30 dB. If a tap operation of the user A on the “heard” button 1008 is received again at the SPL of 30 dB, 30 dB may be used as a hearing threshold of the user A in the sub-band with the center frequency of 1000 Hz.
In some other embodiments, the following processing may alternatively be performed for any sub-band in the auditory frequency range: for any SPL used for current output, determining the any SPL as a hearing threshold of the target object in the sub-band in response to receiving a trigger operation of the target object on the second feedback control at the any SPL.
The sub-band with the center frequency of 1000 Hz is used as an example. The hearing test audio signal is sequentially outputted at different SPLs by continuously increasing an SPL. For example, the hearing test audio signal is outputted first at an SPL of 20 dB. If a tap operation of the user A on the “not heard” button 1007 is received in this case, the SPL of the hearing test audio signal is increased by 5 dB, that is, the hearing test audio signal is outputted at an SPL of 25 dB. If a tap operation of the user A on the “not heard” button 1007 is still received at the SPL of 25 dB, the SPL of the hearing test audio signal is further increased by 5 dB, that is, the hearing test audio signal is outputted at an SPL of 30 dB. If a tap operation of the user A on the “heard” button 1008 is received at the SPL of 30 dB, 30 dB may be used as a hearing threshold of the user A in the sub-band with the center frequency of 1000 Hz. This simplifies a process of hearing threshold testing, thereby saving time of the user.
The hearing threshold is not a fixed value, that is, it does not mean that the user absolutely can hear a sound at an SPL, and absolutely cannot hear a sound below this SPL. In fact, it is a gradual transition process from “not heard” to “sometimes heard and sometimes not heard” and to “heard” as sound intensity increases. Therefore, a plurality of hearing threshold tests may be performed for the target object, and an average value of hearing thresholds obtained from the plurality of hearing threshold tests is used as a hearing threshold of the target object, to further improve accuracy of a testing result.
In some embodiments, the hearing parameter may further include a pain threshold of the target object in each sub-band in the auditory frequency range. Then, the hearing parameter of the target object may be generated in response to the feedback operation on the hearing test audio signal in the following manner: performing the following processing for any sub-band in the auditory frequency range: displaying an SPL control (configured to indicate an SPL of a currently outputted hearing test audio signal), a first adjustment control (for example, a slider bar), and a third feedback control (for example, an “ear uncomfortable” button) in the human-computer interaction interface, the third feedback control being configured to indicate that physiological discomfort occurs on the target object in hearing the hearing test audio signal; adjusting the SPL of the currently outputted hearing test audio signal in response to a trigger operation on the first adjustment control; and in response to a trigger operation on the third feedback control, determining an SPL in response to that the trigger operation is received as a pain threshold of the target object in the sub-band.
For example,
In some embodiments, the terminal device may further generate the speech recognition capability parameter of the target object in response to the feedback operation on the speech recognition capability test audio signal in the following manner: displaying a decibel control (configured to indicate a decibel value of a currently outputted speech recognition capability test audio signal) and a plurality of fourth feedback controls in the human-computer interaction interface, each fourth feedback control being corresponding to a pitch; sequentially outputting a plurality of speech recognition capability test audio signals, and recording a fourth feedback control triggered by the target object in the plurality of fourth feedback controls each time the speech recognition capability test audio signal is outputted; and determining a correctness percentage of pitch recognition of the target object based on pitches respectively corresponding to the plurality of speech recognition capability test audio signals and fourth feedback controls respectively triggered by the target object in a plurality of test processes. That is, each time the speech recognition capability test audio signal is outputted, it is determined whether a pitch corresponding to the speech recognition capability test audio signal outputted by the audio device is consistent with a pitch corresponding to the fourth feedback control triggered by the target object. When the pitches are consistent, it is determined that the target object succeeds in pitch recognition, when the pitches are inconsistent, it is determined that the target object fails in pitch recognition, and the determined correctness percentage of pitch recognition is used as the speech recognition capability parameter of the target object.
For example,
For example, the target object is the user A. It is assumed that 10 speech recognition capability test audio signals are sequentially outputted to the user A, pitches respectively corresponding to the 10 speech recognition capability test audio signals are: u/wu, s/si, i/yi, sh/shi, a/a, u/wu, s/si, m/mo, u/wu, i/yi, and fourth feedback controls respectively triggered by the user A in these 10 test processes are: the “u/wu” button 1206, the “sh/shi” button 1207, the “i/yi” button 1204, the “s/si” button 1205, the “a/a” button 1202, “u/wu” button 1206, the “sh/shi” button 1207, the “m/mo” button 1203, the “u/wu” button 1206, and the “i/yi” button 1204. The user A recognizes three pitches incorrectly. Then, it may be determined that a correctness percentage of pitch recognition of the user A is 70%, and the correctness percentage 70% is used as a speech recognition capability parameter of the user A.
In step 104, the terminal device transmits, to the audio device in response to a configuration operation on the audio device, a first hearing assistance policy generated according to the first hearing test result.
In some embodiments, the terminal device may further perform the following processing before transmitting, to the audio device, the first hearing assistance policy generated according to the first hearing test result: determining, in descending order of frequencies, a filter parameter of each sub-band in the auditory frequency range based on the first hearing test result; and performing combination based on the filter parameter of each sub-band, and using an obtained filter bank parameter as the first hearing assistance policy for the target object.
For example, the first hearing test result may include the hearing threshold of the target object in each sub-band in the auditory frequency range. Then, the terminal device may determine, in descending order of frequencies, the filter parameter of each sub-band in the auditory frequency range based on the first hearing test result in the following manner: obtaining a gain value of each sub-band based on the hearing threshold of the target object in each sub-band and a prescription formula (for example, the NAL series prescription formulas or the DSL series prescription formula), for example, the hearing threshold of the target object in each sub-band may be substituted into the prescription formula for calculation to obtain the gain value of the corresponding sub-band; and obtaining, in descending order of frequencies, the filter parameter of each sub-band based on the gain value of each sub-band. In this way, the filter parameter is determined by using a “backward” calculation method, that is, a filter parameter corresponding to a high-frequency sub-band is determined first, and then a filter parameter of a low-frequency sub-band is calculated according to a characteristic of a frequency response that undergoes filtering, so that a desired frequency response curve can be closer, thereby achieving a better gain effect and improving auditory experience of the user.
For example, the auditory frequency range includes N sub-bands (for example, six sub-bands). It is assumed that a sixth sub-band is a sub-band with a center frequency of 8000 Hz, a fifth sub-band is a sub-band with a center frequency of 4000 Hz, a fourth sub-band is a sub-band with a center frequency of 2000 Hz, a third sub-band is a sub-band with a center frequency of 1000 Hz, a second sub-band is a sub-band with a center frequency of 500 Hz, and a first sub-band is a sub-band with a center frequency of 250 Hz, N being an integer greater than 1. Then, the terminal device may obtain, in descending order of frequencies, the filter parameter of each sub-band based on the gain value of each sub-band in the following manner: substituting a gain value of the Ni sub-band into a filter function for calculation to obtain a filter parameter of the Ni sub-band; and determining a filter parameter of an ith sub-band based on a difference between a gain value of the ith sub-band and a frequency response of a filter of an (i+1)th sub-band at the ith sub-band. For example, a filter parameter of the sixth sub-band is first calculated based on a gain value of the sixth sub-band, a filter parameter of the fifth sub-band is then calculated based on a difference between a gain value of the fifth sub-band and a frequency response of a filter of the sixth sub-band at the fifth sub-band, and so on, so that filter parameters respectively corresponding to the six sub-bands can be obtained. A value range of i meets 1≤i≤N−1, and a frequency of the (i+1)th sub-band is greater than a frequency of the ith sub-band.
The first hearing assistance policy may be generated in real time in response to the configuration operation triggered by the target object, or may be generated in advance; and may be generated locally in the terminal device, or may be generated in a server. For example, the terminal device sends the first hearing test result of the target object to the server, and the server generates the first hearing assistance policy. This is not specifically limited in this embodiment of the present disclosure.
In step 105, the audio device outputs a first audio signal adapted to the first hearing test result.
In some embodiments, the audio device may output, in the following manner, the first audio signal adapted to the first hearing test result: controlling, in ascending order of frequencies, a filter of each sub-band in a filter bank to sequentially filter an original audio signal according to a filter parameter of the corresponding sub-band in the filter bank parameter, to obtain the first audio signal adapted to the first hearing test result.
For example, the filter bank parameter is obtained by combining filter parameters of six sub-bands. After the audio device receives the original audio signal, the original audio signal may be filtered by filters of the six sub-bands in ascending order of frequencies, that is, sequentially processed by the six filters from a low frequency to a high frequency, to obtain the first audio signal adapted to the first hearing test result (that is, an audio signal that undergoes personalized equalization). In addition, to prevent a “clipping” phenomenon from occurring in the first audio signal finally outputted and affecting a hearing sense of the target object, before the first audio signal is outputted, dynamic range control (DRC) may be further performed to ensure integrity of the first audio signal.
The DRC means mapping a dynamic range of an input audio signal to a specified dynamic range. The dynamic range after the mapping is usually smaller than the dynamic range before the mapping. Therefore, the DRC is also referred to as dynamic range compression. The DRC provides compression and amplification capabilities, so that a sound can sound softer or louder, which is a signal amplitude adjustment method.
In some other embodiments,
In step 106, the terminal device amplifies the first audio signal according to at least one gain curve, to obtain a second test audio signal of at least one volume.
In some embodiments, the terminal device may further perform the following processing before amplifying the first audio signal according to the at least one gain curve: obtaining characteristic information (for example, an age, a wearing side, and wearing years) of the target object; determining a gain factor of the first audio signal according to the characteristic information of the target object; generating the at least one gain curve according to the hearing parameter (including at least one of the hearing threshold and the pain threshold of the target object in each sub-band in the auditory frequency range) included in the first hearing test result, the gain factor, and the prescription formula, each gain curve being corresponding to a volume, for example, three gain curves may be calculated according to the gain factor, the hearing threshold, and the pain threshold by using the prescription formula, and respectively correspond to a plurality of volumes, including low sound, medium sound, and loud sound, where the plurality of volumes may be obtained through uniform or non-uniform division according to a decibel range of sounds that can be perceived by humans, for example, a decibel value being 0 dB to 20 dB may be defined as low sound, a decibel value being 20 dB to 60 dB may be defined as medium sound, and a decibel value greater than 60 dB may be defined as loud sound; and then performing interpolation processing on each gain curve through frequency band mapping, so that the number of sub-bands of the gain curve is consistent with the number of channels of the filter bank.
For example, when the sub-bands of the gain curve are mapped to the channels of the filter bank, because the number of sub-bands of the gain curve is less than the number of channels of the filter bank, for example, it is assumed that the original number of sub-bands of the gain curve is 5 while the number of channels of the filter bank is 8, interpolation processing needs to be performed on the gain curve. For example, interpolation processing may be performed on the gain curve through linear interpolation or parabolic interpolation, so that the number of sub-bands of the gain curve that undergoes the interpolation processing is increased to 8.
In step 107, the terminal device generates a second hearing test result of the target object in response to a feedback operation on the second test audio signal.
In some embodiments, the terminal device may perform step 107 in the following manner: displaying a second adjustment control (for example, a slider bar), a plurality of fifth feedback controls, and a plurality of volume controls in the human-computer interaction interface, each fifth feedback control being corresponding to a pitch, and a volume represented by a volume control in a selected state being used as a volume used during outputting of the second test audio signal; adjusting a gain of the currently outputted second test audio signal in response to a trigger operation on the second adjustment control; sequentially outputting a plurality of second test audio signals, and recording a fifth feedback control triggered by the target object in the plurality of fifth feedback controls each time the second test audio signal is outputted; and obtaining, based on pitches respectively corresponding to the plurality of second test audio signals and fifth feedback controls respectively triggered by the target object in a plurality of test processes, a pitch incorrectly recognized by the target object. For example, each time the second test audio signal is outputted, it is determined whether a pitch corresponding to the second test audio signal is consistent with a pitch corresponding to the fifth feedback control triggered by the target object. When the pitches are inconsistent, the pitch corresponding to the second test audio signal is determined as the pitch incorrectly recognized by the target object, and the pitch incorrectly recognized by the target object is used as the second hearing test result of the target object.
For example,
In addition, an “inaudible” button 1611 is also displayed in the human-computer interaction interface 1600. When a tap operation of the target object on the “inaudible” button 1611 is received, the second test audio signal may be re-outputted, or the second test audio signal may be re-outputted in a manner higher than a current decibel value.
In step 108, the terminal device transmits a second hearing assistance policy to the audio device.
In some embodiments, the second hearing assistance policy may be obtained by adjusting the first hearing assistance policy according to the second hearing test result. The second hearing test result includes the pitch incorrectly recognized by the target object. Then, the terminal device may further perform the following processing before transmitting the second hearing assistance policy to the audio device: pertinently compensating the first hearing assistance policy according to the pitch incorrectly recognized by the target object, to obtain the second hearing assistance policy, that is, adding, according to the pitch incorrectly recognized by the target object, an adjustment amount of the pitch incorrectly recognized by the target object in the first hearing assistance policy, to obtain the second hearing assistance policy.
For example, a specific process of the pertinent compensation may be: for the pitch incorrectly recognized by the target object, compensating, according to a frequency corresponding to the pitch, a filter parameter of a corresponding sub-band in the filter bank parameter included in the first hearing assistance policy. For example, a filter corresponding to the frequency may be determined from the filter bank according to the frequency corresponding to the pitch, and then a parameter of the filter is compensated. For example, it is assumed that the frequency corresponding to the pitch is 500 Hz. As the center frequency of the third sub-band is exactly 500 Hz, it may be determined that a filter parameter of the third sub-band in the filter bank needs to be compensated, that is, a specific adjustment amount is added so that the target object can perceive the pitch as a compensation target.
The adjustment amount may be fixed or dynamic. For example, the adjustment amount may be a fixed value preset by operation personnel of the APP, that is, a fixed adjustment amount is added each time. Certainly, the adjustment amount may alternatively be dynamically changed according to different error situations, that is, a different adjustment amount may be added each time.
In an example of compensation, assuming that the pitch incorrectly recognized by the target object is “sh/shi”, a filter parameter of a corresponding sub-band in the filter bank parameter included in the first hearing assistance policy may be compensated pertinently according to the pitch “sh/shi” incorrectly recognized by the target object. For example, a volume of the pitch “sh/shi” may be increased so that the target object can clearly hear the pitch. In addition, different error situations may correspond to different compensation, while an adjustment amount of the compensation may be preset without manual adjustment of the user. In step 109, the audio device outputs a second audio signal adapted to the second hearing test result in place of the first audio signal.
In some embodiments, after receiving the second hearing assistance policy transmitted by the terminal device, the audio device may use the second hearing assistance policy in place of the first hearing assistance policy received in step 104. In this way, when an original audio signal is subsequently received, the received original audio signal may be adjusted by using the second hearing assistance policy. For example, the original audio signal may be sequentially filtered from a low frequency to a high frequency based on the pertinently compensated filter bank parameter included in the second hearing assistance policy, to output a second audio signal adapted to the second hearing test result (that is, an audio signal obtained through pitch adjustment based on the first audio signal), thereby further improving auditory experience of the user.
In some embodiments,
In step 110, the terminal device adjusts the second audio signal based on a plurality of candidate hearing sense adjustment policies, to obtain a plurality of third test audio signals.
In some embodiments, the audio device may not output the second audio signal, but directly output a third audio signal obtained by adjusting a hearing sense of the second audio signal. For example, a plurality of different types of candidate hearing sense adjustment policies may be displayed in the human-computer interaction interface of the terminal device for the target object to select. Then, the terminal device may adjust the hearing sense of the second audio signal based on a plurality of hearing sense adjustment policies selected by the target object to obtain a plurality of third test audio signals.
In an example in which the terminal device adjusts the hearing sense of the second audio signal based on a hearing sense adjustment policy, a pitch carried in the hearing sense adjustment policy is first obtained, and then the second audio signal is adjusted through wide dynamic range compression based on a frequency corresponding to the obtained pitch, to obtain a third test audio signal. For example, a frequency of the second audio signal may be reduced, and a corresponding gain value may be adjusted in real time according to sound intensity (for example, a decibel value) of the second audio signal, so that the third test audio signal finally obtained sounds lower than the second audio signal. In addition, each third test audio signal corresponds to a different hearing sense. For example, the hearing sense of the second audio signal may be adjusted by using four different types of hearing sense adjustment policies, to obtain third test audio signals with four different hearing senses: an original hearing sense, higher, lower, and a clearer speech.
The wide dynamic range compression means that as sound intensity of an input audio signal changes, a corresponding gain also changes in real time, so that an amplified audio signal completely falls within a reduced auditory dynamic range of a hearing-impaired user.
In step 111, the terminal device generates a third hearing test result of the target object in response to a feedback operation on the plurality of third test audio signals.
In some embodiments, the third hearing test result may include a preferred hearing sense of the target object. Then, the terminal device may perform step 111 in the following manner: displaying a plurality of sixth feedback controls in the human-computer interaction interface, each sixth feedback control being corresponding to a hearing sense; and sequentially outputting a plurality of third test audio signals in one-to-one correspondence with the plurality of sixth feedback controls, and determining a hearing sense corresponding to a sixth feedback control triggered by the target object in the plurality of sixth feedback controls as the preferred hearing sense of the target object.
For example,
In step 112, the terminal device transmits a third hearing assistance policy to the audio device.
In some embodiments, the third hearing assistance policy may be obtained by adjusting the second hearing assistance policy according to the third hearing test result. Then, the terminal device may further perform the following processing before transmitting the third hearing assistance policy to the audio device: adjusting, according to the preferred hearing sense of the target object, a gain curve included in the second hearing assistance policy. For example, assuming that the preferred hearing sense of the target object is “soft”, the gain curve included in the second hearing assistance policy may be pertinently adjusted based on a factor such as timbre corresponding to “soft”, to obtain the third hearing assistance policy.
In step 113, the audio device outputs a third audio signal adapted to the third hearing test result in place of the second audio signal.
In some embodiments, after receiving the third hearing assistance policy transmitted by the terminal device, the audio device may use the third hearing assistance policy in place of the second hearing assistance policy received in step 108. In this way, when an original audio signal is subsequently received, the received original audio signal may be adjusted by using the third hearing assistance policy, to output a third audio signal adapted to the third hearing test result (that is, an audio signal obtained through hearing sense adjustment based on the second audio signal), thereby further improving auditory experience of the user.
In the audio signal processing method provided in the embodiments of the present disclosure, a solution based on a form of a computer program is provided, which integrates functions of personalized hearing testing and configuring an audio device based on a hearing test result into the computer program. Compared with the fitting session in which the user needs to go to an offline store to configure the audio device, this lowers an operating threshold, improves efficiency in configuring the audio device, and improves auditory experience of the user.
Exemplary application of the embodiments of the present disclosure in a practical application scenario is described below by using an example that the audio device is a hearing aid. The embodiments of the present disclosure provide an autonomous fitting and machine adjusting solution based on an APP form, which integrates comprehensive functions of personalized audiometry and convenient autonomous fitting, to improve auditory experience of a hearing-impaired user in using a hearing aid.
The audio signal processing method provided in the embodiments of the present disclosure is specifically described below.
For example,
The following continues to describe the “autonomous fitting” part shown in
In some embodiments, after wearing a hearing aid, the user connects the hearing aid to the APP on a mobile phone through Bluetooth. Then, the user may select a hearing file. After the hearing aid is started, a parameter of the hearing aid is updated, and a basic hearing assistance function (corresponding to the first hearing assistance policy) takes effect. Next, the user may further update the parameter of the hearing aid through a pitch adjustment part designed in the APP, and a first enhanced hearing assistance function (corresponding to the second hearing assistance policy) takes effect. Subsequently, the user may further update the parameter of the hearing aid through a hearing sense adjustment part designed in the APP, and a second enhanced hearing assistance function (corresponding to the third hearing assistance policy) takes effect.
As can be seen from
For ease of expression, main constants of a quantitative description part in the embodiments of the present disclosure are as follows:
A sampling rate of a speech signal is 16000 Hz.
A frame length is 20 ms, that is, the number of samples per frame is 320 points.
If overlap-add time-frequency transform, for example, short-term Fourier transform (STFT), is performed, there is 50% overlapping. Therefore, a hop size is 320 points, and 640-point discrete Fourier transform (DFT) is performed. The hop size is the number of samples staggered between two adjacent windows.
The following first describes the personalized audiometry part.
A first part of the personalized audiometry is pure tone hearing threshold testing and pain threshold testing.
In some embodiments,
In some embodiments, the preparation before testing mainly includes environmental sound detection (for example, the “select a quiet environment” control 1002 shown in
In some embodiments,
In some embodiments, during pain threshold testing, to reduce a test time of the user, an initial value of the pain threshold testing may be set to a value x dB higher than the hearing threshold. For example, a value of x may be 30 dB, which may be determined according to the hearing threshold, and when the hearing threshold is higher than a threshold (for example, 60 dB), the value of x may be appropriately reduced. In addition, a protection mechanism may be added for the pain threshold testing. For example, when an SPL of a currently outputted test audio signal is higher than 75 dB, an adjustment step of the user may be forcibly reduced to prevent a sudden volume increase from damaging hearing of the user.
For example,
In some embodiments, after a hearing threshold and a pain threshold of the user in each sub-band in the auditory frequency range are obtained through hearing threshold testing and pain threshold testing, an audiogram of the user may be generated based on the hearing threshold and the pain threshold and saved. In addition, the audiogram may be further presented, or a related result and suggestion may be given according to the audiogram.
The following continues to describe a second part of the personalized audiometry, that is, a process of pitch testing.
For a hearing-impaired user wearing a hearing aid, a main scenario is to communicate with people. Therefore, pitch testing is mainly intended to evaluate a speech recognition rate of the user without hearing assistance means. Chinese is used as an example. Main pitches of Chinese are limited, and in the embodiments of the present disclosure, the following six pitches are used for combination: a/a, i/yi, u/wu, m/mo, s/si, and sh/shi. Certainly, an extension may be further performed on this basis, for example, h/he. For ease of description, only the six pitches are used herein. Certainly, there may be more other combinations. This is not limited in the embodiments of the present disclosure.
For example,
The following describes a process of “autonomous fitting”.
A first part of the autonomous fitting is to obtain basic hearing for the hearing-impaired user through the hearing aid according to the audiogram obtained through the personalized audiometry.
In some embodiments, a solution of the basic hearing assistance function is calculating a gain of each frequency band according to a personalized hearing status of the user (for example, by loading the audiogram), and then compensating for hearing loss of the user in a fullband range by using different gains of frequency bands, to improve a perception capability of the user in a hearing loss band, thereby improving speech intelligibility of the user and meeting a need of daily communication. The following specifically describes the solution of the basic hearing assistance function.
First, the audiogram (that is, a hearing test result obtained through personalized hearing testing) of the user is loaded. As described above, the audiogram may be obtained by the user through a personalized audiometry module in the APP provided in the embodiments of the present disclosure, or may be obtained from a third party through direct input (for example, an accurate pure tone audiogram is obtained from a professional organization). This is not limited in the embodiments of the present disclosure.
Then, a gain value of each frequency band is calculated according to the audiogram and a prescription formula. Herein, the prescription formula is a formula for determining the gain value of each frequency band according to a hearing threshold in the frequency band. A prescription formula given in Table 1 is used as an example. TH represents a corresponding hearing threshold, and G represents a calculated gain. The formula in Table 1 is a non-linear prescription formula, and the gain value of each frequency band may be calculated according to an SPL of an input audio signal and the hearing threshold. A specific gain calculation process is as follows: First, the SPL of the input audio signal is calculated. Then, sound intensity is determined according to the SPL of the input audio signal, to determine a gain interval of the formula in Table 1. The SPL being below 40 dB indicates a low-intensity sound, the SPL being 40 dB to 65 dB indicates a comfort-zone sound, and the SPL being 65 dB to 90 dB indicates a high-intensity sound. Subsequently, the hearing threshold in each frequency band is determined, and is substituted into the prescription formula given in Table 1 for calculation, to obtain a gain value of a current frequency band.
In the embodiments of the present disclosure, a gain outputted by the prescription formula is limited, that is, to ensure that strength of an equalized audio signal does not cause further loss to hearing of the user, when a sum of the SPL of the input audio signal and the gain value exceeds a pain threshold of the user, a part of the gain exceeding the pain threshold is removed.
Further, gain values respectively corresponding to the six sub-bands are substituted into a filter function for calculation, to obtain the filter bank parameter. The filter bank may include a shelving filter and a peak filter. The shelving filter may include a low shelf filter and a high shelf filter. The low shelf filter is characterized in that a high-frequency part is straight-through and a low-frequency part is adjustable (that is, it may be configured to adjust a gain of a low-frequency sub-band. The high shelf filter is characterized in that a low-frequency part is straight-through and a high-frequency part is adjustable (that is, it may be configured to adjust a gain of a high-frequency sub-band. The peak filter is located between the low shelf filter and the high shelf filter, and configured to raise a center frequency response, and adjust a gain of an intermediate sub-band.
Further, in the embodiments of the present disclosure, “backward” filter parameter calculation is performed, that is, a filter parameter corresponding to a high-frequency sub-band is first determined, and then a filter parameter of a low-frequency sub-band is calculated according to a characteristic of a frequency response that undergoes filtering, to obtain filter parameters level by level.
For example,
Finally, the input audio signal is filtered for equalization to obtain an output audio signal (corresponding to the first audio signal). The input original audio signal is sequentially filtered by filters (from a low frequency to a high frequency), and the output audio signal obtained is an equalized audio signal.
For example,
For example, it is assumed that the calculated SPL is spl=60 dB, and hearing thresholds corresponding to the left ear in the six sub-bands are pta_L=[30 35 35 40 45 45]. First, a corresponding gain interval is determined according to the SPL. Refer to Table 1. The input SPL of 60 dB corresponds to the comfort-zone sound. Then, a corresponding gain may be calculated according to the hearing threshold in each sub-band. Herein, the first sub-band is used as an example. The hearing threshold is 30 dB, corresponding to a formula (2) of the comfort-zone sound interval, and is substituted to calculate g[1]=0.6×(30−20)=6 dB. Similarly, gain values of the other five sub-bands may be calculated sequentially.
Subsequently, when filter parameters are calculated based on the gain values g, in this embodiment of the present disclosure, a “backward” calculation method is used to get closer to a desired response curve. That is, a parameter of a high shelf filter is first calculated according to a gain value g[6] of the sixth sub-band, and then a filter parameter of the fifth sub-band is calculated according to a difference g′[5] between a gain value g[5] of the fifth sub-band and a frequency response h_6[5] of the filter at the fifth sub-band, and so on, to obtain a parameter aijbij of the entire filter bank. Details are not described herein in this embodiment of the present disclosure.
After the filter bank parameter is obtained, the input original audio signal may be sequentially processed by six filters from a low frequency to a high frequency, to obtain an audio signal s′(n) that undergoes personalized equalization. The filtering operation is embodied as convolution in time domain and corresponding frequency bin multiplication in frequency domain, which are basic operations of signal processing. Details are not described herein in this embodiment of the present disclosure.
Finally, to prevent a “clipping” phenomenon from occurring in the output audio signal and affecting a hearing sense, in this embodiment of the present disclosure, a DRC module is further added after output equalization, to protect integrity of the audio signal.
The following continues to describe a second part of the autonomous fitting, that is, a process of pitch adjustment.
The pitch adjustment is intended to use, on the basis of implementing the basic hearing assistance function, the pitch adjustment part designed in the APP, that is, perform pitch testing in real time after the hearing-impaired user wears the hearing aid, to finely adjust a parameter according to a testing result, and update an adjusted parameter to the hearing aid by using the Bluetooth protocol, so as to improve the hearing sense of the user (that is, implement the first enhanced hearing assistance function).
In some embodiments,
The pitch adjustment is intended to play any pitch and adjust perception of the user on the pitch at different frequencies according to feedback of the user, which can help improve a speech recognition rate. In addition, an interaction mode of pitch adjustment provided in the embodiments of the present disclosure is also different from that of fitting provided in other fitting solutions. In the other fitting solutions, autonomous adjustment is generally performed in three stages, which does not have an enough channel resolution, but has a high professional requirement, leading to a high operating threshold. In contrast, as shown in
The following describes a third part of the autonomic fitting, that is, a process of hearing sense adjustment.
A main process of hearing sense adjustment is using, on the basis of implementing the basic hearing assistance function and the first enhanced hearing assistance function, the hearing sense adjustment part designed in the APP, to perform hearing sense testing in real time after the hearing-impaired user wears the hearing aid, to finely adjust a parameter according to a hearing sense testing result, and update an adjusted parameter to the hearing aid by using the Bluetooth protocol, so as to improve a hearing sense of the user (that is, implement the second enhanced hearing assistance function).
For example,
For example,
The following describes, by comparing the other fitting schemes with the audio signal processing method provided in the embodiments of the present disclosure, technical effects brought by the embodiments of the present disclosure.
In other fitting schemes, first, hearing aids as professional equipment usually need face-to-face communication with audiologists in offline stores for fitting, which leads to poor timeliness. In addition, in the other fitting schemes, hearing is generally assisted based on an audiometry result by using a general prescription formula. However, considering uniqueness of each person's hearing, it is necessary to implement more personalized hearing assistance based on user feedback. In addition, in the aspect of autonomous fitting, another fitting solution is to perform adjustment by stages, and directly provide a gain adjustment interface of each frequency band to a user for the user to perform adjustment. However, this adjustment manner has a high professional requirement and a high operating threshold. In addition, when the user cannot control an adjustment amount well, a hearing assistance effect is reduced instead.
Embodiments of the present disclosure provide fitting solutions with the following improvement directions:
In view of this, in the embodiments of the present disclosure, comprehensive functions of personalized audiometry and convenient autonomous fitting are integrated into an APP, so that a user can test hearing only by interacting with the APP, thereby meeting a requirement of the user for hearing testing at any time. In addition, during configuration on a hearing aid, in the embodiments of the present disclosure, a corresponding hearing assistance policy is generated based on a personalized hearing test result of the user, so that the generated hearing assistance policy can better meet a personalized requirement of the user. Further, in the solution provided in the embodiments of the present disclosure, during pitch adjustment, compensation may be performed pertinently according to a feedback result of the user, where the user only needs to feed back a recognition status through a button, thereby lowering an operating threshold for the user, making the adjustment process more convenient and quick, and improving user experience.
The following continues to describe an exemplary structure in which the audio signal processing apparatus 255 provided in the embodiments of the present disclosure is implemented as software modules. In some embodiments, as shown in
The display module 2551 is configured to display a hearing test control in a human-computer interaction interface. The output module 2552 is configured to output a first test audio signal in response to a trigger operation on the hearing test control. The display module 2551 is further configured to display a first hearing test result of a target object in response to a feedback operation on the first test audio signal. The transmitting module 2553 is configured to transmit, to an audio device in response to a configuration operation on the audio device, a first hearing assistance policy generated according to the first hearing test result, where the first hearing assistance policy is configured to be applied to the audio device to output a first audio signal adapted to the first hearing test result.
In some embodiments, the first hearing test result includes at least one of a hearing parameter and a speech recognition capability parameter, and the first test audio signal includes at least one of the following types of test audio signals: a hearing test audio signal for testing hearing of the target object, and a speech recognition capability test audio signal for testing a speech recognition capability of the target object. The audio signal processing apparatus 255 further includes a generation module 2554, configured to generate a hearing parameter of the target object in response to a feedback operation on the hearing test audio signal; and generate a speech recognition capability parameter of the target object in response to a feedback operation on the speech recognition capability test audio signal. The display module 2551 is further configured to display a hearing test result of the target object, the hearing test result including at least one of the hearing parameter and the speech recognition capability parameter.
In some embodiments, the hearing parameter includes a hearing threshold of the target object in each sub-band in an auditory frequency range. The generation module 2554 is further configured to perform the following processing for any sub-band in the auditory frequency range: displaying a first feedback control and a second feedback control in the human-computer interaction interface, the first feedback control being configured to indicate that the hearing test audio signal is not heard, and the second feedback control being configured to indicate that the hearing test audio signal is heard; re-outputting the hearing test audio signal in a manner higher than an SPL of current output in response to a trigger operation on the first feedback control; re-outputting the hearing test audio signal in a manner lower than an SPL of current output in response to a trigger operation on the second feedback control; and for any SPL used for current output, determining the any SPL as a hearing threshold of the target object in the sub-band in response to receiving a second trigger operation on the second feedback control at the any SPL.
In some embodiments, the display module 2551 is further configured to perform the following processing during the displaying of the first feedback control and the second feedback control in the human-computer interaction interface: displaying an SPL control in the human-computer interaction interface, the SPL control being configured to indicate an SPL of the currently outputted hearing test audio signal.
In some embodiments, the hearing parameter includes a pain threshold of the target object in each sub-band in an auditory frequency range. The generation module 2554 is further configured to perform the following processing for any sub-band in the auditory frequency range: displaying a first adjustment control and a third feedback control in the human-computer interaction interface, the first adjustment control being used for adjusting an SPL, and the third feedback control being configured to indicate that physiological discomfort occurs in hearing the hearing test audio signal; and in response to a trigger operation on the third feedback control, determining an SPL in response to that the trigger operation is received as a pain threshold of the target object in the sub-band.
In some embodiments, the display module 2551 is further configured to display a plurality of fourth feedback controls in the human-computer interaction interface, each fourth feedback control being corresponding to a pitch. The output module 2552 is further configured to sequentially output a plurality of speech recognition capability test audio signals. The audio signal processing apparatus 255 further includes a recording module 2555, configured to record a fourth feedback control triggered in the plurality of fourth feedback controls each time the speech recognition capability test audio signal is outputted. The generation module 2554 is further configured to generate the speech recognition capability parameter of the target object based on pitches respectively corresponding to the plurality of speech recognition capability test audio signals and fourth feedback controls respectively triggered in a plurality of test processes.
In some embodiments, the display module 2551 is further configured to perform the following processing during the displaying of the plurality of fourth feedback controls in the human-computer interaction interface: displaying a decibel control in the human-computer interaction interface, the decibel control being configured to indicate a decibel value of the currently outputted speech recognition capability test audio signal.
In some embodiments, the audio signal processing apparatus 255 further includes a detection module 2556 and a turn module 2557. The detection module 2556 is configured to: before the first test audio signal is outputted, detect an SPL of an environment in which the target object is located. The turn module 2557 is configured to turn to the operation of outputting the first test audio signal in response to that an average SPL of the environment within a set duration is less than an SPL threshold.
In some embodiments, the audio signal processing apparatus 255 further includes a determining module 2558 and a combination module 2559. The determining module 2558 is configured to: before the first hearing assistance policy generated according to the first hearing test result is transmitted to the audio device, determine, in descending order of frequencies of each sub-band in an auditory frequency range, a filter parameter of each sub-band based on the first hearing test result, a filter parameter of a low-frequency sub-band being determined based on a filter parameter of a high-frequency sub-band. The combination module 2559 is configured to perform combination on the filter parameter of each sub-band, and use a combined filter bank parameter as the first hearing assistance policy for the target object.
In some embodiments, the first hearing test result includes a hearing threshold of the target object in each sub-band. The determining module 2558 is further configured to obtain a gain value of each sub-band based on the hearing threshold of the target object in each sub-band and a prescription formula; and obtain, in descending order of frequencies, the filter parameter of each sub-band based on the gain value of each sub-band.
In some embodiments, the auditory frequency range includes N sub-bands, N being an integer greater than 1. The determining module 2558 is further configured to substitute a gain value of the Nth sub-band into a filter function for calculation to obtain a filter parameter of the Ni sub-band; and determine a filter parameter of an it sub-band based on a difference between a gain value of the it sub-band and a frequency response of a filter of an (i+1)th sub-band at the it sub-band, a value range of i meeting 1≤i≤N−1, and a frequency of the (i+1)th sub-band being greater than a frequency of the it sub-band.
In some embodiments, the determining module 2558 is further configured to amplify the first audio signal according to at least one gain curve, to obtain a second test audio signal of at least one volume. The generation module 2554 is further configured to generate a second hearing test result of the target object in response to a feedback operation on the second test audio signal. The transmitting module 2553 is further configured to transmit a second hearing assistance policy to the audio device, the second hearing assistance policy being obtained by adjusting the first hearing assistance policy according to the second hearing test result, and being applied to the audio device to output a second audio signal adapted to the second hearing test result in place of the first audio signal.
In some embodiments, the display module 2551 is further configured to display a second adjustment control and a plurality of fifth feedback controls in the human-computer interaction interface, the second adjustment control being used for adjusting a gain of the second test audio signal, and each fifth feedback control being corresponding to a pitch. The output module 2552 is further configured to sequentially output a plurality of second test audio signals. The recording module 2555 further configured to record a fifth feedback control triggered in the plurality of fifth feedback controls each time the second test audio signal is outputted. The generation module 2554 is further configured to generate the second hearing test result of the target object based on pitches respectively corresponding to the plurality of the second test audio signals and fifth feedback controls respectively triggered in a plurality of test processes.
In some embodiments, the display module 2551 is further configured to perform the following processing during the displaying of the second adjustment control and the plurality of fifth feedback controls in the human-computer interaction interface: displaying a plurality of volume controls in the human-computer interaction interface, a volume represented by a volume control in a selected state being used as a volume used during outputting of the second test audio signal.
In some embodiments, the second hearing test result includes a pitch incorrectly recognized by the target object. The audio signal processing apparatus 255 further includes a compensation module 25510, configured to: before the second hearing assistance policy is transmitted to the audio device, pertinently compensate the first hearing assistance policy according to the pitch incorrectly recognized by the target object, to obtain the second hearing assistance policy.
In some embodiments, the determining module 2558 is further configured to determine a gain factor of the first audio signal according to characteristic information of the target object before the first audio signal is amplified according to the at least one gain curve. The generation module 2554 is further configured to generate the at least one gain curve according to the hearing parameter included in the first hearing test result, the gain factor, and the prescription formula, each gain curve being corresponding to a volume, and the hearing parameter including at least one of the hearing threshold and the pain threshold of the target object in each sub-band in the auditory frequency range. The audio signal processing apparatus 255 further includes an interpolation module 25511, configured to perform interpolation processing on each gain curve through frequency band mapping, so that the number of sub-bands of the gain curve is consistent with the number of channels of a filter bank.
In some embodiments, the audio signal processing apparatus 255 further includes an adjustment module 25512, configured to respectively adjust the second audio signal based on a plurality of candidate hearing sense adjustment policies, to correspondingly obtain a plurality of third test audio signals. The generation module 2554 is further configured to generate a third hearing test result of the target object in response to a feedback operation on the plurality of third test audio signals. The transmitting module 2553 is further configured to transmit a third hearing assistance policy to the audio device, the third hearing assistance policy being obtained by adjusting the second hearing assistance policy according to the third hearing test result, and being applied to the audio device to output a third audio signal adapted to the third hearing test result in place of the second audio signal.
In some embodiments, the third hearing test result includes a preferred hearing sense of the target object. The display module 2551 is further configured to display a plurality of sixth feedback controls in the human-computer interaction interface, each sixth feedback control being corresponding to a hearing sense. The output module 2552 is further configured to sequentially output a plurality of third test audio signals in one-to-one correspondence with the plurality of sixth feedback controls. The determining module 2558 is further configured to determine a hearing sense corresponding to a sixth feedback control triggered in the plurality of sixth feedback controls as the preferred hearing sense of the target object.
In some embodiments, the adjustment module 25512 is further configured to: before the third hearing assistance policy is transmitted to the audio device, adjust, according to the preferred hearing sense of the target object, a gain curve included in the second hearing assistance policy, to obtain the third hearing assistance policy.
In some embodiments, the display module 2551 is further configured to display a historical hearing test result of the target object in the human-computer interaction interface in response to existence of the historical hearing test result and the historical hearing test result being within a validity period. The transmitting module 2553 is further configured to transmit, to the audio device in response to a configuration operation on the audio device, a fourth hearing assistance policy generated according to the historical hearing test result, the fourth hearing assistance policy being applied to the audio device to output a fourth audio signal adapted to the historical hearing test result.
In some other embodiments, as shown in
The following continues to describe an exemplary structure in which the audio signal processing apparatus 355 provided in the embodiments of the present disclosure is implemented as software modules. In some embodiments, as shown in
The receiving module 3551 is configured to receive a first hearing assistance policy for a target object, the first hearing assistance policy including a filter bank parameter, the filter bank parameter including a filter parameter of each sub-band in an auditory frequency range, the filter parameter of each sub-band being determined in descending order of frequencies based on a first hearing test result of the target object, and a filter parameter of a low-frequency sub-band being determined based on a filter parameter of a high-frequency sub-band. The output module 3552 is configured to output, according to the first hearing assistance policy, a first audio signal adapted to the first hearing test result.
In some embodiments, the receiving module 3551 is further configured to receive a second hearing assistance policy for the target object, the second hearing assistance policy being obtained by adjusting the first hearing assistance policy according to a second hearing test result, the second hearing test result being obtained based on a feedback operation of the target object on a second test audio signal, and the second test audio signal being obtained by amplifying the first audio signal according to a gain curve. The output module 3552 is further configured to output, according to the second hearing assistance policy, a second audio signal adapted to the second hearing test result in place of the first audio signal.
In some embodiments, the receiving module 3551 is further configured to receive a third hearing assistance policy for the target object, the third hearing assistance policy being obtained by adjusting the second hearing assistance policy according to a third hearing test result, the third hearing test result being obtained based on a feedback operation of the target object on a plurality of third test audio signals, and the plurality of third test audio signals being obtained by adjusting the second audio signal based on a plurality of candidate hearing sense adjustment policies. The output module 3552 is further configured to output, according to the third hearing assistance policy, a third audio signal adapted to the third hearing test result in place of the second audio signal.
In some embodiments, the output module 3552 is further configured to control, in ascending order of frequencies, a filter of each sub-band in a filter bank to sequentially filter an original audio signal according to a filter parameter of the corresponding sub-band in the filter bank parameter, to obtain the first audio signal adapted to the first hearing test result.
The description of the apparatus in the embodiments of the present disclosure is similar to the description of the method embodiment, and the apparatus has beneficial effects similar to those of the method embodiment, and therefore details are not described again. Technical details that are not explained in detail in the audio signal processing apparatus provided in the embodiments of the present disclosure can be understood with reference to the description of any one of
An embodiment of the present disclosure provides a computer program product or a computer program, the computer program product or the computer program including a computer instruction, the computer instruction being stored in a computer-readable storage medium, A processor of a computer device reads the computer instruction from the computer-readable storage medium, and the processor executes the computer instruction, to cause the computer device to execute the audio signal processing method in the embodiments of the present disclosure.
An embodiment of the present disclosure provides a computer-readable storage medium storing an executable instruction, when executed by a processor, causing the processor to execute the audio signal processing method provided in the embodiments of the present disclosure, for example, the audio signal processing method shown in any one of
In some embodiments, the computer-readable storage medium may be an FRAM, a ROM, a PROM, an EPROM, an EEPROM, a flash memory, a magnetic surface storage, an optical disc, a CD-ROM, or other memories; or may be various devices including one or any combination of the memories.
In some embodiments, the executable instruction may be in a form of a program, software, a software module, a script, or code, written in any form of programming language (including compiling or interpreting languages, or declarative or procedural languages), and may be deployed in any form, including being deployed as a stand-alone program, a module, a component, a subroutine, or other units suitable for use in a computing environment.
In an example, the executable instruction may but not necessarily correspond to a file in a file system, and may be stored as a part of a file storing other programs or data, for example, stored in one or more scripts in a hypertext markup language (HTML) file, stored in a single file specific to a program discussed, or stored in a plurality of collaborative files (for example, a file storing one or more modules, subroutines, or portions of code).
In an example, the executable instruction may be deployed on one electronic device, a plurality of electronic devices at one location, or a plurality of electronic devices distributed at a plurality of locations and interconnected through a communication network for execution.
The foregoing descriptions are merely exemplary embodiments of the present disclosure and are not intended to limit the protection scope of the present disclosure. Any modification, equivalent replacement, improvement, and the like made without departing from the spirit and range of the present disclosure shall fall within the protection scope of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
202210771358.9 | Jun 2022 | CN | national |
This application is a continuation application of PCT Patent Application No. PCT/CN2023/090030, filed on Apr. 23, 2023, which claims priority to Chinese Patent Application No. 202210771358.9, filed on Jun. 30, 2022, both of which are incorporated herein by reference in their entirety.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2023/090030 | Apr 2023 | WO |
Child | 18647430 | US |