 
                 Patent Application
 Patent Application
                     20250054480
 20250054480
                    The present invention relates to an active noise control technique.
An active noise control technique is a technique which suppresses noise at specific locations such as a user's surroundings and is incorporated into vehicles, for example, to suppress sounds from the outside of the vehicle so that the user can hear desired sounds such as phone calls and music while in the vehicle. In active noise control techniques, systems which include a reference microphone which picks up noise, an error microphone which picks up sound at a user's position, a noise suppression device which generates an erasure sound signal for eliminating noise from a noise signal which is an output of the reference microphone and an error signal which is an output of the error microphone, and a loudspeaker which emits sound based on an erasure sound signal (hereafter referred to as “erasure sound”) are commonly used (Refer to NPL 1). In this system, the operation of determining what kind of erasure sound needs to be emitted from the loudspeaker to suppress noise is repeatedly performed while measuring how much the noise picked up by the reference microphone is suppressed by an error microphone installed near the user.
  
However, if the sound from the outside is fully cut off, an inconvenient situation may occur for the user. For example, if the siren of an emergency vehicle is also cut off while driving, a situation in which the driver cannot notice the approach of the emergency vehicle or is late in noticing it may arise. Thus, it is preferable to allow the user to be aware of sounds necessary to recognize the external situation rather than blocking out all external sounds.
Therefore, an object of the present invention is to provide a noise suppression technique which allows a user to recognize that an event has occurred when a specific event that the user needs to be aware of occurs even during the process of suppressing noise around the user.
An aspect of the present invention allows a user to recognize that an event has occurred when the noise around the user includes a sound originating from a predetermined event (hereinafter referred to as “event sound”).
According to the present invention, even during execution of processing for suppressing noise around the user, when a specific event that the user needs to recognize occurs, it is possible to make the user aware of the occurrence of the event.
    
    
    
    
    
Embodiments of the present invention will be described in detail below. Note that constituent elements having the same function will be denoted by the same reference numerals and redundant description thereof will be omitted.
A noise suppression device 100 allows a user to recognize that an event has occurred when the noise around the user includes a sound originating from a predetermined event (hereinafter referred to as “event sound”).
The noise suppression device 100 will be described below with reference to 
The noise suppression device 100 is connected to one or more sensors (not shown) to acquire data used for identifying a situation around the user (hereinafter referred to as “sensor data”). For example, a microphone, a camera, or a vibration detection sensor can be used as the sensor. That is to say, the situation around the user is identified using the sounds, images, and vibrations around the user. Also, the noise suppression device 100 is connected to a situation presentation device (not shown) to make the user aware that an event has occurred. As the situation presentation device, for example, a loudspeaker, a display, or a mobile terminal such as a mobile phone or smartphone can be used. That is to say, the user is made to recognize that an event has occurred using sound, images, and vibrations.
The noise suppression device 100 is connected to one or more microphones (not shown, hereinafter referred to as “reference microphones”) to capture the noise around the user and one or more microphones (not shown, hereinafter referred to as “error microphones”) to acquire the sound heard by the user. Also, the noise suppression device 100 is connected to one or more loudspeakers (not shown) to emit a sound based on an erasure sound signal, that is, the erasure sound.
An operation of the noise suppression device 100 will be described according to 
In S110, the situation identification result generation part 110 receives sensor data obtained using a sensor and uses sensor data to generate and output a situation identification result that indicates whether an event is occurring around the user. For example, when the sensor data is a railroad crossing sound, the situation identification result generation part 110 uses an existing speech recognition technique to generate a situation identification result indicating to the user that a train is approaching the railroad crossing. Also, for example, when the sensor data is a video or image of a railroad crossing blocking, the situation identification result generation part 110 uses an existing image recognition technique to generate a situation identification result that indicates to the user that a train is approaching a railroad crossing. Furthermore, for example, when the vibration detection sensor detects sudden vibration, the situation identification result generation part 110 generates a situation identification result indicating to the user that there is a possibility of collision with an obstacle.
The situation identification result generation part 110 calculates, for example, a value indicating a degree of correlation between a reference sound for determining a sound originating from a predetermined event and noise around the user, generates a situation identification result indicating that an event has occurred around the user when the value is greater than a predetermined threshold value or the value is greater than or equal to the predetermined threshold value, and generates a situation identification result indicating that an event has not occurred around the user in other cases. Note that the reference sound may be recorded in the recording part 190 in advance.
In S120, the notification signal generation part 120 receives the situation identification result generated in S110, and generates and outputs a notification signal that notifies the user of the occurrence of an event when the situation identification result indicates that an event has occurred around the user. The situation presentation device receives the notification signal generated in S120 and presents the notification content to the user on the basis of the notification signal. When the situation presentation device is a loudspeaker, the notification signal is a notification sound signal which informs the user that an event occurs. Furthermore, when the situation presentation device is a display, the notification signal is a signal of a notification video which notifies the user that an event has occurred. When the situation presentation device is a mobile terminal, the notification signal is a notification vibration signal which notifies the user that an event has occurred.
In S130, the erasure sound signal generation part 130 receives, as inputs, the user's ambient noise signal obtained using one or more reference microphones and the user's listening sound signal obtained using one or more error microphones and generates an erasure sound signal for erasing noise from the noise signal and the listening sound signal and outputs it. The loudspeaker receives the erasure sound signal generated in S130 and emits sound based on the erasure sound signal.
Note that the erasure sound signal generation part 130 can also generate the erasure sound signal without using the noise signal. In this case, in S130, the erasure sound signal generation part 130 receives, as inputs, the user's listening sound signal acquired using one or more error microphones and generates and outputs an erasure sound signal for erasing noise from the listening sound signal.
According to an embodiment of the present invention, even during execution of processing for suppressing noise around the user, when a specific event that the user needs to recognize occurs, it is possible to make the user aware of the occurrence of the event.
A noise suppression device 200 allows a user to recognize that an event has occurred when the noise around the user includes a sound originating from a predetermined event (hereinafter referred to as “event sound”).
The noise suppression device 200 will be described below with reference to 
As in the noise suppression device 100, the noise suppression device 200 is connected to one or more sensors (not shown) to acquire data used for identifying a situation around the user (hereinafter referred to as “sensor data”).
As in the noise suppression device 100, the noise suppression device 200 is connected to one or more microphones (not shown, hereinafter referred to as “reference microphones”) to acquire the noise around the user and one or more microphones (not shown, hereinafter referred to as “error microphones”) to acquire the user's listening sound. Furthermore, as in the noise suppression device 100, the noise suppression device 200 is connected to one or more loudspeakers (not shown) to emit a sound based on the erasure sound signal, that is, an erasure sound.
An operation of the noise suppression device 200 will be described with reference to 
In S210, the situation identification result generation part 210 receives sensor data obtained using a sensor and uses sensor data to generate and output a situation identification result that indicates the likelihood that an event occurs around the user. The value of the situation identification result may be a binary value of a value indicating that an event has occurred around the user and a value indicating that an event has not occurred and may be a value that indicates the degree of correlation between the reference sound for determining the sound originating from a predetermined event and the noise around the user. Note that the reference sound may be recorded in the recording part 290 in advance. When the value of the situation identification result is the above binary value, the situation identification result generation part 210 calculates, for example, a value indicating a degree of correlation between a reference sound for determining a sound originating from a predetermined event and noise around the user, generates a situation identification result indicating that an event occurs around the user when the value is greater than a predetermined threshold value or the value is greater than or equal to the predetermined threshold value, and generates a situation identification result indicating that an event does not occur around the user in other cases. Also, when the value of the situation identification result is a value that indicates a degree of correlation, the situation identification result generation part 210 calculates, for example, a value indicating a degree of correlation between a reference sound for determining a sound originating from a predetermined event and noise around the user and uses the value indicating the degree of correlation as the situation identification result.
In S220, the adjustment information generation part 220 receives the situation identification result generated in S210 and generates information (hereinafter referred to as “adjustment information”) regarding the relative magnitude of the event sound erasure sound for erasing the event sound with respect to the erasure sound for erasing the noise determined in accordance with the value of the situation identification result. When the value of the situation identification result is a binary value indicating that an event has occurred around the user and a value that indicates that it has not occurred, the adjustment information generation part 220 generates, as adjustment information, information indicating that the event sound erasure sound needs to be excluded from the erasure sounds when the value of the situation identification result is a value indicating that an event occurs around the user. Also, when the value of the situation identification result is a value indicating a degree of correlation between the reference sound for determining the sound originating from a predetermined event and the noise around the user, the adjustment information generation part 220 generates, as adjustment information, information indicating that the greater the value of the situation identification result, the greater the proportion of excluding the event sound erasure sound from the erasure sound.
For example, when the sensor data is a video or an image for determining whether the railroad crossing is blocked, the situation identification result generation part 210 uses an existing image recognition technique to generate a situation identification result that is a binary value of a value indicating to the user that a train is approaching the railroad crossing and a value indicating to the user that the train is not approaching the railroad crossing. When the situation identification result is a value that indicates to the user that a train is approaching the railroad crossing, the adjustment information generation part 220 generates a sound for erasing a sound in a specific frequency band corresponding to the railroad crossing sound as an event sound erasure sound and generates, as adjustment information, information indicating that the event sound erasure sound needs to be excluded from the erasure sound.
In S230, the erasure sound signal generation part 230 receives, as inputs, the user's ambient noise signal acquired using one or more reference microphones, the user's listening sound signal acquired using one or more error microphones, and the adjustment information generated in S220 and generates and outputs an erasure sound signal for erasing noise from the noise signal, the listening sound signal, and the adjustment information. The loudspeaker receives the erasure sound signal generated in S230 and emits sound based on the erasure sound signal.
Note that the erasure sound signal generation part 230 can also generate the erasure sound signal without using the noise signal. In this case, in S230, the erasure sound signal generation part 230 receives, as inputs, the user's listening sound signal obtained using one or more error microphones and the adjustment information generated in S220 and generates and outputs an erasure sound signal for erasing noise from the listening sound signal and the adjustment information.
According to an embodiment of the present invention, even during execution of processing for suppressing noise around the user, when a specific event that the user needs to recognize occurs, it is possible to make the user aware of the occurrence of the event.
  
The device of the present invention includes, for example, as a single hardware entity, an input part to which a keyboard or the like can be connected, an output part to which a liquid crystal display can be connected, a communication part to which a communicable communication device (for example, a communication cable) can be connected to the outside of the hardware entity, a CPU (Central Processing Unit: which may include a cache memory, a register, and the like), a RAM and a ROM which are memories, an external storage device such as a hard disk, and a bus which connects these input parts, output parts, communication parts, CPU, RAM, ROM, and external storage devices so that data can be exchanged. Also, if necessary, the hardware entity may include a device (drive) capable of reading and writing a recording medium such as a CD-ROM. A physical entity with such hardware resources includes a general purpose computer.
The external storage device of the hardware entity stores programs necessary for realizing the functions described above, data necessary for processing the programs, and the like (not limited to the external storage device, for example, the program may be stored in a ROM which is a read-only storage device). In addition, the data obtained by the processing of these programs are appropriately stored in a RAM, an external storage device, or the like.
In the hardware entity, each program stored in an external storage device (or a ROM or the like) and the data necessary for processing each program are read into a memory as needed, and interpreted, executed, and processed by the CPU as appropriate. As a result, the CPU realizes a predetermined function (each constituent element represented by the above, . . . part, . . . means, and the like).
The present invention is not limited to the above-described embodiments, and can be modified as appropriate without departing from the scope of the present invention. Furthermore, the processes described in the above embodiments are not only executed in chronological order according to the described order, but may also be executed in parallel or individually according to the processing capacity of the device which executes the processes or as necessary.
As described above, when the processing functions of the hardware entity (devices of the present invention) described in the above embodiments are implemented by a computer, the processing contents of the functions that the hardware entity needs to have are described by a program. The processing functions of the hardware entity are realized on the computer by executing this program on a computer.
A program describing the contents of this processing can be recorded in a computer-readable recording medium. Any computer-readable recording medium may be used, for example, a magnetic recording device, an optical disk, a magneto-optical recording medium, a semiconductor memory, or the like. Specifically, for example, as magnetic recording devices, a hard disk device, a flexible disk, a magnetic tape, and the like can be used, as optical discs, a DVD (Digital Versatile Disc), a DVD-RAM (Random Access Memory), a CD-ROM (Compact Disc Read Only Memory), a CD-R (Recordable)/RW (ReWritable), and the like can be used, as magneto-optical recording media, an MO (Magneto-Optical disc), and the like can be used, and as semiconductor memories, an EEP-ROM (Electronically Erasable and Programmable-Read Only Memory) and the like can be used.
Also, distribution of this program is carried out by selling, assigning, lending, etc. portable recording media such as DVDs and CD-ROMs on which the program is recorded, for example. Furthermore, the program may be distributed by storing the program in the storage device of the server computer and transferring the program from the server computer to other computers via the network.
A computer which executes such a program, for example, first stores the program recorded on a portable recording medium or the program transferred from the server computer once in an own storage device thereof. When executing the process, this computer reads the program stored in an own storage device thereof and executes the process according to the read program. Furthermore, as another execution form of this program, the computer may directly read the program from a portable recording medium and execute processing according to the program. In addition, each time the program is transferred from the server computer to this computer, the process according to the received program may be sequentially executed. In addition, the above processing is executed by a so-called ASP (Application Service Provider) type service which does not transfer the program from the server computer to this computer and realizes the processing function only by the execution instruction and result acquisition. Note that the program in this embodiment includes information to be used for processing by a computer and equivalent to a program (data that is not a direct command to the computer but has the property of prescribing the processing of the computer or the like).
Moreover, in this embodiment, the hardware entity is configured by executing a predetermined program on the computer, but at least part of these processing contents may be implemented by hardware.
The foregoing descriptions of embodiments of the invention have been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Modifications and variations are possible in light of the above teachings. The embodiments have been chosen and expressed to provide the best illustration of the principles of the invention and to enable those skilled in the art to utilize the present invention in various embodiments and with various modifications as suitable for the practical use contemplated. All such modifications and variations are within the scope of the present invention as defined by the appended claims, construed in accordance with their breadth which is fairly and legally afforded.
| Filing Document | Filing Date | Country | Kind | 
|---|---|---|---|
| PCT/JP2021/047310 | 12/21/2021 | WO |