A danger sensing information device according to an embodiment of the present invention is explained below with reference to the drawings.
The danger sensing information device 10 is composed of an audio input unit 100, a danger sensing unit 200, and a danger information unit 900. The audio input unit 100 is provided with a microphone 110 and an A/D converter 120 wherein air vibrations due to the sound of a voice are captured and then converted to digital signals which are stored as input voice 240 in the memory device as will be explained later. The danger sensing unit 200 is composed of a processor 210, a memory device 220, and an external status input device 230. The memory device 220 contains each of programs such as a danger audio detection program 300, a child audio detection program 400, a noise volume measurement program 500, a danger determination program 600, an input voice 240, a danger audio volume 350, a child audio volume 420, a noise danger 520, danger audio weight 345, a noise threshold value 515, an active audio model 325, an inactive audio model 335, and a child audio model 415. The processor 210 performs a processing of input voice 240 using each program and transfers the results to the danger information unit 900. The details of the operation of each program will be explained later. The danger information unit 900 is provided with a communication device 910 which transmits the results of the danger sensing unit 200 to the outside.
In the basic frequency measurement processing 310, a basic frequency of the input voice 240 is calculated by an arbitrary method.
The active audio model matching detection processing 320 and the inactive audio model matching detection processing 330 calculate a degree of matching of the active audio model Ma and a degree of matching of the inactive audio model Mb, respectively, which indicate the degree of matching of input voice 240 with active audio model 325 and the inactive audio model 335, respectively.
For example, the degree of matching of the active audio model Ma is calculated using the following equation:
Ma=max Dai (i=1: Number of active audio models)
Dai=∥Finput−Fai∥
where Dai indicates a distance between the input voice and the active audio model i, Finput is a characteristic vector of the input voice, Fai is a characteristic vector of the active audio model i. For characteristic vectors, for example, MFCC (Mel Frequency Cepstrum Coefficients), LPC (Linear Prediction Coefficients), and auto relevant functions. As a result, a difference is calculated between the input voice and the active audio model having the closest acoustic characteristic to the input voice.
Mb is also calculated using the same equation as in the case of Ma.
Mb=max Dbi (i=1: Number of inactive audio models)
Dbi=∥Finput−Fbi∥
In the weight adjustment processing 340, the basic frequency f as the results of basic frequency measurement processing, the degree of matching Ma as the results of active audio model matching detection processing 320, and the degree of matching Mb as the results of inactive audio model matching detection processing 330 are weighted with a danger audio weight 345 and then added to calculate a danger audio volume 350 Cd. The danger audio volume Cd is calculated by the following equation wherein a basic frequency danger function is given by ff, danger audio weights are given by Wf, Wa and Wb:
Cd=Wf·ff(f)+Wa·Ma+Wb·Mb
where the basic frequency danger function ff is a function having a peak near the mean basic frequency fm of the speech of an adult male. For example, the following value is used:
ff(f)=|f−fm| if f>90 and f<130
0 otherwise
The child audio volume Cc is calculated by the following equation as in the cases of Ma and Mb.
Cc=max Dci (i=1: Number of child audio models)
Dci=∥Finput−Fci∥
The noise danger Cs is calculated using the following equation:
Cs=fs(N)
where N indicates a noise value of the input voice, fs is a function shown in
Initially, the degree of noise danger 510 is analyzed in the determination 710. The distinction analysis is carried out as follows using the threshold values θa and θb.
If the degree of noise danger is determined to be within the safe range (S1), a safe state is output (780). If the degree of danger is determined to be high due to the fact that the state with high noise continued for a fixed time (S2), the control shifts to the decision 740. If the degree of danger is determined to be low due to the fact that the state with low noise continued for a fixed time (S3), the control shifts to the decision 720. This processing is based on the hypotheses that in a place with abnormally high noise, dangers such as accidents or natural disasters are approaching the child or there is a high possibility of these, or that in a place with less noise, the number of passersby is less so that there is a higher possibility of running into an event of kidnapping.
In the decision 720, a distinction analysis is carried out for the danger audio volume input 350. For example, such distinction analysis is carried out as follows using the threshold values θd, θt and θT,
where d (x) is a function of 1 when equation x is true and 0 when equation x is false. Cdt is a value of Cd at a time t.
If the state with a high danger audio volume does not continue for a fixed time (S4), a safe state (780) is output. If the state with a high danger audio volume continues for a fixed time (S5), the control is shifted to the decision 730.
The threshold values θd, θt and θT are set up in the parameter setting unit 900. This is a processing performed by the danger audio detection program 300 for a brief voice. This is based on the hypothesis that if an audio danger state continues for a fixed time, it should be decided as dangerous
In the decision 730, a distinctive analysis is performed for the input child audio volume 410. The distinctive analysis is carried out as follows using the threshold values θe, θc and θc,
S7 otherwise where d(x) is a function of 1 when equation x is true and 0 when equation x is false. Cct is a value of Cc at a time t.
If the state with a high child audio volume lasts for a fixed time (S6), a safe state (780) is output. If the state with a high child audio volume does not last for a fixed time (S7), the control is shifted to the decision 740. This is a processing performed by the danger audio detection program 300 for brief voice. This is based on the hypothesis that when the child audio volume lasts for a fixed time, that is, in such a state that the child is determined to be talking to a dangerous voice, there is a high probability that the person is acquainted with the child so that the degree of danger is determined to be not as high.
In the decision 740, a decision is made based on the locked state of the device acquired from the external state input unit 600. If the device is locked in order to prevent false reports, a safe state is output (780). If it is not locked, a danger state is output (790).
The locking function of the device has the advantage of reducing the number of false reports, but it also interferes with regular communication. In that case, the locking function is excluded. In this case, the decision 740 immediately outputs a danger state (790).
The following methods are available for outputting the danger state:
As one of embodiments of a danger sensing information device 10 of the present invention, software mounting on the cellular phone is possible. If a microphone 110 and an A/D converter 120 in the audio input unit 100, a processor 210, a memory device 220 and an outer state input device 230 in the danger sensing unit and a communication device 910 in the danger information unit 900 are provided from those used in the calling functions and data communication functions of the cellular phone, programs and data in the memory device 220 can be newly introduced so that the advantage is that the product cost can be maintained to be low. In addition, for cellular phone users, the advantage is that there is no need of owning additional cellular terminals.
Number | Date | Country | Kind |
---|---|---|---|
2006-250239 | Sep 2006 | JP | national |