Adaptive Blind Noise Suppression in Some Speech Processing Applications
Adaptive Blind Noise Suppression in Some Speech Processing Applications
Adaptive Blind Noise Suppression in Some Speech Processing Applications
Abstract
1
Under comparable SNR improvement, the proposed method adjusts only
3 coefficients against 250-450 for the conventional adaptive noise
cancellation systems. A framework for a speech recognition system that
uses the proposed method is suggested.
CONTENTS
Page No.
INTRODUCTION 1
2
TEST RESULTS 7
CONCLUSIONS 10
1. INTRODUCTION
3
ago The core of the problem is that in most situations the
characteristics of the noise are not known a priori and moreover they
may change in time. This implies the use of adaptive systems capable of
identifying and tracking the noise characteristics. This is why the
application of adaptive filtering for noise cancellation is widely used.
4
contribution the approach is slightly different. The basic idea is that in
many applications, for instance, hands-free cellular phones in car
environment howling control in hands-free phones, noise reduction in an
office environment, the noise reveals specific features that can be
exploited. In most instances although the noise might be quite wide-
band, there are always, as a rule, no more than two or three regions of
its frequency spectrum that carry most of the noise energy and the
removal of these dominant frequencies results in a considerable
improvement of S/N ratio. This brings the idea to use notch adaptive
filters capable of tracking the noise characteristics. In this paper a
modification of all-pass structures is used They are recursive, and at the
same time, are stable during the adaptive process. The approach is called
“blind” because there is no need of a reference signal.
5
II. CLASSICAL ADAPTIVE NOISE CANCELLATION
………….(3)
6
where N is the filter order and wi(n) is the ith coefficient of the adaptive
filter. Having in mind that
…..(4)
the minimization of E[e2(n)] is equivalent to the minimization of the
difference between the signal on the adaptive filter output y(n) and the
noise signal n1(n) present on the primary input. Obviously the better
replica of n1(n) y(n) is, that is, the better the adaptive filter is modeling
the impulse response hi(n) of the noise path, the smaller the difference.
The minimization of E[e2(n)] can be achieved by updating of the adaptive
filter coefficients. Most often the LMS and NLMS algorithms are used, the
latter having the advantage that the step size is relatively independent of
the amplitude of the input signal. According to the scheme in Fig. 1 the
updating equations for LMS and NLMS algorithms are given by
7
Both requirements could be met much easier using IIR adaptive
filters instead of FIR adaptive filters. IIR filters are usually avoided
because they create a lot of stability problems. To overcome this problem
we use a realization based on second order Gray-Markel lattice circuit
Fig.2. Using this circuit it becomes possible to implement a second order
notch/bandpass section Fig. 3.The advantages of such a realization are
first, it has extremely low pass band sensitivity that means resistance to
quantization effects. Second, it is very convenient for realization of
adaptive notch filters because it is possible tocontrol independently the
notch frequency and the bandwidth.
8
But, on the other hand, BW is directly connected to the distance
from the pole to the unity-circle. So if we use the structure of Fig. 3 as an
adaptive filter we may fix BW and thus fixing k2 we make constant the
distance from the pole to the unity-circle which means that with this
constraint we obtain an adaptive IIR filter free of stability problems.
Adapting k1 we may shift the notch frequency around the unity-circle.
Using the basic structure of Fig. 3 and the constraint mentioned above,
the final arrangement of our system
is shown in Fig. 4.
The system will work in the following manner: each section will
remove one of the dominant frequencies using an appropriate adaptive
algorithm. As shown in Fig. 4 we propose to update only the coefficients
k11, k12,…, k1M, while k2 is a priori determined from equation (9). Thus we
can reduce considerably the number of computations and can guarantee
the stability of the adaptive structure. The number of sections is
determined by the application. Here we introduce the NLMS algorithm for
adjusting the filter coefficients as
9
where M is the number of sections, ei(n) is the error signal, is the step
size and yi‘(n) is the derivative of yi(n) with respect to the coefficient
subject of adaptation.
10
IV. TEST RESULTS
The performance of the ABNS method for noise suppression is
assessed by computer simulations. Fig. 5 shows the original speech. The
speech is corrupted with noise from computer cooling fan that is most
often encountered in office environment and the resultant signal is
depicted in Fig. 6. The process of noise suppression is shown in Fig .7.
Here the system is composed of 3 sections each of them adapting its
coefficient to one of the dominant frequencies in the noise spectrum.
Fig.8 presents the trajectories of the filter coefficients. In this experiment
the capability of the system to track the changes in noise signal is tested
as the dominant frequencies shift from 0.1, 0.2 and 0.4 at the
beginning, to 0.14, 0.23 and 0.36. Here the system does not have
information about the dominant frequencies and adjusts its coefficients
to them, as it works.
11
12
Table 1 shows the improvement in SNR as a result of the
application of the proposed system. The obtained results are comparable
to these of the conventional adaptive noise canceller (ANC).The proposed
system is much faster and simpler to implement.
13
are extracted over 20 ms frames. Hamming windows having an overlap of
10 ms are used to calculate Mel Frequency Scale Cepstral Coefficients
and log- energy. Here the speech recognizer can be implemented on the
base of adaptive evolving fuzzy neural networks (EFuNNs) .Since the
input layer of the networks has fixed size, while the segments (words) are
made up of a variable number of frames, a technique for normalization is
needed. A discrete cosine transform (DCT) is applied to the whole
segment, retaining as many parameters as it is necessary. Several
application-oriented systems for automatic dialing and robot control are
under development.
VI. CONCLUSIONS
14
15