A New Method for Denoising Underwater Acoustic Signals Based on EEMD, Correlation Coefficient, Permutation Entropy, and Wavelet Threshold Denoising

Zhang Yuyan Yang Zhixia Du Xiaoli Luo Xiaoyuan

Yuyan Zhang, Zhixia Yang, Xiaoli Du, Xiaoyuan Luo (2024). A New Method for Denoising Underwater Acoustic Signals Based on EEMD, Correlation Coefficient, Permutation Entropy, and Wavelet Threshold Denoising. Journal of Marine Science and Application, 23(1): 222-237. https://doi.org/10.1007/s11804-024-00386-6
Citation: Yuyan Zhang, Zhixia Yang, Xiaoli Du, Xiaoyuan Luo (2024). A New Method for Denoising Underwater Acoustic Signals Based on EEMD, Correlation Coefficient, Permutation Entropy, and Wavelet Threshold Denoising. Journal of Marine Science and Application, 23(1): 222-237. https://doi.org/10.1007/s11804-024-00386-6

A New Method for Denoising Underwater Acoustic Signals Based on EEMD, Correlation Coefficient, Permutation Entropy, and Wavelet Threshold Denoising

https://doi.org/10.1007/s11804-024-00386-6
Funds: 

the National Natural Science Foundation of China 62033011

Science and Technology Project of Hebei Province 216Z1704G

Science and Technology Project of Hebei Province 20310401D

    Corresponding author:

    Yuyan Zhang yyzhang@ysu.edu.cn

  • Abstract

    The complexities of the marine environment and the unique characteristics of underwater channels pose challenges in obtaining reliable signals underwater, necessitating the filtration of underwater acoustic noise. Herein, an underwater acoustic signal denoising method based on ensemble empirical mode decomposition (EEMD), correlation coefficient (CC), permutation entropy (PE), and wavelet threshold denoising (WTD) is proposed. Furthermore, simulation experiments are conducted using simulated and real underwater acoustic data. The experimental results reveal that the proposed denoising method outperforms other previous methods in terms of signal-to-noise ratio, root mean square error, and CC. The proposed method eliminates noise and retains valuable information in the signal.

     

    Article Highlights
    ● This study mitigates partial loss in traditional denoising by combining denoising with PE and WTD, leveraging EEMD.
    ● PE segregates noise into pure and noise-dominant components, preserving effective signals through targeted denoising of the noise-dominant component.
    ● The method efficiently removes noise, eliminating local prominent peaks in the signal for a smoother and refined final output.
  • The Earth's ocean area significantly exceeds its land area, and the ocean contains abundant resources. Therefore, countries worldwide have increasingly focused on ocean development and protection. Owing to the complexity of ocean background noises and the non-linear, non-Gaussian, and non-stationary properties of underwater acoustic signals, the detection and mitigation of noise in underwater acoustic signals are vital (Tucker et al., 2016; Li et al., 2018b). Therefore, the research and application of underwater acoustic signal denoising methods are of great significance in the underwater acoustic field.

    Matched filters have been widely used in the past (Islam and Chong, 2014). These filters effectively utilize the differences between signals and noise distributions and use specific filters that match the frequency characteristics of both signals to achieve noise reduction. However, the use of matched filters for denoising non-stationary signals has limitations. Moreover, wavelet-based methods are used for denoising signals, but these methods require pre-selection of basis functions, and their effectiveness is not ideal as the frequency bands of useful signals and noise overlap (Yue et al., 2019; Wess and Dixon, 1997). Huang et al. (1998) proposed a new signal decomposition method named empirical mode decomposition (EMD). EMD can decompose a signal into high-to-low components and serve as an adaptive signal processing technique suitable for non-linear and non-stationary processes. This method was significantly used for signal denoising (Cui and Chen, 2015; Lu et al., 2016). However, EMD is susceptible to mode mixing, which leads to signal loss and affects signal processing. To solve this issue, Wu and Huang (2011) improved EMD and proposed a modified signal decomposition method named ensemble empirical mode decomposition (EEMD). EEMD incorporates the properties of EMD, adaptively processes non-linear and non-stationary signals, and effectively mitigates mode mixing.

    Recently, numerous denoising methods based on EEMD and its improved algorithms have been widely used in different fields. Zhao et al. (2011) used the EEMD algorithm for denoising Electrocardiogram (ECG) in the medical field. Sun et al. (2020) introduced a new surface electromyography (sEMG) denoising method based on EEMD and wavelet threshold to eliminate the random noise from sEMG signals. Yang et al. (2017) proposed a vibration signal denoising algorithm in the field of engineering vibration based on EEMD and correlation coefficients (CC). Moreover, Jia et al. (2021) introduced a novel method for denoising the vibration signal based on EEMD and gray theory. Xue et al. (2019) proposed a noise suppression method in the radar communication field based on EEMD and permutation entropy (PE) according to the characteristics of ground-penetrating radar signals. To effectively suppress noise in an atmospheric lidar return signal, Cheng et al. (2021) proposed a segmentation singular value decomposition (SVD) -lifting wavelet transform (LWT) denoising algorithm based on EEMD. Peng et al. (2021) proposed a signal denoising method in the field of seismic monitoring based on EEMD and multiscale principal component analysis. The method exhibited good prospects for processing microseismic waves.

    However, no studies have been reported on underwater acoustic signal denoising using EEMD (Li et al., 2017b; Li et al., 2018a). Among the existing denoising algorithms, intrinsic mode functions (IMFs) are mainly classified into noise IMFs and real IMFs (Chen et al., 2012; Singh et al., 2014; Li et al., 2018). Generally, noise IMFs are frequently discarded during the denoising process, but this method often lacks accuracy.

    Therefore, this study presents a hybrid algorithm for denoising underwater acoustic signals based on EEMD, CC, PE, and wavelet threshold denoising (WTD). Compared with existing denoising algorithms, the proposed algorithm categorizes IMFs into three parts and does not directly discard high-frequency noise signals but instead uses WTD for denoising. Thus, the proposed algorithm can effectively preserve features of the original signals.

    The basic concept of EMD methods involves the decomposition of the original signal into its IMFs (Ogundile et al., 2020). This process has two constraints:

    1) The number of zero crossings and extrema must be either equal or differ by one.

    2) At any point in the complete data, the mean value of the envelope defined by local minima and local maxima must be zero.

    The specific decomposition steps are as follows (Xiong et al., 2017):

    1) All local maxima and minima of the signal are determined, and the maxima and minima with a tertiary spline interpolation curve are connected to form the upper and lower envelopes of the signal yup(t) and ylow(t).

    2) The average value curve of the upper and lower envelope is calculated as follows:

    $$ m_1(t)=\frac{1}{2}\left[y_{\text {up }}(t)+y_{\text {low}}(t)\right] $$ (1)

    3) x(t) is subtracted from m1(t) to obtain a new sequence h1(t) with the removal of low-frequency components:

    $$ h_1(t)=x(t)-m_1(t) $$ (2)

    4) Steps (1)–(3) are repeated, and then the signal is filtered m times to obtain IMF1(t) = m(t).

    5) r(t) = x(t) − m1(t), the signal r(t) is further decomposed according to steps (1)–(4). As it becomes impossible to extract components that meet the IMF conditions from Residual component, we can obtain n components that satisfy IMF. Finally, we can obtain IMFn(t) = r(t).

    2.2.1   The basic principles of EEMD

    When using the EEMD algorithm to process the signal, white Gaussian noise is added to compensate for any missing signal scale. After a sufficient number of additions, the final decomposition result is averaged. In this phase, the additional noise cancels each other out, leaving only the lasting and stable part that represents the signal. The summarized process of EEMD is expressed as follows (Jia et al., 2021):

    1) A white noise with a specific amplitude A is added to the original signal.

    2) EMD is performed on the signal with the added white noise, and K IMF components and one residual component are obtained.

    3) Steps 1) and 2) are repeated with a specific number of trials. In each trial, K, the number of IMF, remains constant. Equation (3) shows the results of all trials.

    $$ x(t)+n_1(t)=\sum\limits_{j=1}^k \mathrm{IMF}_{i j}(t)+r_i(t) $$ (3)

    where x(t) is the original signal; ni(t) is the added white noise of the ith trial; IMFij(t) is the jth IMF component of the ith trial; ri(t) is the residual component of the ith trial.

    4) The ensemble mean of all trials is calculated. Corresponding formulas are expressed in Eq. (4) and (5). The final decomposition result of the original signal obtained via EEMD is shown in Eq. (6).

    $$ \operatorname{IMF}_j(t)=\frac{1}{M} \sum\limits_{i=1}^M \operatorname{IMF}_{i j}(t) $$ (4)
    $$ r(t)=\frac{1}{M} \sum\limits_{i=1}^M r_i(t) $$ (5)
    $$x(t)=\sum\limits_{j=1}^k \mathrm{IMF}_j(t)+r(t) $$ (6)

    where M is the number of trials, and component of EEMD.

    2.2.2   EEMD parameter selection

    When introducing white noise to an original signal, the selection of the two vital parameters, amplitude (A) and ensemble number (M), becomes an urgent issue that needs to be addressed. The following relationship between A and M is established (Du et al., 2016):

    $$ S=A / \sqrt{M} $$ (7)

    where S denotes the standard deviation of the error between the original signal and IMFs. Equation (7) reveals that the error is directly proportional to the A of the added white noise and inversely proportional to M.

    When using the EEMD algorithm to process signals, if A is too small, the white noise may not be evenly distributed across the scales of each component, and the main frequency of the component may not be unique. Moreover, white noise cannot eliminate interruptions in signal decomposition and suppress mode mixing. If A is too large, noise interference may occur, and the added white noise cannot be eliminated during ensemble averaging. Therefore, this factor affects the final decomposition result. Because EEMD decomposition is more sensitive to noise, A is usually set to a relatively small value. Wu et al. (2010) analyzed large datasets and found that when A is set to 0.1–0.5, the maximum interference error caused by the added noise to the final decomposition result is ~1%.

    As the same amount of A is added to the Gaussian white noise, a smaller value of M cannot eliminate the interference of the added white noise on the IMF component during the averaging process. As M continuously increases, the added white noise gradually reduces its impact on the decomposition result, thereby enhancing signal processing efficiency. However, at a larger M value, the computational load of signal decomposition increases, leading to a decrease in signal processing efficiency. Generally, an average of ~100 sets is considered the standard. Nevertheless, specific experiments are required for individual signals.

    After the original signal is decomposed via EEMD, k IMF components are obtained across k frequency bands. Among these components, high-frequency components mainly comprise noise signals, while low-frequency components mainly contain effective signals. Therefore, determining the critical point between noise components and effective signal components is vital (Shang et al., 2019).

    According to the empirical energy of the IMF components, effective IMF components are selected for reconstruction to obtain the denoised signal, while the noise IMF components are discarded (Zhao et al., 2011). The corresponding threshold for this selection is determined according to the specific characteristics of each signal. Incorrect component selection can lead to erroneous results (Yang et al., 2015). Additionally, the correlation between each component and the original signal is calculated to distinguish noise components from effective signal components. Covariance is commonly used to quantify the degree of relationship and relevance between two random variables (Gong et al., 2022), as shown in Eq. (8):

    $$ \operatorname{cov}(X, Y)=\mathrm{E}(X Y)-\mathrm{E}(X) \mathrm{E}(Y) $$ (8)

    However, the size of the covariance value is not an accurate measure of the degree of correlation between two random variables. The size of the covariance value is influenced by the dimensions of the two variables and is not suitable for direct comparison. To accurately measure the correlation degree between two random variables, CC is introduced to assess their linear correlation. Generally, the CC between the noise-dominant component and the original signal is smaller than that between the effective and original signals. Furthermore, the change in CC of the effective signal is more significant than that of the noise component. Figure 1 displays the critical point, and the turning point is visibly distinct.

    $$ \mathrm{CC}(n, x)<\mathrm{CC}(s, x) $$ (9)
    Figure  1  Correlation coefficient diagram
    Download: Full-Size Img

    The CC between each modal component and the original signal is calculated as follows:

    $$ \mathrm{CC}\left[x(t), \mathrm{IMF}_i\right]=\frac{\sum\limits_{i=1}^N[x(t)-\bar{x}]\left[f_i(t)-\bar{f}_i\right]}{\left.\sqrt{\sum\limits_{i=1}^N[x(t)-\bar{x}]^2} \sqrt{\sum\limits_{i=1}^N\left[f_i(t)-\bar{f}_i\right.}\right]^2} $$ (10)
    $$ \bar{x}=\frac{1}{N} \sum\limits_{i=1}^N x(t) $$ (11)
    $$ \overline{f_i}=\frac{1}{N} \sum\limits_{i=1}^N f_i(t) $$ (12)

    where fi(t) is the ith IMF component, N is the number of sampling points, fi is the mean of N IMF components, and x is the sample mean of the original signal.

    After calculating CCs, the turning point is identified as the point where the curvature value R of the correlation curve reaches its maximum (Shaw et al., 2016). The specific calculation formula is as follows:

    $$ R=\frac{\mathrm{CC}^{\prime}}{\left(1+\mathrm{CC}^{\prime 2}\right)^{1.5}} $$ (13)
    $$ \mathrm{CC}^{\prime \prime}=\mathrm{CC}_{i+1}-2 \mathrm{CC}_i+\mathrm{CC}_{i-1} $$ (14)
    $$ \mathrm{CC}^{\prime}=\mathrm{CC}_i-\mathrm{CC}_{i-1} $$ (15)

    Therefore, the noise and signal components can be identified using the CC graph by locating the maximum curvature. This approach has the following advantages:

    1) CC standardizes the correlations between two random variables by subtracting the covariance, thereby eliminating dimensional numerical influence (Zhao, 2015; Qiao, 2016).

    2) CC is used to analyze the characteristic signals within long-term signal waveforms. The calculation of CC aids in the extraction of similar components (Li et al., 2017a).

    3) A line chart is created based on the calculation of CC. This approach eliminates the need for setting artificial thresholds, thereby preventing the issue of setting different thresholds for different signals.

    However, owing to the non-linearity, non-stationarity, and non-Gaussian properties of the underwater acoustic signal, the direct elimination of noise signals may result in the loss of some useful signals. Therefore, further classifying noise components into pure noise and noise-dominant components is vital.

    PE is a spatial complexity measurement used for analyzing one-dimensional time series and is characterized by its simple calculation and good noise resistance (Bandt and Pompe, 2002). The summarized process of PE is as follows (Zanin et al., 2018):

    1) The time series X = {x1, x2, ⋯, xN} is reconstructed as follows:

    $$ \left\{\begin{array}{l} \{x(1), x(1+\tau), \cdots, x(1+(m-1) \tau)\} \\ \;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\vdots \\ \{x(j), x(j+\tau), \cdots, x(j+(m-1) \tau)\} \\ \;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\vdots \\ \{x(K), x(K+\tau), \cdots, x(K+(m-1) \tau)\}(K=n-(m-1) \tau) \end{array}\right. $$ (16)

    where τ and m denote the time lag and embedding dimension.

    2) Each row vector is rearranged in an ascending order:

    $$ x\left(i+\left(j_1-1\right) \tau\right) \leqslant x\left(i+\left(j_2-1\right) \tau\right) \leqslant \cdots \leqslant x\left(i+\left(j_m-1\right) \tau\right) $$ (17)

    3) A symbol sequence for each row vector is obtained as:

    $$ S(g)=\left(j_1, j_2, \cdots, j_m\right)(g=1, 2, \cdots, l \text { and } l \leqslant m !) $$ (18)

    4) PE is defined as:

    $$ H_p(m)=-\sum\limits_{j=1}^l P_j \ln P_j $$ (19)

    where Pj is the probability of one symbol sequence.

    5) Normalized PE is defined as:

    $$ H_p=H_p(m) / \ln (m !) $$ (20)

    In Equation (20), the $H_p$ value reflects the randomness of the time series. A smaller $H_p$ value indicates a more regular time series, while a larger $H_p$ value signifies a more random time series. In this study, we set $m=3$ and $\tau=1$ based on the recommendation from a previous study (Bandt and Pompe, 2002). Zhang et al. (2022) indicated that at $H_p \geqslant 0.75$, the corresponding mode is considered a noise signal, and at $H_p<0.75$, the corresponding mode is considered an effective signal. Therefore, in this paper, we use PE to further identify the noise components obtained from the CC diagram as noise-dominant components and pure noise components.

    Compared with traditional filtering methods, wavelet transform exhibits multi-resolution characteristics. A one-dimensional noisy time series can be expressed as follows (Wang et al., 2018):

    $$ s(t)=y(t)+n(t) $$ (21)

    where y(t) denotes the original signal, n(t) is the noise signal, and s(t) is the noisy signal.

    Assuming that n(t) is Gaussian white noise, y(t) is usually represented as a low-frequency signal in practical engineering applications. Therefore, the following methods are used to reduce noise. The specific steps are as follows:

    1) A suitable wavelet basis function and decomposition level are selected to perform wavelet decomposition on the noisy signal.

    2) Thresholding is applied to high-frequency coefficients through the selection of an appropriate threshold method at different decomposition scales.

    3) The signal is reconstructed using low-frequency coefficients from wavelet decomposition and threshold high-frequency coefficients at different scales.

    Wavelet thresholding with different thresholds usually involves three methods, namely denoising with a default threshold, denoising with a specified threshold, and denoising with a hard threshold. Among these methods, denoising with a specified threshold can be further classified into soft thresholding and hard thresholding. In this paper, a soft thresholding method is selected to eliminate the threshold.

    When using the EMD algorithm, EEMD algorithm, and the CC method for denoising signals, the direct removal of the noise IMF components may also eliminate hydroacoustic signal components, thereby affecting noise reduction. This study introduces PE and WTD based on the EEMD method to develop the proposed underwater acoustic signal denoising algorithm (Figure 2). The specific procedures are summarized as follows:

    Figure  2  Flow chart of the proposed denoising algorithm for underwater acoustic signals
    Download: Full-Size Img

    1) The underwater signal is decomposed via EEMD to produce numerous IMFs, including noise IMFs and real IMFs;

    2) CCs between each IMF component and the original signal are calculated;

    3) Noise and signal IMFs are classified based on the inflection points of the CC graph. The maximum curvature of the CC curve serves as the critical point distinguishing between noise and signal IMFs.

    4) Signal IMFs are screened out, and the PEs of other IMFs are calculated.

    5) Noise-dominant IMFs are identified based on their PEs. If the PE of IMF is > 0.75, it is considered a pure noise IMF; if the PE of IMF is < 0.75, it is classified as a noise-dominant IMF.

    6) The noise-dominant IMFs are denoised via WTD. Wavelet soft-threshold denoising (WSTD) is used for enhancing the quality of noise-dominant IMFs, with the wavelet basis function and decomposition level set to db4 and 4, respectively.

    7) The denoised signal is obtained through the reconstruction of the denoised noise-dominant IMFs and real IMFs.

    To effectively evaluate the denoising effect, numerous scholars have proposed some evaluation methods, including signal-to-noise ratio (SNR), root mean square error (RMSE), CC, Lyapunov exponent, and noise intensity. In this study, we selected the first three evaluation criteria for the assessment of the denoising effect (Wang et al., 2013; Rosenstein et al., 1993).

    3.2.1   SNR

    SNR represents an energy relationship between signal and noise. A higher SNR indicates that the signal contains a larger amount of useful information and a lesser noise. Therefore, the SNR serves as an intuitive method for evaluating the quality of the denoised signal by analyzing whether the SNR has improved. SNR is defined as follows:

    $$ \mathrm{SNR}=10 \times \lg \left(\frac{\|x\|^2}{\|\hat{x}-x\|^2}\right) $$ (22)

    where $x, \hat{x}$, and $\left\|^*\right\|$ indicate the noise signal, denoised signal, and norm, respectively.

    3.2.2   RMSE

    RMSE quantifies the numerical difference between the denoised signal and the original signal. A smaller RMSE indicates a more effective noise reduction. RMSE is defined as follows:

    $$ \text { RMSE }=\sqrt{\frac{\|\hat{x}-x\|^2}{\operatorname{length}(x)}} $$ (23)

    where length(*) represents the length of signals.

    3.2.3   CC

    CC represents the correlation degree of the two signals. A larger coefficient indicates a stronger correlation degree between two signals. The specific calculation formula is introduced in section 2.3.

    According to the non-linear and non-stationary properties of underwater acoustic signals, four types of simulation signals are selected for denoising, namely Blocks, Bumps, Doppler, and Heavysine. The sampling frequency and data length are set to 1 kHz and 1 024, respectively. Noisy signals with different SNRs are obtained through the addition of Gaussian white noise. Figure 3 shows the original signal and noisy signal with an SNR of 0 dB.

    Figure  3  Time-domain waveform of the noisy signal with 0 dB
    Download: Full-Size Img

    Taking the Heavysine signal as an example, Figure 3 shows the complete submersion of the Heavysine signal in noise. The following is the complete denoising process.

    The noisy Heavysine signal is decomposed via EMD and EEMD methods (Figure 4). In both decomposition methods, the IMF1 represents the shortest oscillation period, typically corresponding to a noise component or high-frequency components.

    Figure  4  Decomposition results of the noisy Heavysine signal with 0 dB
    Download: Full-Size Img

    The CC between each IMF component obtained through EMD and EEMD methods and the original signal is calculated (Figure 5).

    Figure  5  CC curve of the noisy Heavysine signal obtained via EMD and EEMD
    Download: Full-Size Img

    Figure 5 reveals that the maximum curvature of CC curves occurs at the fourth-order position in EEMD and the sixth-order position in EMD. The first four IMFs can be classified as noise IMFs in EEMD, and the first six IMFs can be identified as noise IMFs in EMD.

    Noise-dominant IMFs can be identified based on the PEs of noise IMFs in EEMD. For the noisy Heavysine signal with an SNR of 0 dB, the PEs of noise IMFs are shown in Table 1. The PE of IMF4 is less than 0.75, indicating that IMF4 is the noise-dominant IMF in EEMD.

    Table  1  PEs of noise IMFs for simulation signal
    IMF1 IMF2 IMF3 IMF4
    0.997 7 0.884 2 0.757 7 0.578 0

    WSTD is applied to IMF4 using the wavelet basis function of db4 and a decomposition level of 4. The denoised Heavysine signal is obtained through the reconstruction of the denoised IMF4 and real IMFs. Figure 6 displays the denoising results. The denoising method that involves the use of wavelet soft-threshold method to directly eliminate signals is referred to as WSTD. Denoising methods that combine CC with EMD and EEMD are denoted as EMD–CC and EEMD–CC. The proposed denoising method is denoted as EEMD–CC–PE–WSTD.

    Figure  6  Denoising results of the noisy Heavysine signal
    Download: Full-Size Img

    Figure 6 indicates that the denoised signal obtained through EEMD–CC–PE–WSTD can effectively fit the source signal. Sharp spikes are effectively eliminated, leading to a smoother and cleaner signal with minimal distortion. Compared with the signal obtained through the other three methods, the denoised signal obtained through the proposed method retains more features of the original signal. Moreover, the baseline of the denoised signal closely corresponds to that of the source signal, indicating effective baseline correction.

    To further verify the effectiveness of the proposed method, four types of signals with different SNRin are denoised through EMD–CC, EEMD–CC, WSTD, and EEMD–CC–PE–WSTD methods. Figures 7 and 8 show the denoising results.

    Figure  7  SNRout of different methods for simulated signals
    Download: Full-Size Img
    Figure  8  RMSEout of different methods for simulated signals
    Download: Full-Size Img

    On average, EEMD–CC–PE–WSTD achieves the highest denoising effect. Particularly, at a low SNRin, EEMD–CC–PE–WSTD features a significantly improved denoising performance. Additionally, EEMD–CC–PE–WSTD exhibits a higher SNR and lower RMSE than the other three denoising methods.

    EMD–CC, EEMD–CC, WSTD, and EEMD–CC–PE–WSTD algorithms are used to denoise the ship sound signal from an open-source database to further verify the effectiveness of the proposed underwater acoustic denoising algorithm. Subsequently, SNR, RMSE, and CC are used to evaluate the denoising results.

    The ocean acoustic recordings used to train the classifier were obtained from various open-source databases. Ship noise data were collected from the ShipsEar (Santos-Domínguez et al., 2016). Most of these recordings were collected in or near the Port of Vigo using a hydrophone with a sampling rate of 52 734 Hz. Recordings from different types of vessels in or near the docks included fishing boats, ocean liners, ferries of various sizes, containers, tugs, pilot boats, yachts, and small sailboats. In this study, the audio recordings of passenger ships were selected for analysis to assess the performance of the denoising algorithm.

    To facilitate the subsequent denoising processing, we use a 0.6-s frame to crop the filtered target audio signal. To ensure the integrity of the signal during trimming, the overlapping duration between two adjacent frames of signals is set to 0.05 s. The signal-trimming process is shown in Figure 9.

    Figure  9  Ship signal clipping process
    Download: Full-Size Img

    The time-domain diagram of the processed ship signal is shown in Figure 10.

    Figure  10  Tested ship signal waveform
    Download: Full-Size Img

    The time-domain diagram of the collected ship signal contains numerous peaks and a significant amount of noise, indicating the need for denoising (Figure 10).

    Notably, the A of the added white noise and the number of trials M are the two key parameters in EEMD. To select suitable parameters, the SNR of the ship signal after EEMD processing is calculated by varying A and M (Table 2).

    Table  2  SNR of signals processed via EEMD under different A and M
    M (s) A (mV)
    0.1 0.2 0.3 0.4 0.5
    50 13.574 3 13.774 9 13.882 6 13.855 6 14.170 4
    100 13.627 4 13.776 6 14.079 5 14.021 7 14.073 0
    150 13.462 3 13.660 0 13.870 3 13.887 1 13.981 2

    Under a constant number of trials, a noise amplitude of 0.3 mV results in a maximum SNR value, while SNR decreases with increasing noise amplitude (Table 2). This is due to the effect of the added white noise on the signal, which increases overall noise and affects the decomposition results. At a constant noise amplitude of 0.3 mV, the maximum SNR value is achieved when M = 100. As the number of trials increases, the corresponding SNR value gradually stabilizes, indicating that excessively increasing the number of trials displays minimal effects on the decomposition results.

    In summary, the experiment indicates that the optimal values are A = 0.3 mV and M = 100.

    5.4.1   EMD–CC method

    The processing effect of the decomposition of the ship signals obtained via EMD is shown in Figure 11. The ship signal decomposes into eight natural modal components through EMD (Figure 11). The first three components exhibit significant noise interference and mode mixing in the spectrogram. Subsequently, the signal is reconstructed, and the CC between each component and the original ship signal is calculated (Figure 12).

    Figure  11  EMD decomposition components and spectrum diagram of ship signals
    Download: Full-Size Img
    Figure  12  EMD–CC variation curve of ship signal
    Download: Full-Size Img

    Figure 12 shows that the maximum curvature of CC occurs at the third-order position. The first three eigenmode components are considered noise signals and will be discarded, while the remaining components are classified as effective signals for reconstruction.

    5.4.2   EEMD–CC method

    The ship signal is decomposed using the two EEMD parameters determined in Figure 13.

    Figure  13  EEMD decomposition components and spectrum diagram of ship signals
    Download: Full-Size Img

    EEMD decomposes the ship signal into six IMF components and sequentially separates the original ship signal from high-to-low frequency. Each component, starting from IMF2, exhibits a single signal scale, indicating the effective suppression of the modal mixing phenomenon. Additionally, the CC between each component and the original ship signal is calculated, and the results are shown in Figure 14.

    Figure  14  EEMD–CC variation curve of ship signals
    Download: Full-Size Img

    Figure 14 reveals the point of maximum curvature in the CC curve corresponds to the third-order position. The first three IMF components are considered noise signals and will be discarded, and the remaining components are classified as valid signals for the reconstruction process.

    5.4.3   EEMD–CC–PE–WSTD method

    From section 5.4.2, the first three IMF components are identified as noise components, and the PE value of each component is calculated, as shown in Table 3.

    Table  3  PEs of noise IMFs for measured ship signals
    IMF1 IMF2 IMF3
    0.765 8 0.533 1 0.994 9

    The results revealed that the PE of IMF2 is less than 0.75. Thus, IMF2 can be considered the noise-dominant component and is denoised via WSTD. Additionally, the second IMF component is reconstructed along with the last three IMF components.

    Through the adjustment of the time axis, the time-domain diagram of the original signal is shown in Figure 15.

    Figure  15  Time-domain diagrams of original ship signals
    Download: Full-Size Img

    The time-domain diagrams of the signal obtained from different denoising methods are shown in Figure 16.

    Figure  16  Time-domain diagrams of the ship signal denoised via different methods
    Download: Full-Size Img

    All four methods effectively reduce peaks and glitches in the signal via denoising (Figure 16). However, the signal denoised via the proposed method is smoother and retains more valuable information compared with other methods. Additionally, the proposed method significantly reduces the impurities and noise in the signal, making it more favorable for human hearing.

    Furthermore, the short-time Fourier transform diagram of the original signal is shown in Figure 17.

    Figure  17  Short-time Fourier transform diagram of the original signal
    Download: Full-Size Img

    The short-time Fourier transform diagrams of the signal obtained via different denoising methods are shown in Figure 18.

    Figure  18  Short-time Fourier transform diagram of the ship signal denoised via different methods
    Download: Full-Size Img

    For ship noise, the signal frequency range is mainly concentrated below 1000 Hz (Zhang et al., 2017). Figures 17 and 18 show the wide frequency distribution of the original signal, particularly with various frequency components present above 1 000 Hz. However, the frequency of the denoised signal is highly concentrated, mainly below 5 000 Hz, and the background noise is significantly reduced. Furthermore, the algorithm proposed in this paper exhibits a more significant denoising effect on the signal, with the signal mainly concentrated below 1 000 Hz.

    The following three parameters are used to evaluate the denoising effect of actual underwater acoustic signals (Table 4).

    Table  4  Denoising results of the measured ship signal
    Parameter original signal WSTD EMD–CC EEMD–CC EEMD–CC–PE–WSTD
    SNR (dB) 8.956 4 13.053 4 12.351 9 18.079 5 22.030 0
    RMSE - 0.068 5 0.057 1 0.023 6 0.017 4
    CC 1 0.912 5 0.903 3 0.980 4 0.990 1

    Table 4 indicates that the SNR of the reconstructed signal is improved using WSTD, EMD, and EEMD algorithms. Moreover, the SNR of the signal reconstructed through the improved EEMD method is 14 dB higher than that of the original signal. The RMSE and CC results revealed that signals processed through the improved EEMD exhibit lower RMSE and higher CC than the original signal. Therefore, the EEMD–CC–PE–WSTD algorithm exhibits an improved denoising effect and is effective and suitable for denoising underwater acoustic signals.

    To reduce the loss of effective signals during the denoising process of underwater acoustic signals, a noise reduction method based on EEMD, which incorporates CC, PE, and WSTD is proposed. The key innovations and conclusions of the proposed denoising method are as follows:

    1) The adaptive decomposition algorithm, EEMD, is used to denoise underwater acoustic data signal. This process decomposes original data into a set of IMF components with different scales.

    2) Compared with existing denoising methods, CC and PE classify IMFs into three parts (pure noise IMFs, noise-dominant IMFs, and real IMFs).

    3) The curvature value of the CC curve is used to distinguish between the noise and real components of the underwater acoustic signal based on the characteristics of the signal without the need for an arbitrary threshold.

    4) The proposed EEMD–CC–PE–WSTD method does not require human intervention. The calculation process is adaptive.

    5) Four types of signals (Blocks, Bumps, Doppler, and Heavysine) with different SNRs are denoised through EMD–CC, EEMD–CC, EEMD–CC–PE–WSTD, and WSTD methods. The proposed denoising method exhibits higher performance, with lower RMSE and higher SNR.

    6) Compared with other denoising methods, the EEMD–CC–PE–WSTD method is an effective method for denoising underwater acoustic signals. This study provides a solid foundation for the subsequent processing of ship-radiated noise signals, including prediction, detection, extraction, and classification.

    Competing interest  The authors have no competing interests to declare that are relevant to the content of this article.
  • Figure  1   Correlation coefficient diagram

    Download: Full-Size Img

    Figure  2   Flow chart of the proposed denoising algorithm for underwater acoustic signals

    Download: Full-Size Img

    Figure  3   Time-domain waveform of the noisy signal with 0 dB

    Download: Full-Size Img

    Figure  4   Decomposition results of the noisy Heavysine signal with 0 dB

    Download: Full-Size Img

    Figure  5   CC curve of the noisy Heavysine signal obtained via EMD and EEMD

    Download: Full-Size Img

    Figure  6   Denoising results of the noisy Heavysine signal

    Download: Full-Size Img

    Figure  7   SNRout of different methods for simulated signals

    Download: Full-Size Img

    Figure  8   RMSEout of different methods for simulated signals

    Download: Full-Size Img

    Figure  9   Ship signal clipping process

    Download: Full-Size Img

    Figure  10   Tested ship signal waveform

    Download: Full-Size Img

    Figure  11   EMD decomposition components and spectrum diagram of ship signals

    Download: Full-Size Img

    Figure  12   EMD–CC variation curve of ship signal

    Download: Full-Size Img

    Figure  13   EEMD decomposition components and spectrum diagram of ship signals

    Download: Full-Size Img

    Figure  14   EEMD–CC variation curve of ship signals

    Download: Full-Size Img

    Figure  15   Time-domain diagrams of original ship signals

    Download: Full-Size Img

    Figure  16   Time-domain diagrams of the ship signal denoised via different methods

    Download: Full-Size Img

    Figure  17   Short-time Fourier transform diagram of the original signal

    Download: Full-Size Img

    Figure  18   Short-time Fourier transform diagram of the ship signal denoised via different methods

    Download: Full-Size Img

    Table  1   PEs of noise IMFs for simulation signal

    IMF1 IMF2 IMF3 IMF4
    0.997 7 0.884 2 0.757 7 0.578 0

    Table  2   SNR of signals processed via EEMD under different A and M

    M (s) A (mV)
    0.1 0.2 0.3 0.4 0.5
    50 13.574 3 13.774 9 13.882 6 13.855 6 14.170 4
    100 13.627 4 13.776 6 14.079 5 14.021 7 14.073 0
    150 13.462 3 13.660 0 13.870 3 13.887 1 13.981 2

    Table  3   PEs of noise IMFs for measured ship signals

    IMF1 IMF2 IMF3
    0.765 8 0.533 1 0.994 9

    Table  4   Denoising results of the measured ship signal

    Parameter original signal WSTD EMD–CC EEMD–CC EEMD–CC–PE–WSTD
    SNR (dB) 8.956 4 13.053 4 12.351 9 18.079 5 22.030 0
    RMSE - 0.068 5 0.057 1 0.023 6 0.017 4
    CC 1 0.912 5 0.903 3 0.980 4 0.990 1
  • Bandt C, Pompe B (2002) Permutation entropy: A natural complexity measure for time series. Phys. Rev. Lett. 88(17): 1–4. https://doi.org/10.1103/PhysRevLett.88.174102
    Chen R, Tang B, Lu Z (2012) Ensemble empirical mode decomposition denoising method based on correlation coefficients for vibration signal of rotor system. J. Vib. Measur. Diagnosis. 32(4): 542–546
    Cheng X, Mao J, Li J, Zhao H, Zhou C, Gong X, Rao Z (2021) An EEMD-SVD-LWT algorithm for denoising a lidar signal. Measurement 168(3): 108405. DOI: 10.1016/j.measurement.2020.108405
    Cui B, Chen X (2015) Improved hybrid filter for fiber optic gyroscope signal denoising based on EMD and forward linear prediction. Sens. Actuator A Phys. 230: 150–155. https://doi.org/10.1016/j.sna.2015.04.021
    Du S, Liu T, Huang D, Li GL (2016) An optimal ensemble empirical mode decomposition method for vibration signal decomposition. Int. J. Acoust. Vib. 139: 1–18. https://doi.org/10.1115/L4035480
    Gong Y, Tong Li Yu Z, Zhang X (2022) Research on fault diagnosis method of rotating machinery misalignment based on Pearson correlation coefficient. New Technology & New Products of China (5): 48–50. DOI: 10.13612/j.cnki.cntp.2022.05.013
    Huang N, Shen Z, Long SR, Wu MC, Shih HH, Zheng Q, Yen NC, Tung CC, Liu HH (1998) The empirical mode decomposition method and the Hilbert spectrum for non-stationary time series analysis. Proc. R. Soc. A Lond. 454: 903–995 https://doi.org/10.1098/rspa.1998.0193
    Islam MS, Chong U (2014) Noise reduction of continuous wave radar and pulse radar using matched filter and wavelets. J Image Video Proc 2014: 43. https://doi.org/10.1186/1687-5281-2014-43
    Jia Y, Li G, Dong X, He K (2021) A novel denoising method for vibration signal of hob spindle based on EEMD and grey theory. Measurement 169: 108490. DOI: 10.1016/j.measurement.2020.108490
    Li Q, Qin B, Si W, Wang R (2017a) Estimation algorithm for adaptive threshold of hybrid particle swarm optimization wavelet and its application in partial discharge signals denoising. High Voltage Engineering 43(5): 1485–1492. https://doi.org/10.13336/j.1003-6520.hve.20170428013
    Li YX, Li YA, Chen X, Yu J (2017b) Denoising and feature extraction algorithms using NPE combined with VMD and their applications in ship-radiated noise. Symmetry 9(11): 256. https://doi.org/10.3390/sym9110256
    Li YX, Li YA, Chen X, Yu J (2018a) Research on ship-radiated noise denoising using secondary variational mode decomposition and correlation coefficient. Sensors 18(1): 48. https://doi.org/10.3390/s18010048
    Li YX, Li YA, Chen X, Yu J, Yang H, Wang L (2018b) A new underwater acoustic signal denoising technique based on CEEMDAN, mutual information, permutation entropy, and wavelet threshold Denoising. Entropy 20(8): 563 https://doi.org/10.3390/e20080563
    Lu W, Zhang L, Liang W, Yu X (2016) Research on a small-noise reduction method based on EMD and its application pipeline leakage detection. Loss Prev. Process Ind. 41: 282–293. https://doi.org/10.1016/j.jlp.2016.02.017
    Ogundile OO, Usman AM, Versfeld J (2020) An empirical mode decomposition based hidden Markov model approach for detection of Bryde's whale pulse calls. J. Acoust. Soc. Am. 147(2): EL125–EL131. https://doi.org/10.1121/10.0000717
    Peng K, Guo H, Shang X (2021) EEMD and multiscale PCA-based signal denoising method and its application to seismic P-phase arrival picking. Sensors 21(16): 5271. https://doi.org/10.3390/s21165271
    Qiao J (2016) The estimation of technical efficiency based on double-lag stochastic Frontier model. Statistics & Information Forum 31(11): 44–48. DOI: 10.3969/j.issn.1007-3116.2016.11.008
    Rosenstein MT, Collins JJ, Luca CJD (1993) A practical method for calculating largest Lyapunov exponents from small data sets. Physica D 65: 117–134. https://doi.org/10.1016/0167-2789(93)90009-P
    Santos-Domínguez D, Torres-Guijarro S, Cardenal-López A, Pena-Gimenez A (2016) ShipsEar: An underwater vessel noise database. Appl Acoust. 113: 64–69. DOI: 10.1016/j.apacoust.2016.06.008
    Shang Z, Liu X, Liao X, Geng R, Gao M, Yun J (2019) Rolling bearing fault diagnosis method based on EEMD and GBDBN. Int. J. Performability Eng. 15(1): 230–240. DOI: 10.23940/ijpe.19.01.p23.230240
    Shaw J, Wang YH, Lin CY (2016) Sound and vibration analysis of a marine diesel engine via reverse engineering. J Mar Sci Technol. 26(5): 1–9. https://doi.org/10.6119/JMST.201810_26(5).0009
    Singh G, Kaur G, Kumar V (2014) ECG denoising using adaptive selection of IMFs through EMD and EEMD. International Conference on Data Science & Engineering, 228–231. DOI: 10.1109/ICDSE.2014.6974643
    Sun Z, Xi X, Yuan C, Yang Y, Hua X (2020) Surface electromyography signal denoising via EEMD and improved wavelet thresholds. Math Biosci Eng. 17(6): 6945–6962. DOI: 10.3934/mbe.2020359
    Tucker JD, Azimi-Sadjadi MR (2011) Coherence-based underwater target detection from multiple disparatesonar platforms. IEEE J. Ocean Eng. 36(1): 37–51. DOI: 10.1109/JOE.2010.2094230
    Wang LB, Zhang XD, Wang XL (2013) Chaotic signal denoising method based on independent component analysis and empirical mode decomposition. Acta Phys. Sin. 62(5): 050201. https://doi.org/10.7498/aps.62.050201
    Wang X, Xu J, Zhao Y (2018) Wavelet based denoising for the estimation of the state of charge for lithium-ion batteries. Energies 11(5): 1144. https://doi.org/10.3390/en11051144
    Wess LG, Dixon TL (1997) Wavelet-based denoising of underwater acoustic signals. J. Acoust. Soc. Am. 101(1): 377–383. https://doi.org/10.1121/1.417983
    Wu H, Huang NE (2011) Ensemble empirical mode decomposition: a noise-assisted data analysis method. Adv. Adapt Data Analysis 1: 1–41. https://doi.org/10.1142/S1793536909000047
    Wu Z, Huang NE (2010) On the filtering properties of the empirical mode decomposition. Adv. Adapt Data Analysis 2(4): 397–414. https://doi.org/10.1142/S1793536910000604
    Xiong Q, Xu Y, Peng Y, Zhang W, Li Y, Tang L (2017) Low-speed rolling bearing fault diagnosis based on EMD denoising and parameter estimate with alpha stable distribution. J. Mech. Sci. Technol. 31(4): 1587–1601. https://doi.org/10.1007/s12206-017-0306-y
    Xue W, Dai X, Zhu J, Luo Y (2019) A noise suppression method of ground penetrating radar based on EEMD and permutation entropy. IEEE Geosci. Remote. Sens. Lett. 16(10): 1625–1639. DOI: 10.1109/LGRS.2019.2902123
    Yang G, Liu Y, Wang Y, Zhu Z (2015) EMD interval thresholding denoising based on similarity measure to select relevant modes. Signal Process 109: 95–109. https://doi.org/10.1016/j.sigpro.2014.10.038
    Yang H, Ning T, Zhang B, Yin X, Gao Z (2017) An adaptive denoising fault feature extraction method based on ensemble empirical mode decomposition and the correlation coefficient. Advances in Mechanical Engineering 9(4): 1–9. DOI: 10.1177/1687814017696448
    Yue GD, Cui XS, Zou YY, Bai XT, Wu YH, Shi HT (2019) A Bayesian wavelet packet denoising criterion for mechanical signal with non-Gaussian characteristic. Measurement 138: 702–712. https://doi.org/10.1016/j.measurement.2019.02.066
    Zanin M, Gómez-Andrés D, Pulido-Valdeolivas I, Martín-Gonzalo JA, López-López J, Pascual-Pascual SI, Rausell E (2018) Characterizing normal and pathological gait through permutation entropy. Entropy 20(1): 77. https://doi.org/10.3390/e20010077
    Zhang W, Zheng X, Yang R, Han J (2017) Research on identification technology of ship radiated noise and marine biological noise. IEEE International Conference on Signal Processing, Communications and Computing 3: 3142–5386. DOI: 10.1109/ICSPCC.2017.8242384
    Zhang XL, Cao LY, Chen Y, Jia RS, Lu XM (2022) Microseismic signal denoising by combining variational mode decomposition with permutation entropy. Applied Geophysics 19(1): 65–80. https://doi.org/10.1007/s11770-022-0926-6
    Zhao XJ (2015) Research on the correlation and complexity of time series. PhD thesis, Beijing Jiaotong University, Beijing. DOI: 10.7666/d.Y2917148
    Zhao Z, Liu J, Wang S (2011) Denoising ECG signal based on ensemble empirical mode decomposition. International Symposium on Signal Processing and Information Technology 23: 170–177. DOI: 10.1109/ISSPIT.2017.8388665
WeChat click to enlarge
Figures(18)  /  Tables(4)
Publishing history
  • Received:  20 October 2022
  • Accepted:  30 May 2023

目录

    /

    Return
    Return