A New Method for Denoising Underwater Acoustic Signals Based on EEMD, Correlation Coefficient, Permutation Entropy, and Wavelet Threshold Denoising
https://doi.org/10.1007/s11804-024-00386-6
-
Abstract
The complexities of the marine environment and the unique characteristics of underwater channels pose challenges in obtaining reliable signals underwater, necessitating the filtration of underwater acoustic noise. Herein, an underwater acoustic signal denoising method based on ensemble empirical mode decomposition (EEMD), correlation coefficient (CC), permutation entropy (PE), and wavelet threshold denoising (WTD) is proposed. Furthermore, simulation experiments are conducted using simulated and real underwater acoustic data. The experimental results reveal that the proposed denoising method outperforms other previous methods in terms of signal-to-noise ratio, root mean square error, and CC. The proposed method eliminates noise and retains valuable information in the signal.Article Highlights● This study mitigates partial loss in traditional denoising by combining denoising with PE and WTD, leveraging EEMD.● PE segregates noise into pure and noise-dominant components, preserving effective signals through targeted denoising of the noise-dominant component.● The method efficiently removes noise, eliminating local prominent peaks in the signal for a smoother and refined final output. -
1 Introduction
The Earth's ocean area significantly exceeds its land area, and the ocean contains abundant resources. Therefore, countries worldwide have increasingly focused on ocean development and protection. Owing to the complexity of ocean background noises and the non-linear, non-Gaussian, and non-stationary properties of underwater acoustic signals, the detection and mitigation of noise in underwater acoustic signals are vital (Tucker et al., 2016; Li et al., 2018b). Therefore, the research and application of underwater acoustic signal denoising methods are of great significance in the underwater acoustic field.
Matched filters have been widely used in the past (Islam and Chong, 2014). These filters effectively utilize the differences between signals and noise distributions and use specific filters that match the frequency characteristics of both signals to achieve noise reduction. However, the use of matched filters for denoising non-stationary signals has limitations. Moreover, wavelet-based methods are used for denoising signals, but these methods require pre-selection of basis functions, and their effectiveness is not ideal as the frequency bands of useful signals and noise overlap (Yue et al., 2019; Wess and Dixon, 1997). Huang et al. (1998) proposed a new signal decomposition method named empirical mode decomposition (EMD). EMD can decompose a signal into high-to-low components and serve as an adaptive signal processing technique suitable for non-linear and non-stationary processes. This method was significantly used for signal denoising (Cui and Chen, 2015; Lu et al., 2016). However, EMD is susceptible to mode mixing, which leads to signal loss and affects signal processing. To solve this issue, Wu and Huang (2011) improved EMD and proposed a modified signal decomposition method named ensemble empirical mode decomposition (EEMD). EEMD incorporates the properties of EMD, adaptively processes non-linear and non-stationary signals, and effectively mitigates mode mixing.
Recently, numerous denoising methods based on EEMD and its improved algorithms have been widely used in different fields. Zhao et al. (2011) used the EEMD algorithm for denoising Electrocardiogram (ECG) in the medical field. Sun et al. (2020) introduced a new surface electromyography (sEMG) denoising method based on EEMD and wavelet threshold to eliminate the random noise from sEMG signals. Yang et al. (2017) proposed a vibration signal denoising algorithm in the field of engineering vibration based on EEMD and correlation coefficients (CC). Moreover, Jia et al. (2021) introduced a novel method for denoising the vibration signal based on EEMD and gray theory. Xue et al. (2019) proposed a noise suppression method in the radar communication field based on EEMD and permutation entropy (PE) according to the characteristics of ground-penetrating radar signals. To effectively suppress noise in an atmospheric lidar return signal, Cheng et al. (2021) proposed a segmentation singular value decomposition (SVD) -lifting wavelet transform (LWT) denoising algorithm based on EEMD. Peng et al. (2021) proposed a signal denoising method in the field of seismic monitoring based on EEMD and multiscale principal component analysis. The method exhibited good prospects for processing microseismic waves.
However, no studies have been reported on underwater acoustic signal denoising using EEMD (Li et al., 2017b; Li et al., 2018a). Among the existing denoising algorithms, intrinsic mode functions (IMFs) are mainly classified into noise IMFs and real IMFs (Chen et al., 2012; Singh et al., 2014; Li et al., 2018). Generally, noise IMFs are frequently discarded during the denoising process, but this method often lacks accuracy.
Therefore, this study presents a hybrid algorithm for denoising underwater acoustic signals based on EEMD, CC, PE, and wavelet threshold denoising (WTD). Compared with existing denoising algorithms, the proposed algorithm categorizes IMFs into three parts and does not directly discard high-frequency noise signals but instead uses WTD for denoising. Thus, the proposed algorithm can effectively preserve features of the original signals.
2 Traditional underwater acoustic target signal denoising process
2.1 EMD
The basic concept of EMD methods involves the decomposition of the original signal into its IMFs (Ogundile et al., 2020). This process has two constraints:
1) The number of zero crossings and extrema must be either equal or differ by one.
2) At any point in the complete data, the mean value of the envelope defined by local minima and local maxima must be zero.
The specific decomposition steps are as follows (Xiong et al., 2017):
1) All local maxima and minima of the signal are determined, and the maxima and minima with a tertiary spline interpolation curve are connected to form the upper and lower envelopes of the signal yup(t) and ylow(t).
2) The average value curve of the upper and lower envelope is calculated as follows:
$$ m_1(t)=\frac{1}{2}\left[y_{\text {up }}(t)+y_{\text {low}}(t)\right] $$ (1) 3) x(t) is subtracted from m1(t) to obtain a new sequence h1(t) with the removal of low-frequency components:
$$ h_1(t)=x(t)-m_1(t) $$ (2) 4) Steps (1)–(3) are repeated, and then the signal is filtered m times to obtain IMF1(t) = m(t).
5) r(t) = x(t) − m1(t), the signal r(t) is further decomposed according to steps (1)–(4). As it becomes impossible to extract components that meet the IMF conditions from Residual component, we can obtain n components that satisfy IMF. Finally, we can obtain IMFn(t) = r(t).
2.2 EEMD
2.2.1 The basic principles of EEMD
When using the EEMD algorithm to process the signal, white Gaussian noise is added to compensate for any missing signal scale. After a sufficient number of additions, the final decomposition result is averaged. In this phase, the additional noise cancels each other out, leaving only the lasting and stable part that represents the signal. The summarized process of EEMD is expressed as follows (Jia et al., 2021):
1) A white noise with a specific amplitude A is added to the original signal.
2) EMD is performed on the signal with the added white noise, and K IMF components and one residual component are obtained.
3) Steps 1) and 2) are repeated with a specific number of trials. In each trial, K, the number of IMF, remains constant. Equation (3) shows the results of all trials.
$$ x(t)+n_1(t)=\sum\limits_{j=1}^k \mathrm{IMF}_{i j}(t)+r_i(t) $$ (3) where x(t) is the original signal; ni(t) is the added white noise of the ith trial; IMFij(t) is the jth IMF component of the ith trial; ri(t) is the residual component of the ith trial.
4) The ensemble mean of all trials is calculated. Corresponding formulas are expressed in Eq. (4) and (5). The final decomposition result of the original signal obtained via EEMD is shown in Eq. (6).
$$ \operatorname{IMF}_j(t)=\frac{1}{M} \sum\limits_{i=1}^M \operatorname{IMF}_{i j}(t) $$ (4) $$ r(t)=\frac{1}{M} \sum\limits_{i=1}^M r_i(t) $$ (5) $$x(t)=\sum\limits_{j=1}^k \mathrm{IMF}_j(t)+r(t) $$ (6) where M is the number of trials, and component of EEMD.
2.2.2 EEMD parameter selection
When introducing white noise to an original signal, the selection of the two vital parameters, amplitude (A) and ensemble number (M), becomes an urgent issue that needs to be addressed. The following relationship between A and M is established (Du et al., 2016):
$$ S=A / \sqrt{M} $$ (7) where S denotes the standard deviation of the error between the original signal and IMFs. Equation (7) reveals that the error is directly proportional to the A of the added white noise and inversely proportional to M.
When using the EEMD algorithm to process signals, if A is too small, the white noise may not be evenly distributed across the scales of each component, and the main frequency of the component may not be unique. Moreover, white noise cannot eliminate interruptions in signal decomposition and suppress mode mixing. If A is too large, noise interference may occur, and the added white noise cannot be eliminated during ensemble averaging. Therefore, this factor affects the final decomposition result. Because EEMD decomposition is more sensitive to noise, A is usually set to a relatively small value. Wu et al. (2010) analyzed large datasets and found that when A is set to 0.1–0.5, the maximum interference error caused by the added noise to the final decomposition result is ~1%.
As the same amount of A is added to the Gaussian white noise, a smaller value of M cannot eliminate the interference of the added white noise on the IMF component during the averaging process. As M continuously increases, the added white noise gradually reduces its impact on the decomposition result, thereby enhancing signal processing efficiency. However, at a larger M value, the computational load of signal decomposition increases, leading to a decrease in signal processing efficiency. Generally, an average of ~100 sets is considered the standard. Nevertheless, specific experiments are required for individual signals.
2.3 CC
After the original signal is decomposed via EEMD, k IMF components are obtained across k frequency bands. Among these components, high-frequency components mainly comprise noise signals, while low-frequency components mainly contain effective signals. Therefore, determining the critical point between noise components and effective signal components is vital (Shang et al., 2019).
According to the empirical energy of the IMF components, effective IMF components are selected for reconstruction to obtain the denoised signal, while the noise IMF components are discarded (Zhao et al., 2011). The corresponding threshold for this selection is determined according to the specific characteristics of each signal. Incorrect component selection can lead to erroneous results (Yang et al., 2015). Additionally, the correlation between each component and the original signal is calculated to distinguish noise components from effective signal components. Covariance is commonly used to quantify the degree of relationship and relevance between two random variables (Gong et al., 2022), as shown in Eq. (8):
$$ \operatorname{cov}(X, Y)=\mathrm{E}(X Y)-\mathrm{E}(X) \mathrm{E}(Y) $$ (8) However, the size of the covariance value is not an accurate measure of the degree of correlation between two random variables. The size of the covariance value is influenced by the dimensions of the two variables and is not suitable for direct comparison. To accurately measure the correlation degree between two random variables, CC is introduced to assess their linear correlation. Generally, the CC between the noise-dominant component and the original signal is smaller than that between the effective and original signals. Furthermore, the change in CC of the effective signal is more significant than that of the noise component. Figure 1 displays the critical point, and the turning point is visibly distinct.
$$ \mathrm{CC}(n, x)<\mathrm{CC}(s, x) $$ (9) The CC between each modal component and the original signal is calculated as follows:
$$ \mathrm{CC}\left[x(t), \mathrm{IMF}_i\right]=\frac{\sum\limits_{i=1}^N[x(t)-\bar{x}]\left[f_i(t)-\bar{f}_i\right]}{\left.\sqrt{\sum\limits_{i=1}^N[x(t)-\bar{x}]^2} \sqrt{\sum\limits_{i=1}^N\left[f_i(t)-\bar{f}_i\right.}\right]^2} $$ (10) $$ \bar{x}=\frac{1}{N} \sum\limits_{i=1}^N x(t) $$ (11) $$ \overline{f_i}=\frac{1}{N} \sum\limits_{i=1}^N f_i(t) $$ (12) where fi(t) is the ith IMF component, N is the number of sampling points, fi is the mean of N IMF components, and x is the sample mean of the original signal.
After calculating CCs, the turning point is identified as the point where the curvature value R of the correlation curve reaches its maximum (Shaw et al., 2016). The specific calculation formula is as follows:
$$ R=\frac{\mathrm{CC}^{\prime}}{\left(1+\mathrm{CC}^{\prime 2}\right)^{1.5}} $$ (13) $$ \mathrm{CC}^{\prime \prime}=\mathrm{CC}_{i+1}-2 \mathrm{CC}_i+\mathrm{CC}_{i-1} $$ (14) $$ \mathrm{CC}^{\prime}=\mathrm{CC}_i-\mathrm{CC}_{i-1} $$ (15) Therefore, the noise and signal components can be identified using the CC graph by locating the maximum curvature. This approach has the following advantages:
1) CC standardizes the correlations between two random variables by subtracting the covariance, thereby eliminating dimensional numerical influence (Zhao, 2015; Qiao, 2016).
2) CC is used to analyze the characteristic signals within long-term signal waveforms. The calculation of CC aids in the extraction of similar components (Li et al., 2017a).
3) A line chart is created based on the calculation of CC. This approach eliminates the need for setting artificial thresholds, thereby preventing the issue of setting different thresholds for different signals.
However, owing to the non-linearity, non-stationarity, and non-Gaussian properties of the underwater acoustic signal, the direct elimination of noise signals may result in the loss of some useful signals. Therefore, further classifying noise components into pure noise and noise-dominant components is vital.
2.4 PE
PE is a spatial complexity measurement used for analyzing one-dimensional time series and is characterized by its simple calculation and good noise resistance (Bandt and Pompe, 2002). The summarized process of PE is as follows (Zanin et al., 2018):
1) The time series X = {x1, x2, ⋯, xN} is reconstructed as follows:
$$ \left\{\begin{array}{l} \{x(1), x(1+\tau), \cdots, x(1+(m-1) \tau)\} \\ \;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\vdots \\ \{x(j), x(j+\tau), \cdots, x(j+(m-1) \tau)\} \\ \;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\vdots \\ \{x(K), x(K+\tau), \cdots, x(K+(m-1) \tau)\}(K=n-(m-1) \tau) \end{array}\right. $$ (16) where τ and m denote the time lag and embedding dimension.
2) Each row vector is rearranged in an ascending order:
$$ x\left(i+\left(j_1-1\right) \tau\right) \leqslant x\left(i+\left(j_2-1\right) \tau\right) \leqslant \cdots \leqslant x\left(i+\left(j_m-1\right) \tau\right) $$ (17) 3) A symbol sequence for each row vector is obtained as:
$$ S(g)=\left(j_1, j_2, \cdots, j_m\right)(g=1, 2, \cdots, l \text { and } l \leqslant m !) $$ (18) 4) PE is defined as:
$$ H_p(m)=-\sum\limits_{j=1}^l P_j \ln P_j $$ (19) where Pj is the probability of one symbol sequence.
5) Normalized PE is defined as:
$$ H_p=H_p(m) / \ln (m !) $$ (20) In Equation (20), the $H_p$ value reflects the randomness of the time series. A smaller $H_p$ value indicates a more regular time series, while a larger $H_p$ value signifies a more random time series. In this study, we set $m=3$ and $\tau=1$ based on the recommendation from a previous study (Bandt and Pompe, 2002). Zhang et al. (2022) indicated that at $H_p \geqslant 0.75$, the corresponding mode is considered a noise signal, and at $H_p<0.75$, the corresponding mode is considered an effective signal. Therefore, in this paper, we use PE to further identify the noise components obtained from the CC diagram as noise-dominant components and pure noise components.
2.5 WTD
Compared with traditional filtering methods, wavelet transform exhibits multi-resolution characteristics. A one-dimensional noisy time series can be expressed as follows (Wang et al., 2018):
$$ s(t)=y(t)+n(t) $$ (21) where y(t) denotes the original signal, n(t) is the noise signal, and s(t) is the noisy signal.
Assuming that n(t) is Gaussian white noise, y(t) is usually represented as a low-frequency signal in practical engineering applications. Therefore, the following methods are used to reduce noise. The specific steps are as follows:
1) A suitable wavelet basis function and decomposition level are selected to perform wavelet decomposition on the noisy signal.
2) Thresholding is applied to high-frequency coefficients through the selection of an appropriate threshold method at different decomposition scales.
3) The signal is reconstructed using low-frequency coefficients from wavelet decomposition and threshold high-frequency coefficients at different scales.
Wavelet thresholding with different thresholds usually involves three methods, namely denoising with a default threshold, denoising with a specified threshold, and denoising with a hard threshold. Among these methods, denoising with a specified threshold can be further classified into soft thresholding and hard thresholding. In this paper, a soft thresholding method is selected to eliminate the threshold.
3 Proposed noise reduction algorithms
3.1 Denoising algorithms for underwater acoustic signals
When using the EMD algorithm, EEMD algorithm, and the CC method for denoising signals, the direct removal of the noise IMF components may also eliminate hydroacoustic signal components, thereby affecting noise reduction. This study introduces PE and WTD based on the EEMD method to develop the proposed underwater acoustic signal denoising algorithm (Figure 2). The specific procedures are summarized as follows:
1) The underwater signal is decomposed via EEMD to produce numerous IMFs, including noise IMFs and real IMFs;
2) CCs between each IMF component and the original signal are calculated;
3) Noise and signal IMFs are classified based on the inflection points of the CC graph. The maximum curvature of the CC curve serves as the critical point distinguishing between noise and signal IMFs.
4) Signal IMFs are screened out, and the PEs of other IMFs are calculated.
5) Noise-dominant IMFs are identified based on their PEs. If the PE of IMF is > 0.75, it is considered a pure noise IMF; if the PE of IMF is < 0.75, it is classified as a noise-dominant IMF.
6) The noise-dominant IMFs are denoised via WTD. Wavelet soft-threshold denoising (WSTD) is used for enhancing the quality of noise-dominant IMFs, with the wavelet basis function and decomposition level set to db4 and 4, respectively.
7) The denoised signal is obtained through the reconstruction of the denoised noise-dominant IMFs and real IMFs.
3.2 Evaluation method for denoising effect
To effectively evaluate the denoising effect, numerous scholars have proposed some evaluation methods, including signal-to-noise ratio (SNR), root mean square error (RMSE), CC, Lyapunov exponent, and noise intensity. In this study, we selected the first three evaluation criteria for the assessment of the denoising effect (Wang et al., 2013; Rosenstein et al., 1993).
3.2.1 SNR
SNR represents an energy relationship between signal and noise. A higher SNR indicates that the signal contains a larger amount of useful information and a lesser noise. Therefore, the SNR serves as an intuitive method for evaluating the quality of the denoised signal by analyzing whether the SNR has improved. SNR is defined as follows:
$$ \mathrm{SNR}=10 \times \lg \left(\frac{\|x\|^2}{\|\hat{x}-x\|^2}\right) $$ (22) where $x, \hat{x}$, and $\left\|^*\right\|$ indicate the noise signal, denoised signal, and norm, respectively.
3.2.2 RMSE
RMSE quantifies the numerical difference between the denoised signal and the original signal. A smaller RMSE indicates a more effective noise reduction. RMSE is defined as follows:
$$ \text { RMSE }=\sqrt{\frac{\|\hat{x}-x\|^2}{\operatorname{length}(x)}} $$ (23) where length(*) represents the length of signals.
3.2.3 CC
CC represents the correlation degree of the two signals. A larger coefficient indicates a stronger correlation degree between two signals. The specific calculation formula is introduced in section 2.3.
4 Denoising of simulation signal
According to the non-linear and non-stationary properties of underwater acoustic signals, four types of simulation signals are selected for denoising, namely Blocks, Bumps, Doppler, and Heavysine. The sampling frequency and data length are set to 1 kHz and 1 024, respectively. Noisy signals with different SNRs are obtained through the addition of Gaussian white noise. Figure 3 shows the original signal and noisy signal with an SNR of 0 dB.
Taking the Heavysine signal as an example, Figure 3 shows the complete submersion of the Heavysine signal in noise. The following is the complete denoising process.
4.1 Signal decomposition
The noisy Heavysine signal is decomposed via EMD and EEMD methods (Figure 4). In both decomposition methods, the IMF1 represents the shortest oscillation period, typically corresponding to a noise component or high-frequency components.
4.2 Identifying noise and signal IMFs
The CC between each IMF component obtained through EMD and EEMD methods and the original signal is calculated (Figure 5).
Figure 5 reveals that the maximum curvature of CC curves occurs at the fourth-order position in EEMD and the sixth-order position in EMD. The first four IMFs can be classified as noise IMFs in EEMD, and the first six IMFs can be identified as noise IMFs in EMD.
4.3 Identifying noise-dominant IMFs
Noise-dominant IMFs can be identified based on the PEs of noise IMFs in EEMD. For the noisy Heavysine signal with an SNR of 0 dB, the PEs of noise IMFs are shown in Table 1. The PE of IMF4 is less than 0.75, indicating that IMF4 is the noise-dominant IMF in EEMD.
IMF1 IMF2 IMF3 IMF4 0.997 7 0.884 2 0.757 7 0.578 0 4.4 Denoising and reconstruction of noise-dominant IMFs
WSTD is applied to IMF4 using the wavelet basis function of db4 and a decomposition level of 4. The denoised Heavysine signal is obtained through the reconstruction of the denoised IMF4 and real IMFs. Figure 6 displays the denoising results. The denoising method that involves the use of wavelet soft-threshold method to directly eliminate signals is referred to as WSTD. Denoising methods that combine CC with EMD and EEMD are denoted as EMD–CC and EEMD–CC. The proposed denoising method is denoted as EEMD–CC–PE–WSTD.
Figure 6 indicates that the denoised signal obtained through EEMD–CC–PE–WSTD can effectively fit the source signal. Sharp spikes are effectively eliminated, leading to a smoother and cleaner signal with minimal distortion. Compared with the signal obtained through the other three methods, the denoised signal obtained through the proposed method retains more features of the original signal. Moreover, the baseline of the denoised signal closely corresponds to that of the source signal, indicating effective baseline correction.
To further verify the effectiveness of the proposed method, four types of signals with different SNRin are denoised through EMD–CC, EEMD–CC, WSTD, and EEMD–CC–PE–WSTD methods. Figures 7 and 8 show the denoising results.
On average, EEMD–CC–PE–WSTD achieves the highest denoising effect. Particularly, at a low SNRin, EEMD–CC–PE–WSTD features a significantly improved denoising performance. Additionally, EEMD–CC–PE–WSTD exhibits a higher SNR and lower RMSE than the other three denoising methods.
5 Denoising of measured ship signals
EMD–CC, EEMD–CC, WSTD, and EEMD–CC–PE–WSTD algorithms are used to denoise the ship sound signal from an open-source database to further verify the effectiveness of the proposed underwater acoustic denoising algorithm. Subsequently, SNR, RMSE, and CC are used to evaluate the denoising results.
5.1 Ship signal source
The ocean acoustic recordings used to train the classifier were obtained from various open-source databases. Ship noise data were collected from the ShipsEar (Santos-Domínguez et al., 2016). Most of these recordings were collected in or near the Port of Vigo using a hydrophone with a sampling rate of 52 734 Hz. Recordings from different types of vessels in or near the docks included fishing boats, ocean liners, ferries of various sizes, containers, tugs, pilot boats, yachts, and small sailboats. In this study, the audio recordings of passenger ships were selected for analysis to assess the performance of the denoising algorithm.
5.2 Signal preprocessing
To facilitate the subsequent denoising processing, we use a 0.6-s frame to crop the filtered target audio signal. To ensure the integrity of the signal during trimming, the overlapping duration between two adjacent frames of signals is set to 0.05 s. The signal-trimming process is shown in Figure 9.
The time-domain diagram of the processed ship signal is shown in Figure 10.
The time-domain diagram of the collected ship signal contains numerous peaks and a significant amount of noise, indicating the need for denoising (Figure 10).
5.3 Selection of EEMD parameters for the ship signal
Notably, the A of the added white noise and the number of trials M are the two key parameters in EEMD. To select suitable parameters, the SNR of the ship signal after EEMD processing is calculated by varying A and M (Table 2).
M (s) A (mV) 0.1 0.2 0.3 0.4 0.5 50 13.574 3 13.774 9 13.882 6 13.855 6 14.170 4 100 13.627 4 13.776 6 14.079 5 14.021 7 14.073 0 150 13.462 3 13.660 0 13.870 3 13.887 1 13.981 2 Under a constant number of trials, a noise amplitude of 0.3 mV results in a maximum SNR value, while SNR decreases with increasing noise amplitude (Table 2). This is due to the effect of the added white noise on the signal, which increases overall noise and affects the decomposition results. At a constant noise amplitude of 0.3 mV, the maximum SNR value is achieved when M = 100. As the number of trials increases, the corresponding SNR value gradually stabilizes, indicating that excessively increasing the number of trials displays minimal effects on the decomposition results.
In summary, the experiment indicates that the optimal values are A = 0.3 mV and M = 100.
5.4 Ship signal denoising method
5.4.1 EMD–CC method
The processing effect of the decomposition of the ship signals obtained via EMD is shown in Figure 11. The ship signal decomposes into eight natural modal components through EMD (Figure 11). The first three components exhibit significant noise interference and mode mixing in the spectrogram. Subsequently, the signal is reconstructed, and the CC between each component and the original ship signal is calculated (Figure 12).
Figure 12 shows that the maximum curvature of CC occurs at the third-order position. The first three eigenmode components are considered noise signals and will be discarded, while the remaining components are classified as effective signals for reconstruction.
5.4.2 EEMD–CC method
The ship signal is decomposed using the two EEMD parameters determined in Figure 13.
EEMD decomposes the ship signal into six IMF components and sequentially separates the original ship signal from high-to-low frequency. Each component, starting from IMF2, exhibits a single signal scale, indicating the effective suppression of the modal mixing phenomenon. Additionally, the CC between each component and the original ship signal is calculated, and the results are shown in Figure 14.
Figure 14 reveals the point of maximum curvature in the CC curve corresponds to the third-order position. The first three IMF components are considered noise signals and will be discarded, and the remaining components are classified as valid signals for the reconstruction process.
5.4.3 EEMD–CC–PE–WSTD method
From section 5.4.2, the first three IMF components are identified as noise components, and the PE value of each component is calculated, as shown in Table 3.
IMF1 IMF2 IMF3 0.765 8 0.533 1 0.994 9 The results revealed that the PE of IMF2 is less than 0.75. Thus, IMF2 can be considered the noise-dominant component and is denoised via WSTD. Additionally, the second IMF component is reconstructed along with the last three IMF components.
5.5 Comparison of different denoising results
Through the adjustment of the time axis, the time-domain diagram of the original signal is shown in Figure 15.
The time-domain diagrams of the signal obtained from different denoising methods are shown in Figure 16.
All four methods effectively reduce peaks and glitches in the signal via denoising (Figure 16). However, the signal denoised via the proposed method is smoother and retains more valuable information compared with other methods. Additionally, the proposed method significantly reduces the impurities and noise in the signal, making it more favorable for human hearing.
Furthermore, the short-time Fourier transform diagram of the original signal is shown in Figure 17.
The short-time Fourier transform diagrams of the signal obtained via different denoising methods are shown in Figure 18.
For ship noise, the signal frequency range is mainly concentrated below 1000 Hz (Zhang et al., 2017). Figures 17 and 18 show the wide frequency distribution of the original signal, particularly with various frequency components present above 1 000 Hz. However, the frequency of the denoised signal is highly concentrated, mainly below 5 000 Hz, and the background noise is significantly reduced. Furthermore, the algorithm proposed in this paper exhibits a more significant denoising effect on the signal, with the signal mainly concentrated below 1 000 Hz.
5.6 Denoising effect evaluation
The following three parameters are used to evaluate the denoising effect of actual underwater acoustic signals (Table 4).
Parameter original signal WSTD EMD–CC EEMD–CC EEMD–CC–PE–WSTD SNR (dB) 8.956 4 13.053 4 12.351 9 18.079 5 22.030 0 RMSE - 0.068 5 0.057 1 0.023 6 0.017 4 CC 1 0.912 5 0.903 3 0.980 4 0.990 1 Table 4 indicates that the SNR of the reconstructed signal is improved using WSTD, EMD, and EEMD algorithms. Moreover, the SNR of the signal reconstructed through the improved EEMD method is 14 dB higher than that of the original signal. The RMSE and CC results revealed that signals processed through the improved EEMD exhibit lower RMSE and higher CC than the original signal. Therefore, the EEMD–CC–PE–WSTD algorithm exhibits an improved denoising effect and is effective and suitable for denoising underwater acoustic signals.
6 Conclusions
To reduce the loss of effective signals during the denoising process of underwater acoustic signals, a noise reduction method based on EEMD, which incorporates CC, PE, and WSTD is proposed. The key innovations and conclusions of the proposed denoising method are as follows:
1) The adaptive decomposition algorithm, EEMD, is used to denoise underwater acoustic data signal. This process decomposes original data into a set of IMF components with different scales.
2) Compared with existing denoising methods, CC and PE classify IMFs into three parts (pure noise IMFs, noise-dominant IMFs, and real IMFs).
3) The curvature value of the CC curve is used to distinguish between the noise and real components of the underwater acoustic signal based on the characteristics of the signal without the need for an arbitrary threshold.
4) The proposed EEMD–CC–PE–WSTD method does not require human intervention. The calculation process is adaptive.
5) Four types of signals (Blocks, Bumps, Doppler, and Heavysine) with different SNRs are denoised through EMD–CC, EEMD–CC, EEMD–CC–PE–WSTD, and WSTD methods. The proposed denoising method exhibits higher performance, with lower RMSE and higher SNR.
6) Compared with other denoising methods, the EEMD–CC–PE–WSTD method is an effective method for denoising underwater acoustic signals. This study provides a solid foundation for the subsequent processing of ship-radiated noise signals, including prediction, detection, extraction, and classification.
Competing interest The authors have no competing interests to declare that are relevant to the content of this article. -
Table 1 PEs of noise IMFs for simulation signal
IMF1 IMF2 IMF3 IMF4 0.997 7 0.884 2 0.757 7 0.578 0 Table 2 SNR of signals processed via EEMD under different A and M
M (s) A (mV) 0.1 0.2 0.3 0.4 0.5 50 13.574 3 13.774 9 13.882 6 13.855 6 14.170 4 100 13.627 4 13.776 6 14.079 5 14.021 7 14.073 0 150 13.462 3 13.660 0 13.870 3 13.887 1 13.981 2 Table 3 PEs of noise IMFs for measured ship signals
IMF1 IMF2 IMF3 0.765 8 0.533 1 0.994 9 Table 4 Denoising results of the measured ship signal
Parameter original signal WSTD EMD–CC EEMD–CC EEMD–CC–PE–WSTD SNR (dB) 8.956 4 13.053 4 12.351 9 18.079 5 22.030 0 RMSE - 0.068 5 0.057 1 0.023 6 0.017 4 CC 1 0.912 5 0.903 3 0.980 4 0.990 1 -
Bandt C, Pompe B (2002) Permutation entropy: A natural complexity measure for time series. Phys. Rev. Lett. 88(17): 1–4. https://doi.org/10.1103/PhysRevLett.88.174102 Chen R, Tang B, Lu Z (2012) Ensemble empirical mode decomposition denoising method based on correlation coefficients for vibration signal of rotor system. J. Vib. Measur. Diagnosis. 32(4): 542–546 Cheng X, Mao J, Li J, Zhao H, Zhou C, Gong X, Rao Z (2021) An EEMD-SVD-LWT algorithm for denoising a lidar signal. Measurement 168(3): 108405. DOI: 10.1016/j.measurement.2020.108405 Cui B, Chen X (2015) Improved hybrid filter for fiber optic gyroscope signal denoising based on EMD and forward linear prediction. Sens. Actuator A Phys. 230: 150–155. https://doi.org/10.1016/j.sna.2015.04.021 Du S, Liu T, Huang D, Li GL (2016) An optimal ensemble empirical mode decomposition method for vibration signal decomposition. Int. J. Acoust. Vib. 139: 1–18. https://doi.org/10.1115/L4035480 Gong Y, Tong Li Yu Z, Zhang X (2022) Research on fault diagnosis method of rotating machinery misalignment based on Pearson correlation coefficient. New Technology & New Products of China (5): 48–50. DOI: 10.13612/j.cnki.cntp.2022.05.013 Huang N, Shen Z, Long SR, Wu MC, Shih HH, Zheng Q, Yen NC, Tung CC, Liu HH (1998) The empirical mode decomposition method and the Hilbert spectrum for non-stationary time series analysis. Proc. R. Soc. A Lond. 454: 903–995 https://doi.org/10.1098/rspa.1998.0193 Islam MS, Chong U (2014) Noise reduction of continuous wave radar and pulse radar using matched filter and wavelets. J Image Video Proc 2014: 43. https://doi.org/10.1186/1687-5281-2014-43 Jia Y, Li G, Dong X, He K (2021) A novel denoising method for vibration signal of hob spindle based on EEMD and grey theory. Measurement 169: 108490. DOI: 10.1016/j.measurement.2020.108490 Li Q, Qin B, Si W, Wang R (2017a) Estimation algorithm for adaptive threshold of hybrid particle swarm optimization wavelet and its application in partial discharge signals denoising. High Voltage Engineering 43(5): 1485–1492. https://doi.org/10.13336/j.1003-6520.hve.20170428013 Li YX, Li YA, Chen X, Yu J (2017b) Denoising and feature extraction algorithms using NPE combined with VMD and their applications in ship-radiated noise. Symmetry 9(11): 256. https://doi.org/10.3390/sym9110256 Li YX, Li YA, Chen X, Yu J (2018a) Research on ship-radiated noise denoising using secondary variational mode decomposition and correlation coefficient. Sensors 18(1): 48. https://doi.org/10.3390/s18010048 Li YX, Li YA, Chen X, Yu J, Yang H, Wang L (2018b) A new underwater acoustic signal denoising technique based on CEEMDAN, mutual information, permutation entropy, and wavelet threshold Denoising. Entropy 20(8): 563 https://doi.org/10.3390/e20080563 Lu W, Zhang L, Liang W, Yu X (2016) Research on a small-noise reduction method based on EMD and its application pipeline leakage detection. Loss Prev. Process Ind. 41: 282–293. https://doi.org/10.1016/j.jlp.2016.02.017 Ogundile OO, Usman AM, Versfeld J (2020) An empirical mode decomposition based hidden Markov model approach for detection of Bryde's whale pulse calls. J. Acoust. Soc. Am. 147(2): EL125–EL131. https://doi.org/10.1121/10.0000717 Peng K, Guo H, Shang X (2021) EEMD and multiscale PCA-based signal denoising method and its application to seismic P-phase arrival picking. Sensors 21(16): 5271. https://doi.org/10.3390/s21165271 Qiao J (2016) The estimation of technical efficiency based on double-lag stochastic Frontier model. Statistics & Information Forum 31(11): 44–48. DOI: 10.3969/j.issn.1007-3116.2016.11.008 Rosenstein MT, Collins JJ, Luca CJD (1993) A practical method for calculating largest Lyapunov exponents from small data sets. Physica D 65: 117–134. https://doi.org/10.1016/0167-2789(93)90009-P Santos-Domínguez D, Torres-Guijarro S, Cardenal-López A, Pena-Gimenez A (2016) ShipsEar: An underwater vessel noise database. Appl Acoust. 113: 64–69. DOI: 10.1016/j.apacoust.2016.06.008 Shang Z, Liu X, Liao X, Geng R, Gao M, Yun J (2019) Rolling bearing fault diagnosis method based on EEMD and GBDBN. Int. J. Performability Eng. 15(1): 230–240. DOI: 10.23940/ijpe.19.01.p23.230240 Shaw J, Wang YH, Lin CY (2016) Sound and vibration analysis of a marine diesel engine via reverse engineering. J Mar Sci Technol. 26(5): 1–9. https://doi.org/10.6119/JMST.201810_26(5).0009 Singh G, Kaur G, Kumar V (2014) ECG denoising using adaptive selection of IMFs through EMD and EEMD. International Conference on Data Science & Engineering, 228–231. DOI: 10.1109/ICDSE.2014.6974643 Sun Z, Xi X, Yuan C, Yang Y, Hua X (2020) Surface electromyography signal denoising via EEMD and improved wavelet thresholds. Math Biosci Eng. 17(6): 6945–6962. DOI: 10.3934/mbe.2020359 Tucker JD, Azimi-Sadjadi MR (2011) Coherence-based underwater target detection from multiple disparatesonar platforms. IEEE J. Ocean Eng. 36(1): 37–51. DOI: 10.1109/JOE.2010.2094230 Wang LB, Zhang XD, Wang XL (2013) Chaotic signal denoising method based on independent component analysis and empirical mode decomposition. Acta Phys. Sin. 62(5): 050201. https://doi.org/10.7498/aps.62.050201 Wang X, Xu J, Zhao Y (2018) Wavelet based denoising for the estimation of the state of charge for lithium-ion batteries. Energies 11(5): 1144. https://doi.org/10.3390/en11051144 Wess LG, Dixon TL (1997) Wavelet-based denoising of underwater acoustic signals. J. Acoust. Soc. Am. 101(1): 377–383. https://doi.org/10.1121/1.417983 Wu H, Huang NE (2011) Ensemble empirical mode decomposition: a noise-assisted data analysis method. Adv. Adapt Data Analysis 1: 1–41. https://doi.org/10.1142/S1793536909000047 Wu Z, Huang NE (2010) On the filtering properties of the empirical mode decomposition. Adv. Adapt Data Analysis 2(4): 397–414. https://doi.org/10.1142/S1793536910000604 Xiong Q, Xu Y, Peng Y, Zhang W, Li Y, Tang L (2017) Low-speed rolling bearing fault diagnosis based on EMD denoising and parameter estimate with alpha stable distribution. J. Mech. Sci. Technol. 31(4): 1587–1601. https://doi.org/10.1007/s12206-017-0306-y Xue W, Dai X, Zhu J, Luo Y (2019) A noise suppression method of ground penetrating radar based on EEMD and permutation entropy. IEEE Geosci. Remote. Sens. Lett. 16(10): 1625–1639. DOI: 10.1109/LGRS.2019.2902123 Yang G, Liu Y, Wang Y, Zhu Z (2015) EMD interval thresholding denoising based on similarity measure to select relevant modes. Signal Process 109: 95–109. https://doi.org/10.1016/j.sigpro.2014.10.038 Yang H, Ning T, Zhang B, Yin X, Gao Z (2017) An adaptive denoising fault feature extraction method based on ensemble empirical mode decomposition and the correlation coefficient. Advances in Mechanical Engineering 9(4): 1–9. DOI: 10.1177/1687814017696448 Yue GD, Cui XS, Zou YY, Bai XT, Wu YH, Shi HT (2019) A Bayesian wavelet packet denoising criterion for mechanical signal with non-Gaussian characteristic. Measurement 138: 702–712. https://doi.org/10.1016/j.measurement.2019.02.066 Zanin M, Gómez-Andrés D, Pulido-Valdeolivas I, Martín-Gonzalo JA, López-López J, Pascual-Pascual SI, Rausell E (2018) Characterizing normal and pathological gait through permutation entropy. Entropy 20(1): 77. https://doi.org/10.3390/e20010077 Zhang W, Zheng X, Yang R, Han J (2017) Research on identification technology of ship radiated noise and marine biological noise. IEEE International Conference on Signal Processing, Communications and Computing 3: 3142–5386. DOI: 10.1109/ICSPCC.2017.8242384 Zhang XL, Cao LY, Chen Y, Jia RS, Lu XM (2022) Microseismic signal denoising by combining variational mode decomposition with permutation entropy. Applied Geophysics 19(1): 65–80. https://doi.org/10.1007/s11770-022-0926-6 Zhao XJ (2015) Research on the correlation and complexity of time series. PhD thesis, Beijing Jiaotong University, Beijing. DOI: 10.7666/d.Y2917148 Zhao Z, Liu J, Wang S (2011) Denoising ECG signal based on ensemble empirical mode decomposition. International Symposium on Signal Processing and Information Technology 23: 170–177. DOI: 10.1109/ISSPIT.2017.8388665