Speech enhancement with stacked frames and deep neural network for VoIP applications - Details

author：

Liu, Jiantao (Liu, Jiantao.) ^[1] | Yang, Xiaoxiang (Yang, Xiaoxiang.) ^[2] | Zhu, Mingzhu (Zhu, Mingzhu.) ^[3] | He, Bingwei (He, Bingwei.) ^[4] (Scholars：何炳蔚)

Indexed by：

CPCI-S EI Scopus

Abstract：

Speech　enhancement　is　a　critical　part　of　variety　types　of　communication　systems　and　automatic　speech　recognition　(ASR)　applications.　In　this　study　we　propose　a　speech　enhancement　method　for　real　time　VoIP　applications　with　stacked　frames　and　deep　neural　network,　a　novel　data　preparation　approach　is　also　introduced.　In　contrast　to　many　states　of　art　learning-based　method,　we　focused　on　real-time　implement　in　VoIP　applications.　Experiments　were　conducted　on　speech　degraded　by　different　noise　types　and　SNR　levels　which　were　not　seen　in　the　training　stage　of　the　deep　neural　network　and　achieved　a　significant　improvement　on　PESQ.　Important　traditional　real-time　speech　enhancement　method　and　most　recent　states　of　art　learning-based　method　were　also　tested　and　compared　with　proposed　method.　The　results　show　that　proposed　method　effectively　improve　the　speech　intelligibility,　greatly　outperform　traditional　real-time　minimum-mean　square　error　(MMSE)　algorithm　and　real-time　learning-based　CNN　method　in　PESQ.　We　also　achieve　comparable　PESQ　in　comparison　with　most　recent　state　of　the　art　learning-based　method,　but　outperform　it　in　time　complexity.　Making　this　method　attractive　in　VoIP　communication　system　applications　which　is　high　demand　on　communication　latency.

Keyword：

artificial neural network deep learning deep neural network machine learning speech enhancement

Community：

[ 1 ] [Liu, Jiantao]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Fujian, Peoples R China
[ 2 ] [Yang, Xiaoxiang]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Fujian, Peoples R China
[ 3 ] [Zhu, Mingzhu]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Fujian, Peoples R China
[ 4 ] [He, Bingwei]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Fujian, Peoples R China
[ 5 ] [Yang, Xiaoxiang]Quanzhou Normal Univ, Quanzhou 362000, Fujian, Peoples R China

Reprint 's Address：

杨晓翔
[Yang, Xiaoxiang]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Fujian, Peoples R China;;[Yang, Xiaoxiang]Quanzhou Normal Univ, Quanzhou 362000, Fujian, Peoples R China

Email：

Show more details

Version：