• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Liu, Jiantao (Liu, Jiantao.) [1] | Yang, Xiaoxiang (Yang, Xiaoxiang.) [2] | Zhu, Mingzhu (Zhu, Mingzhu.) [3] | He, Bingwei (He, Bingwei.) [4] (Scholars:何炳蔚)

Indexed by:

CPCI-S EI Scopus

Abstract:

Speech enhancement is a critical part of variety types of communication systems and automatic speech recognition (ASR) applications. In this study we propose a speech enhancement method for real time VoIP applications with stacked frames and deep neural network, a novel data preparation approach is also introduced. In contrast to many states of art learning-based method, we focused on real-time implement in VoIP applications. Experiments were conducted on speech degraded by different noise types and SNR levels which were not seen in the training stage of the deep neural network and achieved a significant improvement on PESQ. Important traditional real-time speech enhancement method and most recent states of art learning-based method were also tested and compared with proposed method. The results show that proposed method effectively improve the speech intelligibility, greatly outperform traditional real-time minimum-mean square error (MMSE) algorithm and real-time learning-based CNN method in PESQ. We also achieve comparable PESQ in comparison with most recent state of the art learning-based method, but outperform it in time complexity. Making this method attractive in VoIP communication system applications which is high demand on communication latency.

Keyword:

artificial neural network deep learning deep neural network machine learning speech enhancement

Community:

  • [ 1 ] [Liu, Jiantao]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Fujian, Peoples R China
  • [ 2 ] [Yang, Xiaoxiang]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Fujian, Peoples R China
  • [ 3 ] [Zhu, Mingzhu]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Fujian, Peoples R China
  • [ 4 ] [He, Bingwei]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Fujian, Peoples R China
  • [ 5 ] [Yang, Xiaoxiang]Quanzhou Normal Univ, Quanzhou 362000, Fujian, Peoples R China

Reprint 's Address:

  • 杨晓翔

    [Yang, Xiaoxiang]Fuzhou Univ, Sch Mech Engn & Automat, Fuzhou 350108, Fujian, Peoples R China;;[Yang, Xiaoxiang]Quanzhou Normal Univ, Quanzhou 362000, Fujian, Peoples R China

Email:

Show more details

Related Keywords:

Source :

17TH INTERNATIONAL CONFERENCE ON OPTICAL COMMUNICATIONS AND NETWORKS (ICOCN2018)

ISSN: 0277-786X

Year: 2019

Volume: 11048

Language: English

Cited Count:

WoS CC Cited Count: 1

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 2

Online/Total:107/10043790
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1