• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Xu, Saijuan (Xu, Saijuan.) [1] | Guo, Canyang (Guo, Canyang.) [2] | Zhu, Yuhan (Zhu, Yuhan.) [3] | Liu, Genggeng (Liu, Genggeng.) [4] (Scholars:刘耿耿) | Xiong, Neal (Xiong, Neal.) [5]

Indexed by:

EI Scopus SCIE

Abstract:

Collecting and analyzing data from all devices to improve the efficiency of business processes is an important task of Industrial Internet of Things (IIoT). In the age of data explosion, extensive text data generated by the IIoT have given birth to a variety of text representation methods. The task of text representation is to convert the natural language to a form that computer can understand with retaining the original semantics. However, these methods are difficult to effectively extract the semantic features among words and distinguish polysemy in natural language. Combining the advantages of convolutional neural network (CNN) and variational autoencoder (VAE), this paper proposes an intelligent CNN-VAE text representation algorithm as an advanced learning method for social big data within next-generation IIoT, which help users identify the information collected by sensors and perform further processing. This method employs the convolution layer to capture the local features of the context and uses the variational technique to reconstruct feature space to make it conform to the normal distribution. In addition, the improved word2vec model based on topical word embedding (TWE) is utilized to add topical information to word vectors to distinguish polysemy. This paper takes the social big data as an example to illustrate the way of the proposed algorithm applied in the next-generation IIoT and utilizes Cnews dataset to verify the performance of proposed method with four evaluating metrics (i.e., recall, accuracy, precision, and F1-score). Experimental results indicate that the proposed method outperforms word2vec-avg and CNN-AE in K-nearest neighbor (KNN), random forest (RF), and support vector machine (SVM) classifiers and distinguishes polysemy effectively.

Keyword:

Convolutional neural network Feature extraction Text representation Topical word embedding Variational autoencoder

Community:

  • [ 1 ] [Xu, Saijuan]Fujian Business Univ, Coll Informat Engn, Lianpan Rd 2, Fuzhou 350506, Fujian, Peoples R China
  • [ 2 ] [Guo, Canyang]Fuzhou Univ, Coll Comp & Data Sci, Xueyuan Rd 2, Fuzhou 350116, Fujian, Peoples R China
  • [ 3 ] [Zhu, Yuhan]Fuzhou Univ, Coll Comp & Data Sci, Xueyuan Rd 2, Fuzhou 350116, Fujian, Peoples R China
  • [ 4 ] [Liu, Genggeng]Fuzhou Univ, Coll Comp & Data Sci, Xueyuan Rd 2, Fuzhou 350116, Fujian, Peoples R China
  • [ 5 ] [Xiong, Neal]Sul Ross State Univ, Dept Comp Math & Phys Sci, 1404 East Highway 90, Alpine, TX 79830 USA

Reprint 's Address:

  • [Liu, Genggeng]Fuzhou Univ, Coll Comp & Data Sci, Xueyuan Rd 2, Fuzhou 350116, Fujian, Peoples R China;;

Show more details

Version:

Related Keywords:

Source :

JOURNAL OF SUPERCOMPUTING

ISSN: 0920-8542

Year: 2023

Issue: 11

Volume: 79

Page: 12266-12291

2 . 5

JCR@2023

2 . 5 0 0

JCR@2023

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold:32

JCR Journal Grade:2

CAS Journal Grade:3

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 2

Affiliated Colleges:

Online/Total:156/10046409
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1