• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Xu, Saijuan (Xu, Saijuan.) [1] | Guo, Canyang (Guo, Canyang.) [2] | Zhu, Yuhan (Zhu, Yuhan.) [3] | Liu, Genggeng (Liu, Genggeng.) [4] | Xiong, Neal (Xiong, Neal.) [5]

Indexed by:

EI

Abstract:

Collecting and analyzing data from all devices to improve the efficiency of business processes is an important task of Industrial Internet of Things (IIoT). In the age of data explosion, extensive text data generated by the IIoT have given birth to a variety of text representation methods. The task of text representation is to convert the natural language to a form that computer can understand with retaining the original semantics. However, these methods are difficult to effectively extract the semantic features among words and distinguish polysemy in natural language. Combining the advantages of convolutional neural network (CNN) and variational autoencoder (VAE), this paper proposes an intelligent CNN-VAE text representation algorithm as an advanced learning method for social big data within next-generation IIoT, which help users identify the information collected by sensors and perform further processing. This method employs the convolution layer to capture the local features of the context and uses the variational technique to reconstruct feature space to make it conform to the normal distribution. In addition, the improved word2vec model based on topical word embedding (TWE) is utilized to add topical information to word vectors to distinguish polysemy. This paper takes the social big data as an example to illustrate the way of the proposed algorithm applied in the next-generation IIoT and utilizes Cnews dataset to verify the performance of proposed method with four evaluating metrics (i.e., recall, accuracy, precision, and F1-score). Experimental results indicate that the proposed method outperforms word2vec-avg and CNN-AE in K-nearest neighbor (KNN), random forest (RF), and support vector machine (SVM) classifiers and distinguishes polysemy effectively. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.

Keyword:

Big data Convolution Convolutional neural networks Embeddings Nearest neighbor search Normal distribution Semantics Support vector machines Variational techniques

Community:

  • [ 1 ] [Xu, Saijuan]College of Information Engineering, Fujian Business University, Lianpan Road No.2, Fujian, Fuzhou; 350506, China
  • [ 2 ] [Guo, Canyang]College of Computer and Data Science, Fuzhou University, Xueyuan Road No.2, Fujian, Fuzhou; 350116, China
  • [ 3 ] [Zhu, Yuhan]College of Computer and Data Science, Fuzhou University, Xueyuan Road No.2, Fujian, Fuzhou; 350116, China
  • [ 4 ] [Liu, Genggeng]College of Computer and Data Science, Fuzhou University, Xueyuan Road No.2, Fujian, Fuzhou; 350116, China
  • [ 5 ] [Xiong, Neal]Department of Computer, Mathematical and Physical Sciences, Sul Ross State University, 1404 East Highway 90, Alpine; TX; 79830, United States

Reprint 's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Journal of Supercomputing

ISSN: 0920-8542

Year: 2023

Issue: 11

Volume: 79

Page: 12266-12291

2 . 5

JCR@2023

2 . 5 0 0

JCR@2023

ESI HC Threshold:32

JCR Journal Grade:2

CAS Journal Grade:3

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count: 2

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 2

Affiliated Colleges:

Online/Total:47/10057971
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1