• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Guo, Y. (Guo, Y..) [1] | Zhang, J. (Zhang, J..) [2] | Chen, X. (Chen, X..) [3]

Indexed by:

Scopus

Abstract:

Web pages contain a large amount of valuable information and resources, meanwhile may update at any time. However, the current Web-data extraction algorithms are generally targeted at specific web page structure. When web pages update, the problem which is caused by the changes of web pages may be encountered, leading to the inability to extract web page information or wrong information. In order to solve this problem, this paper proposes a new method to extract the feature values of each area in the web page through page rendering, and then combine the DOM tree structure of the page, semantic similarity and other information, so that it can still extract the target data correctly after the structure of the web page changes. © 2019 IEEE.

Keyword:

Adaptively; Similarity calculation; Web-data extraction

Community:

  • [ 1 ] [Guo, Y.]College of Mathematics and Computer Science, Fuzhou University, Fuzhou, China
  • [ 2 ] [Guo, Y.]Fujian Key Laboratory of Network Computing, Intelligent Information Processing, Fuzhou, China
  • [ 3 ] [Zhang, J.]College of Mathematics and Computer Science, Fuzhou University, Fuzhou, China
  • [ 4 ] [Zhang, J.]Fujian Key Laboratory of Network Computing, Intelligent Information Processing, Fuzhou, China
  • [ 5 ] [Chen, X.]College of Mathematics and Computer Science, Fuzhou University, Fuzhou, China
  • [ 6 ] [Chen, X.]Fujian Key Laboratory of Network Computing, Intelligent Information Processing, Fuzhou, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

SocialCom 2019

Year: 2019

Page: 1524-1525

Language: English

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Affiliated Colleges:

Online/Total:186/10348593
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1