• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Zou, JQ (Zou, JQ.) [1] | Chen, GL (Chen, GL.) [2] (Scholars:陈国龙) | Guo, WZ (Guo, WZ.) [3] (Scholars:郭文忠)

Indexed by:

CPCI-S

Abstract:

Real-world applications often require the classification of web documents under the situation of noisy data. Support vector machines (SVM) work well for classification applications because of their high generalization ability. But they are very sensitive to noisy training data, which can degrade their classification accuracy. This paper presents a new algorithm to deal with noisy training data, which combines support vector machines and K-nearest neighbor (KNN) method. Given a training set, it employs K-nearest neighbor method to remove noisy training examples. Then the remained examples are selected to train SVM classifiers for web categorization. Empirical results show that this new algorithm has strong tolerance of noise, and it can greatly reduce the influence of noisy data on the SVM classifier.

Keyword:

K-nearest neighbor(KNN) noise-tolerant support vector machines(SVM) web classification

Community:

  • [ 1 ] Fuzhou Univ, Inst Math & Comp Sci, Fuzhou 35002, Peoples R China

Reprint 's Address:

  • 邹加棋

    [Zou, JQ]Fuzhou Univ, Inst Math & Comp Sci, Fuzhou 35002, Peoples R China

Show more details

Related Keywords:

Related Article:

Source :

PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05)

Year: 2005

Page: 785-790

Language: English

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Online/Total:218/11249059
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1