Semi-supervised text categorization with only a few positive and unlabeled documents - Details

author：

Lu, Fang (Lu, Fang.) ^[1] | Bai, Qingyuan (Bai, Qingyuan.) ^[2]

Indexed by：

Abstract：

This　paper　studies　a　special　case　of　semi-supervised　text　categorization.　We　want　to　build　a　text　classifier　with　only　a　set　P　of　labeled　positive　documents　from　one　class　(called　positive　class)　and　a　set　U　of　a　large　number　of　unlabeled　documents　from　both　positive　class　and　other　diverse　classes　(called　negative　class).　This　kind　of　semi-supervised　text　classification　is　called　positive　and　unlabeled　learning　(PU-Learning).　Although　there　are　some　effective　methods　for　PU-Learning,　they　do　not　perform　very　well　when　the　labeled　positive　documents　are　very　few.　In　this　paper,　we　propose　a　refined　method　to　do　the　PU-Learning　with　the　known　technique　combining　Rocchio　and　K-means　algorithm.　Considering　the　set　P　may　be　very　small　(≤5%),　not　only　we　extract　more　reliable　negative　documents　from　U　but　also　enlarge　the　size　of　P　with　extracting　some　most　reliable　positive　documents　from　U.　Our　experimental　results　show　that　the　refined　method　can　perform　better　when　the　set　P　is　very　small.　©2010　IEEE.

Keyword：

Biomedical engineering Classification (of information) K-means clustering Machine learning Supervised learning Text processing

Community：

[ 1 ] [Lu, Fang]College of Mathematics and Computer Science, Fuzhou University, Fuzhou, China
[ 2 ] [Bai, Qingyuan]College of Mathematics and Computer Science, Fuzhou University, Fuzhou, China

Reprint 's Address：

Email：

Show more details

Related Keywords：

Determining AR order for BCI based on motor imagery
2015，8th International Conference on BioMedical Engineering and Informatics, BMEI 2015
A real-time distributed computing mechanism for P300 speller BCI
2017，10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, CISP-BMEI 2017
Unsupervised deep feature representation using adversarial auto-encoder
2019，2019 IEEE International Conference on Industrial Cyber Physical Systems, ICPS 2019
Analysis on Functional Demand of Mobile-Health APP for Elders
2020，2nd IEEE Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability 2020, ECBIOS 2020
Functional Deployment of Drone Logistics
2020，2nd IEEE Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability 2020, ECBIOS 2020

Source ：

Year： 2010

Volume： 7

Page： 3075-3079

Language： English

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count： 10

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to