ACTIVE LEARNING OF PAIR-WISE CONSTRAINTS IN SEMI-SUPERVISED CLUSTERING
JIANG Weijin1, XU Yuhui2, WANG Xin2
Author information+
1. School of Computer and information engineering, Hunan University of Commerce, Changsha 410205, School of computer science and technology, Wuhan university of technology, Wuhan 430070; 2.College of Information System and Management, National University of Defense Technology, Changsha 410073
Semi-suppervised learning is an important research direction in the field of machine learning in recent years. The performance of semi-supervised lustering depends greatly on the quality of supervision information, so it is necessary to actively learn high quality supervision information. An active learning algorithm based on pair-wise constraints with error correction is proposed in this paper. The algorithm searches the pair-wise constraints information which clustering algorithm cannot find, and leads them into the spectral clustering algorithm. Utilizing suppervised information adjusts the distance matrix between points
in the spectral clustering, and sorts the distances by the two-way search method. The algorithm makes the learninger can study actively even the learinger receives the data without flags, and gets better clustering result with less constraints. Meanwhile, the algorithm reduces the computational complexity of the semi-supervised algorithms based on constraints and resolves the singular problem of the pair-wise constraints in the clustering process. Experimental
results on UCI benchmark data sets and artificial data set show that clearly the performance of the algorithm is better than that of other compared algorithms, and than that of the spectral clustering which randomly selects the supervision information.
JIANG Weijin, XU Yuhui, WANG Xin.
ACTIVE LEARNING OF PAIR-WISE CONSTRAINTS IN SEMI-SUPERVISED CLUSTERING. Journal of Systems Science and Mathematical Sciences, 2013, 33(6): 708-723 https://doi.org/10.12341/jssms12124