Multi View Clustering with Fuzzy and View Weighting

Antim Yadav; Prof. Shadab Ali

doi:10.17577/IJERTV8IS080019

Volume 08, Issue 08 (August 2019)

Multi View Clustering with Fuzzy and View Weighting

DOI : 10.17577/IJERTV8IS080019

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 114
Total Downloads : 55
Authors : Antim Yadav , Prof. Shadab Ali
Paper ID : IJERTV8IS080019
Volume & Issue : Volume 08, Issue 08 (August 2019)
Published (First Online): 10-09-2019
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Multi View Clustering with Fuzzy and View Weighting

Antim Yadav, Prof. Shadab Ali

Institute of Engineering and Technology, Computer Science Rajasthan University Jaipur

Alwar -301001, Rajasthan, India

Abstract:- Internet and its related applications like social networking, online shopping, virtual classrooms etc are growing and access to more and more people around the world at a fast pace. A lot of information is collected through clustering from this data. Ordinarily clustering performed over a single view of data is not sufficient to derive proper information. Recently, practice of combining different views of data through clustering has increased. Many research works prove that such method, called multi-view clustering, produces better clustering results. As the dimensionality of data increases, inclusion of all dimensions in clustering becomes time consuming. Moreover, the semantics of data. inherently gives preference to some attributes of data objects over others. This leads to feature selection as an essential step before clustering is performed. When multiple views are involved, a ranking system among the views can also benefit by producing results oriented towards what analyst desires. Hence, this dissertation focuses on designing a multi-view clustering method which involves feature selection and view weighing.

scheme..

Key-words: Fuzzy, clustering, Anamoly, Hybridness

INTRODUCTION:

The data as present in todays world of increased Internet usage does not have single coherent view. The interactions among users are of many varieties and lead to many views of a single dataset. Different views of webpages are a classic example. Data analysts hold that if information from multiple views is combined through clustering, it gives better results. Thus, multiview clustering has become active research topic.
The need to learn multi-viewed data is emerging with each passing day. This chapter surveys the various approaches towards multi-view learning and the various field multi- view learning is applicable. Also, some of the noteworthy research works headed in the direction of multi-view learning are briefly discussed. One more topic under recent research is the hybrid nature of data as shown in Fig.of the webpage example discussed above.(While the term hybrid may even refer to the incompleteness or missing data, the scope of this dissertation is to handle the different types of data to be clustered together.
PROPOSED CLUSTERING ALGORITHM PROPOSED CLUSTERING ALGORITHM

A fuzzy clustering method for multi-view datasets is proposed here. It involves weight learning for views and features both. The proposed algorithm is based on a hard-clustering method WMCFS of Xu et al [11].
An iterative algorithm is proposed here to perform clustering over multiple views of a dataset simultaneously. The information in different views is combined through a combined global objective function. The process involves fuzzy clustering at view level, view weight learning and feature weight adjustment. The learning phase involves the clustering phase as iterative structure, shown in Fig 2.2.

Fig 2.2 Iterative structure of the proposed algorithm
CONCLUSION

Modern real life applications, like face recognition, sentiment analysis, handwritten character recognition, webpage design analysis etc have an underlying data bank which though referring to same set of objects has different representations.
1. Summary of proposal : The various representations of the data are called views of data. Owing to high dimensionality of data, feature selection through a weighing schme has been incorporated. A ranking (weighting) system for the views is also included. The cluster output produced is fuzzy in the sense that a data object may belong to more than one cluster as indicated
  
  by the output participation matrix. Thus, a global objective function has been proposed which considers fuzzy clustering of multiple views of data with feature and view weights. The clustering process involves automatic adjust-and-update computations for the feature and view weights. View level clustering is similar to Fuzzy C-Means.
2. Experimental Validation: Fuzzy Multi-view Clustering captures the underlying information from data better than the hard clustering of multi-view data. Also, the rate of convergence is high due to Fuzzy C-Means performed at view level. The global objective function which combines all views in linearly weighted manner serves better than the plain objective function of Fuzzy C-Means of single view clustering. All these observations are validated through experiments on real-life data with known ground truths. The results prove the proposal to be better both in terms of convergence speed and clustering accuracy.
FUTURE SCOPE

With datasets having objects uniformly divided over all classes, Fuzzy C-Means does not produce significant improvement over any crisp clustering. Hence, the proposal can be adapted in future with a fuzzification process which gives better results on such balanced datasets too. Other fuzzy clustering approaches, besides Fuzzy C-Means, can be included. The proposed FMVC algorithm works entirely on numeric data. A variant to deal with mixed type of data can be useful.
REFERENCES

Y. M. Xu, C. D. Wang and J. H. Lai, Weighted Multi-view Clustering with Feature Selection, Pattern Recognition Letters, Vol. 53, pp. 25 35, 2016.
J. Kettenring, Canonical analysis of several sets of variables, Biometrika, Vol. 58, pp. 433-451.
M. White, Y. Yu, X. Zhang and D. Schuurmans, Convex multi-view subspace learning, Advances in Neural Information Processing Systems, Vol. 25, pp.1-9, 2012.
M. E. Celebi and H. A. Kingravi, Deterministic Initialization of the K-Means Algorithm using Hierarchical Clustering, International Journal of Pattern Recognition and Artificial Intelligence, Vol. 26, No. 7, pp.1250018-1250041, 2012.
R. Duwairi and Md. A. Rahmeh A novel approach for initializing the spherical K-means clustering algorithm in Simulation Modeling practice and Theory papers, Volume 54, May 2015, Pages 4963
X. Wang, B. Qian, J. Ye and I. Davidson, Multi-Objective Multi-View Spectral Clustering via Pareto Optimization, Proceedings of the 2013 SIAM International Conference on Data Mining, 2013.
Y. Wang, X. Lin, L. Wu, W. Zhang, Q. Zhang, and X. Huang, Robust Subspace Clustering for Multi-View Data by Exploiting Correlation Consensus,

IEEE Transactions On Image Processing, Vol. 24, No. 11, pp. 3939-3949, 2015.
A. Serra , D. Greco and R. Tagliaferri, Impact of different Metrics on Multi-View Clustering, Proceedings of the 2015 International Joint Conference on Neural Networks (IJCNN), pp. 1-8, 2015.
G. Tyagi, N. Patel and I. Sethi, Soft-Hard Clustering for Multiview Data, Proceedings of the 2015 IEEE 16th International Conference on Information Reuse and Integration, pp. 464-469, 2015.
Y. M. Xu, C. D. Wang and J. H. Lai, Weighted Multi-view Clustering with Feature Selection, Pattern Recognition Letters, Vol. 53, pp. 25 35, 2016.

Multi View Clustering with Fuzzy and View Weighting

Leave a Reply