Partitioned-based Fuzzy Clustering to Learn Documents' Triadic Similarity

Authors

  • Sonia Alouane Ksouri
  • Minyar Sassi Hidri Doctor
  • Kamel Barkaoui Professor

DOI:

https://doi.org/10.3991/ijes.v1i1.2932

Abstract


With the development of the Web and the high availability of storage spaces, more and more documents become accessible. For that reason, similarity learning suffers from a scalability problem in both memory use and computational time when a data set is large. This paper provides a fuzzy triadic similarity measure to calculate memberships in a context of document co-clustering. It allows computing simultaneously fuzzy co-similarity matrices between documents/sentences and sentences/words. Each one is built on the basis of the others. The proposed model is extended to tackle the problem of large data sets by a splitting architecture which deals with a new fuzzy triadic similarity to parallelize both memory use and computation on distributed computers. This architecture is based on fuzzy clustering for partitioning data sets into similar groups (or clusters) in order to create more coherent sub-sets.

Author Biographies

Sonia Alouane Ksouri

CEDRIC-CNAM Paris

Minyar Sassi Hidri, Doctor

LR-SITI Tunisia

Kamel Barkaoui, Professor

CEDRIC-CNAM Paris

Downloads

Published

2013-07-23

How to Cite

Ksouri, S. A., Hidri, M. S., & Barkaoui, K. (2013). Partitioned-based Fuzzy Clustering to Learn Documents’ Triadic Similarity. International Journal of Recent Contributions from Engineering, Science & IT (iJES), 1(1), pp. 5–12. https://doi.org/10.3991/ijes.v1i1.2932

Issue

Section

Papers