EXPLORE PUBLICATIONS BY COUNTRIES


	EUROPE

	MIDDLE EAST

	ASIA

	AFRICA
.............................

	United States of America

	United Kingdom

	Canada

	Australia

	Italy

	France

	Brazil

	Germany

	Malaysia

	Turkey

	China

	Taiwan

	Japan

	Saudi Arabia

	Jordan

	Egypt

	United Arab Emirates

	India

	Nigeria

Similarity-Based Estimation for Document Summarization using Fuzzy Sets

Masrah Azrifah Azmi Murad, Trevor Martin

Pages - 1 - 12 | Revised - 15-12-2007 | Published - 30-12-2007

Published in International Journal of Computer Science and Security (IJCSS)

Volume - 1 Issue - 4 | Publication Date - December 2007 Table of Contents

MORE INFORMATION

References | Cited By (9) | Abstracting & Indexing

KEYWORDS

fuzzy sets, mass assignment, asymmetric word similarity, topic similarity, summarization

ABSTRACT

Information is increasing every day and thousands of documents are produced and made available in the Internet. The amount of information available in documents exceeds our capacity to read them. We need access to the right information without having to go through the whole document. Therefore, documents need to be compressed and produce an overview so that these documents can be utilized effectively. Thus, we propose a similarity model with topic similarity using fuzzy sets and probability theories to extract the most representative sentences. Sentences with high weights are extracted to form a summary. On average, our model (known as MySum) produces summaries that are 60% similar to the manually created summaries, while tf.isf algorithm produces summaries that are 30% similar. Two human summarizers, named P1 and P2, produce summaries that are 70% similar to each other using similar sets of documents obtained from TREC.

CITED BY (9)

1	Ahmed, W. A., & Shamsuddin, S. M. Integration of Least Recently Used Algorithm and Neuro-Fuzzy System into Client-side Web Caching.

2	S. Mansor , R. B. Din and A. Samsudin , “Analysis of Natural Language Steganography”, International Journal of Computer Science and Security (IJCSS), 3(2), pp. 113 – 125, 2009.

3	W. A. Ahmed and S. M. Shamsuddin , “Integration of Least Recently Used Algorithm and Neuro-Fuzzy System into Client-side Web Caching” , International Journal of Computer Science and Security (IJCSS), 3(1), pp. 1 – 15, 2009.

4	R. Ahmad and A. Khanum , “Document Topic Generation in Text Mining by Using Cluster Analysis with EROCK”, International Journal of Computer Science and Security (IJCSS), 4(2), pp. 176 – 182, 2010.

5	M. S. Binwahlan, N. Salim and L. Suanmalui, “Fuzzy Swarm Diversity Hybrid Model for Text Summarization”, Information Processing & Management, 46(5), pp. 571–588, 2010.

6	Andriansyah¹, F., Baizal, Z. A., & Kurniati, A. P.Analisis peningkatan kualitas peringkasan teks menggunakan metode fuzzy dan algoritma genetika.

7	Wenerstrom, B., Ragade, R., & Kantardzic, M. (2012).ReClose Fuzz: Improved Automatic Summary Generation using Fuzzy Sets. ICSIIT 2012, 8.

8	Kavila, S. D., & Radhika, Y. (2015).Extractive Text Summarization Using Modified Weighing and Sentence Symmetric Feature Methods.

9	Barve, S., Desai, S., & Sardinha, R. (2016).Query-Based Extractive Text Summarization for Sanskrit. In Proceedings of the 4th International Conference on Frontiers in Intelligent Computing: Theory and Applications (FICTA) 2015 (pp. 559-568). Springer India.

ABSTRACTING & INDEXING

1	Google Scholar

2	Academic Journals Database

3	ScientificCommons

4	iSEEK

5	ResearchGATE

6	Libsearch

7	Bielefeld Academic Search Engine (BASE)

8	Scribd

9	WorldCat

10	SlideShare

11	PDFCAST

12	PdfSR

REFERENCES

D. Lin. “Extracting Collocations from Text Corpora”. Workshop on Computational Terminology, Montreal, Canada, 1998

DUC. “Document Understanding Conferences”. http://duc.nist.gov, 2002

G. Salton and C. Buckley. “Term-weighting Approaches in Automatic Text Retrieval”. Information Processing and Management 24, pp 513-523, 1988. Reprinted in: Sparck Jones K. and Willet P. (eds). Readings in Information Retrieval, Morgan Kaufmann, pp 323-328, 1997

G.J. Klir and B. Yuan. “Fuzzy Sets and Fuzzy Logic - Theory and Applications”. Prentice- Hall, Inc., Englewood Cliffs, New Jersey, 1995

H. Luhn “The Automatic Creation of Literature Abstracts”. IBM Journal of Research and Development, 2(92):159 - 165, 1958

J. Larocca Neto, A.D. Santos, C.A.A. Kaestner, and A.A. Freitas. “Document Clustering and Text Summarization”. In Proceedings of the 4th Int. Conf. Practical Applications of Knowledge Discovery and Data Mining (PADD-2000), London: The Practical Application Company, pp 41---55, 2000b

J.F. Baldwin, J. Lawry, and T.P. Martin. “A Mass Assignment Theory of the Probability of Fuzzy Events”. Fuzzy Sets and Systems, (83), pp. 353-367, 1996

J.F. Baldwin, T.P. Martin and B.W. Pilsworth. “Fril - Fuzzy and Evidential Reasoning in Artificial Intelligence”. Research Studies Press Ltd, England, 1995

J.F. Baldwin. “Combining Evidences for Evidential Reasoning”. International Journal of Intelligent Systems, 6(6), pp. 569-616, 1991a

J.F. Baldwin. “Fuzzy and Probabilistic Uncertainties”. In Encyclopedia of AI, 2nd ed., S.C. Shapiro, Editor 1992, Wiley, New York, pp. 528-537, 1992

K. Sparck Jones. “Automatic Summarizing: Factors and Directions”. In I. Mani and M.T. Maybury, Editors, Advances in Automatic Text Summarization, Cambridge, MA: The MIT Press, pp 1-12, 1999

M. Amini and P. Gallinari. “The Use of Unlabeled Data to Improve Supervised Learning for Unsupervised for Text Summarization”. In SIGIR, Tampere, Finland, 2002

M.A. Azmi-Murad. “Fuzzy Text Mining for Intelligent Information Retrieval”. PhD Thesis, University of Bristol, April 2005

M.F. Porter. “An Algorithm for Suffix Stripping”. Program, 14(3):130-137, 1980

S. Yohei ‘‘Sentence Extraction by tf/idf and Position Weighting from Newspaper Articles (TSC-8)’’ NTCIR Workshop 3 Meeting TSC, pp 55-59, 2002

S.H. Lo, H. Meng, and W. Lam. “Automatic Bilingual Text Document Summarization”. In Proceedings of the Sixth World Multiconference on Systematic, Cybernetics and Informatics. Orlando, Florida, USA, 2002

Z. Harris. “Distributional Structure”. In: Katz, J. J. (ed.) The Philosophy of Linguistics. New York: Oxford University Press, pp. 26-47, 1985

MANUSCRIPT AUTHORS

Miss Masrah Azrifah Azmi Murad

- Malaysia

masrah@fsktm.upm.edu.my

Mr. Trevor Martin

- United Kingdom

CREATE AUTHOR ACCOUNT

LAUNCH YOUR SPECIAL ISSUE

View all special issues >>

PUBLICATION VIDEOS