Text Analysis with Python: A Research Oriented Guide

Author(s): ​Mamta Mittal, ​Gopi Battineni, Bhimavarapu Usharani and Lalit Mohan Goyal

DOI: 10.2174/9789815049602122010007

Text Clustering in Python

Pp: 121-158 (38)

Buy Chapters
  • * (Excluding Mailing and Handling)

Text Analysis with Python: A Research Oriented Guide

Text Clustering in Python

Author(s): ​Mamta Mittal, ​Gopi Battineni, Bhimavarapu Usharani and Lalit Mohan Goyal

Pp: 121-158 (38)

DOI: 10.2174/9789815049602122010007

* (Excluding Mailing and Handling)

Abstract

In this chapter, we learn about clustering and how document and text clustering can be performed. This chapter explains the real-time applications of text clustering and the differences between soft and hard clustering types. The clustering algorithms, including KNN, hierarchical and Fuzzy clustering, were used . Fuzzy clustering or soft clustering types can add better value performance-wise than the other two clustering algorithms. Besides, we also presented how to conduct text clustering in python using unsupervised machine learning techniques. To explain this in detail, the IRIS dataset is considered famous in UCI Machine learning Repository and well presented with python script.


Keywords: Clustering, Fuzzy, IRIS dataset, KNN, Soft clustering.

Related Journals

Related Books