A machine learning framework for automated text categorization

This dissertation describes a machine learning framework for the development of an automated text categorization system for real-life problems. Conference paper classification will be used as a case study of a life text categorization problem. Unlike documents in benchmark collections, text document...

Full description

Bibliographic Details
Main Author: Bong, Chih How
Format: Thesis
Language:English
English
Published: Faculty of Computer Science and Information Technology 2001
Subjects:
Online Access:http://ir.unimas.my/1697/
http://ir.unimas.my/1697/1/bong%2Bchih%2Bhow.pdf
http://ir.unimas.my/1697/7/2013-02-thBongCHfull.pdf
Description
Summary:This dissertation describes a machine learning framework for the development of an automated text categorization system for real-life problems. Conference paper classification will be used as a case study of a life text categorization problem. Unlike documents in benchmark collections, text documents such as conference papers tend to be rather heterogeneous having a rich structure with variable length documents where each category consists of a variable number of documents.