SVM categorizer: a generic categorization tool using support vector machines

Kapoutsis, E, Theodoulidis, B and Saraee, MH (ORCID: https://orcid.org/0000-0002-3283-1912) 2004, SVM categorizer: a generic categorization tool using support vector machines, in: IC-AI 2004, 21-24 June 2004, Las Vegas, USA.

Abstract

Supervised text categorisation is a significant tool given the vast amount of structured, unstructured, and semi-structured text available from internal and external enterprise resources. Its goal is to assign text documents to a finite set of pre-specified categories so that information coming from these resources can be extracted and organised automatically. This paper proposes the implementation of a generic application, SVM Categorizer, which uses the Support Vector Machines algorithm with an innovative statistical adjustment that improves its performance. The algorithm learns from a pre-categorised document corpus and is tested on a separate uncategorized corpus based on a business intelligence case study. The paper discusses the requirements, design and implementation, and describes every aspect of the application that will be developed. The final output of the SVM Categorizer is evaluated using commonly accepted metrics to measure its performance and contrast it with other classification tools.
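
The abstract does not name the evaluation metrics used. As an illustration only, the sketch below assumes per-category precision, recall and F1, which are widely used in text categorisation, and assumes each document receives exactly one category. The function name `per_category_scores` and the sample labels are hypothetical and are not taken from the paper.

```python
from collections import Counter


def per_category_scores(gold, predicted):
    """Return {category: (precision, recall, f1)} for single-label predictions.

    Assumption: every document has exactly one gold and one predicted category.
    """
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1      # correct assignment
        else:
            fp[p] += 1      # predicted category was wrongly assigned
            fn[g] += 1      # gold category was missed

    scores = {}
    for cat in sorted(set(gold) | set(predicted)):
        assigned = tp[cat] + fp[cat]
        relevant = tp[cat] + fn[cat]
        precision = tp[cat] / assigned if assigned else 0.0
        recall = tp[cat] / relevant if relevant else 0.0
        total = precision + recall
        f1 = 2 * precision * recall / total if total else 0.0
        scores[cat] = (precision, recall, f1)
    return scores


# Hypothetical labels, for illustration only.
gold = ["finance", "finance", "sports", "sports", "sports"]
predicted = ["finance", "sports", "sports", "sports", "finance"]
scores = per_category_scores(gold, predicted)
for category, (p, r, f) in scores.items():
    print(f"{category}: precision={p:.2f} recall={r:.2f} f1={f:.2f}")
```

Reporting scores per category, rather than a single accuracy figure, makes it easier to compare classifiers when some categories contain far fewer documents than others.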

Item Type: Conference or Workshop Item (Paper)
Additional Information: ISBN: 1-932415-32-7
Themes: Media, Digital Technology and the Creative Economy
Schools: Schools > School of Computing, Science and Engineering > Salford Innovation Research Centre
Journal or Publication Title: Proceedings of the International Conference on Machine Learning; Models, Technologies and Applications
Publisher: CSREA Press
Refereed: Yes
Depositing User: Prof. Mo Saraee
Date Deposited: 02 Nov 2011 11:58
Last Modified: 16 Feb 2022 13:21
URI: https://usir.salford.ac.uk/id/eprint/18820
