A Relative Analysis on Machine Learning Approaches for Effective POS Tagging of Tamil Language

Year : 2015, Volume : 6, Issue : 2
First page : (118) Last page : (123)
Print ISSN : 2249-3212. Online ISSN : 0975-8089. Published online : 2015 1.
Article DOI : 10.5958/0975-8089.2015.00015.9

Sheshasaayee Ananthi¹,**, Angela Deepa V.R.²,*

¹Research Supervisor, PG and Research, Department of Computer Science, Quaid-E-Millath Government College for Women (Autonomous), Chennai, 600 002, Tamil Nadu, India

²Research Scholar, PG and Research, Department of Computer Science, Quaid-E-Millath Government College for Women (Autonomous), Chennai, 600 002, Tamil Nadu, India

*Corresponding author E-mail id: angelrajan.research@gmail.com

**ananthi.research@gmail.com

Abstract

Part-of-speech (POS) tagging is the process of assigning to each word in a document the tag that fits its meaning in a particular context. This process plays a key role in building effective natural language processing (NLP) applications. Morphological complexity and varying grammatical constructs have led to a variety of tagging approaches. For a highly agglutinative language like Tamil, the approaches used for POS tagging include rule-based, stochastic and transformation-based learning. This article deals with memory-based language processing (MBLP), a novel approach to NLP based on a symbolic machine learning method termed memory-based learning (MBL). Like a support vector machine (SVM), MBLP learns from examples, but it rests on the idea that language processing is guided by the direct reuse of memory traces of earlier language experiences rather than by rules extracted from such experiences. Through a comparative study of MBLP and SVM taggers used for languages such as Dutch and Malayalam, this article outlines the differences between the two approaches and what they suggest for a new way of building taggers for Tamil.
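
The idea of reusing stored language experiences directly, instead of extracting rules from them, can be illustrated with a small k-nearest-neighbour tagger. The sketch below is not the tagger studied in the article: the function names, the feature set (word, three-character suffix, previous tag), the unweighted overlap metric and the toy English corpus are all assumptions chosen for brevity.

```python
from collections import Counter

def features(words, i, prev_tag):
    """Describe position i by the word, its last 3 characters, and the previous tag."""
    w = words[i].lower()
    return (w, w[-3:], prev_tag)

def train(tagged_sentences):
    """'Training' only stores every instance in memory; no rules are extracted."""
    memory = []
    for sent in tagged_sentences:
        words = [w for w, _ in sent]
        prev = "<s>"  # sentence-start marker (hypothetical convention)
        for i, (_, tag_) in enumerate(sent):
            memory.append((features(words, i, prev), tag_))
            prev = tag_
    return memory

def overlap(a, b):
    """Similarity = number of feature positions on which two instances agree."""
    return sum(x == y for x, y in zip(a, b))

def tag(words, memory, k=3):
    """Tag left to right by majority vote among the k most similar stored instances."""
    tags, prev = [], "<s>"
    for i in range(len(words)):
        f = features(words, i, prev)
        nearest = sorted(memory, key=lambda m: overlap(f, m[0]), reverse=True)[:k]
        best = Counter(t for _, t in nearest).most_common(1)[0][0]
        tags.append(best)
        prev = best
    return tags

# Toy corpus, invented for illustration only.
TRAIN = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("a", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("runs", "VERB")],
]
memory = train(TRAIN)
print(tag(["the", "bird", "runs"], memory))  # "bird" is unseen in TRAIN
```

Here the unseen word "bird" is tagged purely by analogy: the only feature it shares with stored instances is the preceding tag, and the nearest neighbours that follow a determiner are all nouns. For an agglutinative language, suffix features of this kind would presumably carry more of the load, but that is a design assumption rather than a result from the article.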

Keywords

Annotated corpora, Machine learning, Parts-of-speech, Memory-based language processing (MBLP), Support vector machine (SVM), Tagging, Agglutinative.
