Detection of Adjective Compound Word in Malay Language using Enhanced Syntactic Rules

Compound word is defined as combination two or more words and it will produce a new meaning. Generally, compound word is existed in many languages such as English, Mandarin, Arabic and others. Although, there are discussion of existing methods to detect compound word yet some limitations on detecti...

Full description

Saved in:
Bibliographic Details
Format: article
Language:EN
Published: Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA Perlis 2021
Subjects:
T
Online Access:https://doaj.org/article/99534a36a7f04e9daee5524e3536fc52
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Compound word is defined as combination two or more words and it will produce a new meaning. Generally, compound word is existed in many languages such as English, Mandarin, Arabic and others. Although, there are discussion of existing methods to detect compound word yet some limitations on detecting Malay compound word. Thus, this study is done to improve accuracy towards adjective compound words. Training data is used in this study was Malay story books. Digitization data of Malay story book is used in this study. Then, the pre-processing method involved tokenization, stemming, bi-gram and part-of-speech (POS) tagging has been applied to produce the candidate compound word. Applying the enhanced syntactic rules shown the precision result is 70.3% through this study. Thus, this study will contribute to the academic research in improvise the issues on searching and document summarization application.