Public Article
-
verified
Malayalam Text Compression
ISSN: 2289 - 7615Publisher: author   
Malayalam Text Compression
Indexed in
Computer Science and Technology Section
ARTICLE-FACTOR
1.3
Article Basics Score: 2
Article Transparency Score: 3
Article Operation Score: 2
Article Articles Score: 2
Article Accessibility Score: 3
SUBMIT PAPER ASK QUESTION
International Category Code (ICC):
ICC-0202
Publisher: International Journal Of Information System And Engineerin..
International Journal Address (IAA):
IAA.ZONE/228954547615
eISSN
:
2289 - 7615
VALID
ISSN Validator
Abstract
In natural language processing and analysis, a very large number of problems remain unaddressed particularly in Malayalam computing. For instance, the informational analysis of Malayalam language text is itself not widely studied. Language studies of English, based on the concepts of information theory are quite well established, as evidenced by the success of text compression methods for English. However to the best of our knowledge, not a single attempt has been reported about Malayalam text compression even though the Unicode based Malayalam content is increasing in Malayalam blogs, Wikipedia and Websites. The general motivation behind every compression is the optimum use of resources such as data, space or transmission capacity. The availability of standard Unicode script and Google online language translation service in the internet triggers the use of Malayalam language. The statistics of Malayalam Wikipedia clearly indicates th...