Building Malayalam documents with Greenstone
Dear All,

Recent trials on Greenstone 2.81 revealed that Malayalam full-text documents (Word, PDF, etc.) using Unicode fonts can be built into digital library collections with browsing classifiers and word search in Malayalam. Greenstone thus emerges as probably the only digital library software capable of full-text search in Malayalam.

The result surprised many of us, even though the NCSI team, which worked on this aspect earlier, had told us about this possibility long ago. The complexity of the Malayalam language and the long delay in standardizing the keyboard had made many of us believe that handling Malayalam text would be a difficult task.

Now Malayalam search is a reality in Greenstone. This indicates that Greenstone has a larger scope from now onwards in local language computing and digitization.

Malayalam OCR is still under development and, according to the development team, is not expected to exceed 80 percent accuracy in the beginning.

Regards,
K Rajasekharan
Kerala Institute of Local Administration
Thrissur