The language of infinite words

I wrote this article for submission to a popular science writing competition by the Department of Science and Technology. Here is my research story, which did not win the award. The story is based on our research on the morphological richness of the Malayalam language. How many words are there in your language? How many of them do you know? Can you find all those words in a dictionary? [Read More]

Text Speech and Dialogue: TSD 2020

I presented a paper on Quantitative Analysis of the Morphological Complexity of Malayalam Language at the 23rd International Conference on Text, Speech and Dialogue: TSD 2020, Brno, Czech Republic, September 8–11, 2020. The year being 2020, the entire conference was held in remote participation mode. Conference proceedings and pre-recorded presentation videos were made available to the participants, and we discussed them in online Zoom sessions. It was a novel experience, and I am super excited about the feedback and ideas to work on that I received, even after the live sessions. [Read More]

Releasing Malayalam Speech Corpus

Originally published in the SMC Blog. SMC announces the release of the Malayalam Speech Corpus (MSC). It is a repository of curated speech samples collected using the MSC web application. Speech samples are selected on the criterion that they have at least 3 positive reviews. MSC is a project launched by SMC to crowdsource Malayalam speech samples from any contributor who can read out sentences and record them as speech samples. [Read More]

Phonetic description of Malayalam consonants

The orthography (system for writing a language) of Malayalam is considered phonemic in nature. It means the graphemes (written symbols) correspond to the phonemes (significant spoken sounds) of the language. But the correspondence between graphemes and phonemes is not precisely one-to-one. The pronunciation of a grapheme can depend on its position in a word (beginning, middle or end) and its proximity to other graphemes. Two years ago, I started working on a grapheme-to-phoneme conversion tool for Malayalam. [Read More]