User talk:Priyapappachan/GSoC-spellchecker

From SMC Wiki
Revision as of 06:46, 28 April 2013 by സന്തോഷ് (talk | contribs) (Created page with "Hi, Would it be possible to do some more in depth study into the complexities of multi level suffix stripping or any kind of agglutination, inflection properties of Malayalam....")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

Hi, Would it be possible to do some more in depth study into the complexities of multi level suffix stripping or any kind of agglutination, inflection properties of Malayalam. The challenge is not only technical but also linguistics. The suffixes that you mentioned, cannot be created for Malayalam as easy. It need systematic classification of possible suffixes based on 7 prathyaya rules. That too cannot be done without some level automation. I would recommend you to do a thorough reading of the following materials

  1. http://thottingal.in/documents/MalayalamComputingChallenges.pdf
  2. http://thottingal.in/documents/PSomanathan_On_Malayalam.pdf
  3. http://thottingal.in/documents/rachana-malayalam-collation.pdf
  4. http://www.chintha.com/node/2967
  5. http://lists.smc.org.in/pipermail/discuss-smc.org.in/2013-March/015062.html

You may want to try one or two word formation with Hunspell to get a feel of the project --സന്തോഷ് (talk) 23:46, 27 April 2013 (PDT)