User talk:Priyapappachan/GSoC-spellchecker

From SMC Wiki
Revision as of 17:54, 30 April 2013 by Priyapappachan (talk | contribs)

Hi, Would it be possible to do some more in depth study into the complexities of multi level suffix stripping or any kind of agglutination, inflection properties of Malayalam. The challenge is not only technical but also linguistics. The suffixes that you mentioned, cannot be created for Malayalam as easy. It need systematic classification of possible suffixes based on 7 prathyaya rules. That too cannot be done without some level automation. I would recommend you to do a thorough reading of the following materials

  1. http://thottingal.in/documents/MalayalamComputingChallenges.pdf
  2. http://thottingal.in/documents/PSomanathan_On_Malayalam.pdf
  3. http://thottingal.in/documents/rachana-malayalam-collation.pdf
  4. http://www.chintha.com/node/2967
  5. http://lists.smc.org.in/pipermail/discuss-smc.org.in/2013-March/015062.html

You may want to try one or two word formation with Hunspell to get a feel of the project --സന്തോഷ് (talk) 23:46, 27 April 2013 (PDT)


Hi, I've updated my proposal.I request you to go through it.

Priyapappachan (talk)