<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.smc.org.in/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Jaseem</id>
	<title>SMC Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.smc.org.in/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Jaseem"/>
	<link rel="alternate" type="text/html" href="https://wiki.smc.org.in/Special:Contributions/Jaseem"/>
	<updated>2026-05-25T01:11:10Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.40.1</generator>
	<entry>
		<id>https://wiki.smc.org.in/index.php?title=User:Jaseem&amp;diff=4711</id>
		<title>User:Jaseem</title>
		<link rel="alternate" type="text/html" href="https://wiki.smc.org.in/index.php?title=User:Jaseem&amp;diff=4711"/>
		<updated>2014-03-12T16:57:42Z</updated>

		<summary type="html">&lt;p&gt;Jaseem: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Personal Information==&lt;br /&gt;
*&#039;&#039;&#039;Email Address:&#039;&#039;&#039; jaseemumar@gmail.com&lt;br /&gt;
*&#039;&#039;&#039;Blog URL:&#039;&#039;&#039; http://jaseems.blogspot.com&lt;br /&gt;
*&#039;&#039;&#039;Freenode IRC nick:&#039;&#039;&#039; jaseem&lt;br /&gt;
*&#039;&#039;&#039;Current Education:&#039;&#039;&#039; 2nd Year BTech in Computer Science at Indian Institute of Technology, Bombay&lt;br /&gt;
*&#039;&#039;&#039;Why do you want to work with the Swathanthra Malayalam Computing?&#039;&#039;&#039;&lt;br /&gt;
As a Malayali, I can relate to the cause of developing Malayalam computing aids and am excited about it. I am also excited by the possibility of directly helping people who speak my language with what I have learned.&lt;br /&gt;
*&#039;&#039;&#039;Do you have any past involvement with the Swathanthra Malayalam Computing or another open source project as a contributor?&#039;&#039;&#039;&lt;br /&gt;
No&lt;br /&gt;
*&#039;&#039;&#039;Did you participate in past GSoC programs? If so, which years and which organizations?&#039;&#039;&#039;&lt;br /&gt;
No&lt;br /&gt;
*&#039;&#039;&#039;Do you have other obligations between May and August?&#039;&#039;&#039;&lt;br /&gt;
My college holidays run from May to mid-July, during which I don&#039;t have any obligations. I have to attend college during the last two weeks of July and in August; I plan to make up for this period by starting to code a bit earlier, during the community bonding period.&lt;br /&gt;
*&#039;&#039;&#039;Will you continue contributing to or supporting Swathanthra Malayalam Computing after the GSoC 2014 program? If yes, which area(s) are you interested in?&#039;&#039;&#039;&lt;br /&gt;
Yes, I am glad I found the organisation through GSoC and I am planning to actively contribute outside of the program.&lt;br /&gt;
*&#039;&#039;&#039;Why should we choose you over other applicants?&#039;&#039;&#039;&lt;br /&gt;
I have over 6 years of programming experience and am good at Python, the language of the existing spell-checker. I have direct access to the language resources (books and people) required for the project.&lt;br /&gt;
&lt;br /&gt;
==Proposal Description==&lt;br /&gt;
Please describe your proposal in detail.&lt;br /&gt;
&lt;br /&gt;
===Overview===&lt;br /&gt;
&lt;br /&gt;
===Implementation===&lt;br /&gt;
===Timeline===&lt;br /&gt;
&amp;lt;!--&lt;br /&gt;
&#039;&#039;&#039;NOTE&#039;&#039;&#039;:  Please do not verbatim copy text from the ideas page, or from other people&#039;s&lt;br /&gt;
discussions about your project, but rewrite it in your own words. If you&lt;br /&gt;
include any significant text or code from another source in your&lt;br /&gt;
application, it must be accompanied with a proper citation. All papers or&lt;br /&gt;
references that you use or plan to use must also be cited. Put all this in&lt;br /&gt;
a &amp;quot;References&amp;quot; section at the bottom of your application.&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Include&#039;&#039;&#039;:&lt;br /&gt;
*  An overview of your proposal&lt;br /&gt;
*  The need you believe it fulfills&lt;br /&gt;
*  Any relevant experience you have&lt;br /&gt;
*  How you intend to implement your proposal&lt;br /&gt;
*  A rough timeline for your progress with phases &lt;br /&gt;
*  Any other details you feel we should consider&lt;br /&gt;
*  Tell us about something you have created.&lt;br /&gt;
*  Have you communicated with a potential mentor? If so, who?&lt;br /&gt;
*  SMC Wiki link of your proposal--&amp;gt;&lt;/div&gt;</summary>
		<author><name>Jaseem</name></author>
	</entry>
	<entry>
		<id>https://wiki.smc.org.in/index.php?title=User:Jaseem/spellcheck&amp;diff=4707</id>
		<title>User:Jaseem/spellcheck</title>
		<link rel="alternate" type="text/html" href="https://wiki.smc.org.in/index.php?title=User:Jaseem/spellcheck&amp;diff=4707"/>
		<updated>2014-03-11T23:07:44Z</updated>

		<summary type="html">&lt;p&gt;Jaseem: /* Lttoolbox */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Malayalam Spell-checker =&lt;br /&gt;
== Problem==&lt;br /&gt;
English dictionaries &amp;quot;rely on complete lists of full word forms, a requirement that&lt;br /&gt;
cannot be met for morphologically complex languages&amp;quot; like Malayalam.&lt;br /&gt;
Theoretically, an unlimited number of words can agglutinate in Malayalam; in practice the number is generally fewer than 10. Handling agglutinations and inflections in a spell-checker can be challenging.&lt;br /&gt;
&lt;br /&gt;
Refer to http://thottingal.in/documents/MalayalamComputingChallenges.pdf&lt;br /&gt;
&lt;br /&gt;
=== Other Challenges ===&lt;br /&gt;
*Homophonic root words can have different inflections&lt;br /&gt;
*;മറക്കുക &amp;amp; മറയുക; പറയുക &amp;amp; പറക്കുക&lt;br /&gt;
*The same word can inflect differently in the same context (not common)&lt;br /&gt;
*; പോവുക, പോകുക&lt;br /&gt;
*Sandhi rules are complex.&lt;br /&gt;
&lt;br /&gt;
==Possible solutions==&lt;br /&gt;
===Hunspell===&lt;br /&gt;
Hunspell has an algorithm for figuring out agglutination. Need to figure out how to use it.&lt;br /&gt;
&lt;br /&gt;
===Implementation in other languages===&lt;br /&gt;
Spell Checking an Agglutinative Language: Quechua&lt;br /&gt;
http://www.zora.uzh.ch/52921/1/ltc-106-rios.pdf&lt;br /&gt;
Quechua doesn&#039;t seem to have the complexity of Malayalam sandhi. The automaton presented in the paper doesn&#039;t seem to work for Malayalam.&lt;br /&gt;
*;kachichasqa= kachi + cha +sqa&lt;br /&gt;
&lt;br /&gt;
http://www.cmpe.boun.edu.tr/~akin/papers/spelling_checking_in_Turkish.pdf&lt;br /&gt;
&lt;br /&gt;
http://arxiv.org/pdf/cmp-lg/9410004.pdf&lt;br /&gt;
&lt;br /&gt;
;Stemmer: For finding root words&lt;br /&gt;
http://www.ldcil.org/up/conferences/morph/presentations/Vijay%20[Compatibility%20Mode].pdf&lt;br /&gt;
http://www.cse.iitb.ac.in/~pb/papers/cicling12-stemming.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Lttoolbox===&lt;br /&gt;
Lttoolbox, from the Apertium package, can be used to tokenize and lemmatize compounds, agglutinations, and inflections.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;xml&amp;quot;&amp;gt;&amp;lt;dictionary&amp;gt;&lt;br /&gt;
  &amp;lt;alphabet&amp;gt;abcdefghijklmnopqrstuvwxyz&amp;lt;/alphabet&amp;gt;&lt;br /&gt;
  &amp;lt;sdefs&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;noun&amp;quot; /&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;s&amp;quot;/&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;pl&amp;quot;/&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;root&amp;quot;/&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;past&amp;quot;/&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;verb&amp;quot;/&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;compound-only-L&amp;quot;   c=&amp;quot;May only be the left-side of a compound&amp;quot;/&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;compound-R&amp;quot;    c=&amp;quot;May be the right-side of a compound, or a full word&amp;quot;/&amp;gt;&lt;br /&gt;
  &amp;lt;/sdefs&amp;gt;&lt;br /&gt;
  &lt;br /&gt;
&amp;lt;pardefs&amp;gt;&lt;br /&gt;
	 &lt;br /&gt;
	&amp;lt;pardef n=&amp;quot;poyi_v&amp;quot;&amp;gt;&lt;br /&gt;
		&amp;lt;e&amp;gt;&amp;lt;p&amp;gt;    &amp;lt;l/&amp;gt;        &amp;lt;r&amp;gt;&amp;lt;s n=&amp;quot;verb&amp;quot;/&amp;gt; &amp;lt;/r&amp;gt; &amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
		&amp;lt;e&amp;gt;&amp;lt;p&amp;gt;    &amp;lt;l&amp;gt;yi&amp;lt;/l&amp;gt;   &amp;lt;r&amp;gt;&amp;lt;s n=&amp;quot;verb&amp;quot;/&amp;gt;&amp;lt;s n=&amp;quot;past&amp;quot;/&amp;gt;&amp;lt;/r&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
		&amp;lt;e&amp;gt;&amp;lt;p&amp;gt;    &amp;lt;l&amp;gt;ya&amp;lt;/l&amp;gt;    &amp;lt;r&amp;gt;&amp;lt;s n=&amp;quot;compound-only-L&amp;quot;/&amp;gt;&amp;lt;/r&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
	&amp;lt;/pardef&amp;gt;&lt;br /&gt;
   &lt;br /&gt;
	&lt;br /&gt;
	&amp;lt;pardef n=&amp;quot;athu_n&amp;quot;&amp;gt;&lt;br /&gt;
		&amp;lt;e&amp;gt;&amp;lt;p&amp;gt;  &amp;lt;l&amp;gt;athu&amp;lt;/l&amp;gt;  &amp;lt;r&amp;gt;athu&amp;lt;s n=&amp;quot;noun&amp;quot;/&amp;gt;&amp;lt;/r&amp;gt; &amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
		&amp;lt;e&amp;gt;&amp;lt;p&amp;gt;    &amp;lt;l&amp;gt;athu&amp;lt;/l&amp;gt;    &amp;lt;r&amp;gt;athu&amp;lt;s n=&amp;quot;noun&amp;quot;/&amp;gt;&amp;lt;s n=&amp;quot;compound-only-L&amp;quot;/&amp;gt;&amp;lt;/r&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
		&amp;lt;e&amp;gt;&amp;lt;p&amp;gt;    &amp;lt;l&amp;gt;thu&amp;lt;/l&amp;gt;    &amp;lt;r&amp;gt;athu&amp;lt;s n=&amp;quot;noun&amp;quot;/&amp;gt;&amp;lt;s n=&amp;quot;compound-R&amp;quot;/&amp;gt;&amp;lt;s n=&amp;quot;compound-only-L&amp;quot;/&amp;gt;&amp;lt;/r&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
	&amp;lt;/pardef&amp;gt;&lt;br /&gt;
&lt;br /&gt;
	&amp;lt;pardef n=&amp;quot;kond&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;e&amp;gt;&amp;lt;p&amp;gt;&amp;lt;l/&amp;gt;&amp;lt;r&amp;gt;&amp;lt;s n=&amp;quot;noun&amp;quot;/&amp;gt;&amp;lt;s n=&amp;quot;compound-R&amp;quot;/&amp;gt;&amp;lt;/r&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
    &amp;lt;/pardef&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
	&lt;br /&gt;
&amp;lt;/pardefs&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;section id=&amp;quot;main&amp;quot; type=&amp;quot;standard&amp;quot;&amp;gt;&lt;br /&gt;
	&amp;lt;e lm=&amp;quot;povuka&amp;quot;&amp;gt;&lt;br /&gt;
	  &amp;lt;i&amp;gt;po&amp;lt;/i&amp;gt;&lt;br /&gt;
	  &amp;lt;par n=&amp;quot;poyi_v&amp;quot;/&amp;gt;&lt;br /&gt;
	&amp;lt;/e&amp;gt;&lt;br /&gt;
	&amp;lt;e lm=&amp;quot;athu&amp;quot;&amp;gt;&lt;br /&gt;
	  &amp;lt;i&amp;gt;&amp;lt;/i&amp;gt;&lt;br /&gt;
	  &amp;lt;par n=&amp;quot;athu_n&amp;quot;/&amp;gt;&lt;br /&gt;
	&amp;lt;/e&amp;gt;&lt;br /&gt;
	&amp;lt;e lm=&amp;quot;kond&amp;quot;&amp;gt;&lt;br /&gt;
	  &amp;lt;i&amp;gt;kond&amp;lt;/i&amp;gt;&lt;br /&gt;
	  &amp;lt;par n=&amp;quot;kond&amp;quot;/&amp;gt;&lt;br /&gt;
	&amp;lt;/e&amp;gt;&lt;br /&gt;
&amp;lt;/section&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/dictionary&amp;gt;&amp;lt;/source&amp;gt;&lt;br /&gt;
The dictionary above returned this result:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;Input:&#039;&#039;&#039;&amp;lt;br/&amp;gt;&lt;br /&gt;
poyathukond&amp;lt;br/&amp;gt;&lt;br /&gt;
poyaathukond&amp;lt;br/&amp;gt;&lt;br /&gt;
poyathu&amp;lt;br/&amp;gt;&lt;br /&gt;
athukond&amp;lt;br/&amp;gt;&lt;br /&gt;
thu&amp;lt;br/&amp;gt;&lt;br /&gt;
thukond&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Output:&#039;&#039;&#039;&amp;lt;br/&amp;gt;&lt;br /&gt;
^poyathukond/po+athu&amp;lt;noun&amp;gt;+kond&amp;lt;noun&amp;gt;$&amp;lt;br/&amp;gt;&lt;br /&gt;
^poyaathukond/po+athu&amp;lt;noun&amp;gt;+kond&amp;lt;noun&amp;gt;$&amp;lt;br/&amp;gt;&lt;br /&gt;
^poyathu/po+athu&amp;lt;noun&amp;gt;/po+athu&amp;lt;noun&amp;gt;$&amp;lt;br/&amp;gt;&lt;br /&gt;
^athukond/athu&amp;lt;noun&amp;gt;+kond&amp;lt;noun&amp;gt;$&amp;lt;br/&amp;gt;&lt;br /&gt;
^thu/athu&amp;lt;noun&amp;gt;$&amp;lt;br/&amp;gt;&lt;br /&gt;
^thukond/athu&amp;lt;noun&amp;gt;+kond&amp;lt;noun&amp;gt;$&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
As can be seen, the program resolves even misspelled compounds such as poyaathukond. The current lttoolbox markup doesn&#039;t give finer control over that. To check the spelling, the compound needs to be regenerated from the stems and matched against the input.&lt;/div&gt;</summary>
		<author><name>Jaseem</name></author>
	</entry>
	<entry>
		<id>https://wiki.smc.org.in/index.php?title=User:Jaseem/spellcheck&amp;diff=4706</id>
		<title>User:Jaseem/spellcheck</title>
		<link rel="alternate" type="text/html" href="https://wiki.smc.org.in/index.php?title=User:Jaseem/spellcheck&amp;diff=4706"/>
		<updated>2014-03-11T23:06:48Z</updated>

		<summary type="html">&lt;p&gt;Jaseem: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Malayalam Spell-checker =&lt;br /&gt;
== Problem==&lt;br /&gt;
English dictionaries &amp;quot;rely on complete lists of full word forms, a requirement that&lt;br /&gt;
cannot be met for morphologically complex languages&amp;quot; like Malayalam.&lt;br /&gt;
Theoretically, an unlimited number of words can agglutinate in Malayalam; in practice the number is generally fewer than 10. Handling agglutinations and inflections in a spell-checker can be challenging.&lt;br /&gt;
&lt;br /&gt;
Refer to http://thottingal.in/documents/MalayalamComputingChallenges.pdf&lt;br /&gt;
&lt;br /&gt;
=== Other Challenges ===&lt;br /&gt;
*Homophonic root words can have different inflections&lt;br /&gt;
*;മറക്കുക &amp;amp; മറയുക; പറയുക &amp;amp; പറക്കുക&lt;br /&gt;
*The same word can inflect differently in the same context (not common)&lt;br /&gt;
*; പോവുക, പോകുക&lt;br /&gt;
*Sandhi rules are complex.&lt;br /&gt;
&lt;br /&gt;
==Possible solutions==&lt;br /&gt;
===Hunspell===&lt;br /&gt;
Hunspell has an algorithm for figuring out agglutination. Need to figure out how to use it.&lt;br /&gt;
&lt;br /&gt;
===Implementation in other languages===&lt;br /&gt;
Spell Checking an Agglutinative Language: Quechua&lt;br /&gt;
http://www.zora.uzh.ch/52921/1/ltc-106-rios.pdf&lt;br /&gt;
Quechua doesn&#039;t seem to have the complexity of Malayalam sandhi. The automaton presented in the paper doesn&#039;t seem to work for Malayalam.&lt;br /&gt;
*;kachichasqa= kachi + cha +sqa&lt;br /&gt;
&lt;br /&gt;
http://www.cmpe.boun.edu.tr/~akin/papers/spelling_checking_in_Turkish.pdf&lt;br /&gt;
&lt;br /&gt;
http://arxiv.org/pdf/cmp-lg/9410004.pdf&lt;br /&gt;
&lt;br /&gt;
;Stemmer: For finding root words&lt;br /&gt;
http://www.ldcil.org/up/conferences/morph/presentations/Vijay%20[Compatibility%20Mode].pdf&lt;br /&gt;
http://www.cse.iitb.ac.in/~pb/papers/cicling12-stemming.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Lttoolbox===&lt;br /&gt;
Lttoolbox, from the Apertium package, can be used to tokenize and lemmatize compounds, agglutinations, and inflections.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;source lang=&amp;quot;xml&amp;quot;&amp;gt;&amp;lt;dictionary&amp;gt;&lt;br /&gt;
  &amp;lt;alphabet&amp;gt;abcdefghijklmnopqrstuvwxyz&amp;lt;/alphabet&amp;gt;&lt;br /&gt;
  &amp;lt;sdefs&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;noun&amp;quot; /&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;s&amp;quot;/&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;pl&amp;quot;/&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;root&amp;quot;/&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;past&amp;quot;/&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;verb&amp;quot;/&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;compound-only-L&amp;quot;   c=&amp;quot;May only be the left-side of a compound&amp;quot;/&amp;gt;&lt;br /&gt;
		&amp;lt;sdef n=&amp;quot;compound-R&amp;quot;    c=&amp;quot;May be the right-side of a compound, or a full word&amp;quot;/&amp;gt;&lt;br /&gt;
  &amp;lt;/sdefs&amp;gt;&lt;br /&gt;
  &lt;br /&gt;
&amp;lt;pardefs&amp;gt;&lt;br /&gt;
	 &lt;br /&gt;
	&amp;lt;pardef n=&amp;quot;poyi_v&amp;quot;&amp;gt;&lt;br /&gt;
		&amp;lt;e&amp;gt;&amp;lt;p&amp;gt;    &amp;lt;l/&amp;gt;        &amp;lt;r&amp;gt;&amp;lt;s n=&amp;quot;verb&amp;quot;/&amp;gt; &amp;lt;/r&amp;gt; &amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
		&amp;lt;e&amp;gt;&amp;lt;p&amp;gt;    &amp;lt;l&amp;gt;yi&amp;lt;/l&amp;gt;   &amp;lt;r&amp;gt;&amp;lt;s n=&amp;quot;verb&amp;quot;/&amp;gt;&amp;lt;s n=&amp;quot;past&amp;quot;/&amp;gt;&amp;lt;/r&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
		&amp;lt;e&amp;gt;&amp;lt;p&amp;gt;    &amp;lt;l&amp;gt;ya&amp;lt;/l&amp;gt;    &amp;lt;r&amp;gt;&amp;lt;s n=&amp;quot;compound-only-L&amp;quot;/&amp;gt;&amp;lt;/r&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
	&amp;lt;/pardef&amp;gt;&lt;br /&gt;
   &lt;br /&gt;
	&lt;br /&gt;
	&amp;lt;pardef n=&amp;quot;athu_n&amp;quot;&amp;gt;&lt;br /&gt;
		&amp;lt;e&amp;gt;&amp;lt;p&amp;gt;  &amp;lt;l&amp;gt;athu&amp;lt;/l&amp;gt;  &amp;lt;r&amp;gt;athu&amp;lt;s n=&amp;quot;noun&amp;quot;/&amp;gt;&amp;lt;/r&amp;gt; &amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
		&amp;lt;e&amp;gt;&amp;lt;p&amp;gt;    &amp;lt;l&amp;gt;athu&amp;lt;/l&amp;gt;    &amp;lt;r&amp;gt;athu&amp;lt;s n=&amp;quot;noun&amp;quot;/&amp;gt;&amp;lt;s n=&amp;quot;compound-only-L&amp;quot;/&amp;gt;&amp;lt;/r&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
		&amp;lt;e&amp;gt;&amp;lt;p&amp;gt;    &amp;lt;l&amp;gt;thu&amp;lt;/l&amp;gt;    &amp;lt;r&amp;gt;athu&amp;lt;s n=&amp;quot;noun&amp;quot;/&amp;gt;&amp;lt;s n=&amp;quot;compound-R&amp;quot;/&amp;gt;&amp;lt;s n=&amp;quot;compound-only-L&amp;quot;/&amp;gt;&amp;lt;/r&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
	&amp;lt;/pardef&amp;gt;&lt;br /&gt;
&lt;br /&gt;
	&amp;lt;pardef n=&amp;quot;kond&amp;quot;&amp;gt;&lt;br /&gt;
        &amp;lt;e&amp;gt;&amp;lt;p&amp;gt;&amp;lt;l/&amp;gt;&amp;lt;r&amp;gt;&amp;lt;s n=&amp;quot;noun&amp;quot;/&amp;gt;&amp;lt;s n=&amp;quot;compound-R&amp;quot;/&amp;gt;&amp;lt;/r&amp;gt;&amp;lt;/p&amp;gt;&amp;lt;/e&amp;gt;&lt;br /&gt;
    &amp;lt;/pardef&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
	&lt;br /&gt;
&amp;lt;/pardefs&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;section id=&amp;quot;main&amp;quot; type=&amp;quot;standard&amp;quot;&amp;gt;&lt;br /&gt;
	&amp;lt;e lm=&amp;quot;povuka&amp;quot;&amp;gt;&lt;br /&gt;
	  &amp;lt;i&amp;gt;po&amp;lt;/i&amp;gt;&lt;br /&gt;
	  &amp;lt;par n=&amp;quot;poyi_v&amp;quot;/&amp;gt;&lt;br /&gt;
	&amp;lt;/e&amp;gt;&lt;br /&gt;
	&amp;lt;e lm=&amp;quot;athu&amp;quot;&amp;gt;&lt;br /&gt;
	  &amp;lt;i&amp;gt;&amp;lt;/i&amp;gt;&lt;br /&gt;
	  &amp;lt;par n=&amp;quot;athu_n&amp;quot;/&amp;gt;&lt;br /&gt;
	&amp;lt;/e&amp;gt;&lt;br /&gt;
	&amp;lt;e lm=&amp;quot;kond&amp;quot;&amp;gt;&lt;br /&gt;
	  &amp;lt;i&amp;gt;kond&amp;lt;/i&amp;gt;&lt;br /&gt;
	  &amp;lt;par n=&amp;quot;kond&amp;quot;/&amp;gt;&lt;br /&gt;
	&amp;lt;/e&amp;gt;&lt;br /&gt;
&amp;lt;/section&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/dictionary&amp;gt;&amp;lt;/source&amp;gt;&lt;br /&gt;
The dictionary above returned this result:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;&lt;br /&gt;
&#039;&#039;&#039;Input:&#039;&#039;&#039;&amp;lt;br/&amp;gt;&lt;br /&gt;
poyathukond&amp;lt;br/&amp;gt;&lt;br /&gt;
poyaathukond&amp;lt;br/&amp;gt;&lt;br /&gt;
poyathu&amp;lt;br/&amp;gt;&lt;br /&gt;
athukond&amp;lt;br/&amp;gt;&lt;br /&gt;
thu&amp;lt;br/&amp;gt;&lt;br /&gt;
thukond&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;Output:&#039;&#039;&#039;&amp;lt;br/&amp;gt;&lt;br /&gt;
^poyathukond/po+athu&amp;lt;noun&amp;gt;+kond&amp;lt;noun&amp;gt;$&amp;lt;br/&amp;gt;&lt;br /&gt;
^poyaathukond/po+athu&amp;lt;noun&amp;gt;+kond&amp;lt;noun&amp;gt;$&amp;lt;br/&amp;gt;&lt;br /&gt;
^poyathu/po+athu&amp;lt;noun&amp;gt;/po+athu&amp;lt;noun&amp;gt;$&amp;lt;br/&amp;gt;&lt;br /&gt;
^athukond/athu&amp;lt;noun&amp;gt;+kond&amp;lt;noun&amp;gt;$&amp;lt;br/&amp;gt;&lt;br /&gt;
^thu/athu&amp;lt;noun&amp;gt;$&amp;lt;br/&amp;gt;&lt;br /&gt;
^thukond/athu&amp;lt;noun&amp;gt;+kond&amp;lt;noun&amp;gt;$&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
As can be seen, the program resolves even misspelled compounds such as poyaathukond. The current lttoolbox markup doesn&#039;t give finer control over that. To check the spelling, the compound needs to be regenerated from the stems and matched against the input.&lt;/div&gt;</summary>
		<author><name>Jaseem</name></author>
	</entry>
	<entry>
		<id>https://wiki.smc.org.in/index.php?title=User:Jaseem/spellcheck&amp;diff=4673</id>
		<title>User:Jaseem/spellcheck</title>
		<link rel="alternate" type="text/html" href="https://wiki.smc.org.in/index.php?title=User:Jaseem/spellcheck&amp;diff=4673"/>
		<updated>2014-03-06T14:21:10Z</updated>

		<summary type="html">&lt;p&gt;Jaseem: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Malayalam Spell-checker =&lt;br /&gt;
== Problem==&lt;br /&gt;
English dictionaries &amp;quot;rely on complete lists of full word forms, a requirement that&lt;br /&gt;
cannot be met for morphologically complex languages&amp;quot; like Malayalam.&lt;br /&gt;
Theoretically, an unlimited number of words can agglutinate in Malayalam; in practice the number is generally fewer than 10. Handling agglutinations and inflections in a spell-checker can be challenging.&lt;br /&gt;
&lt;br /&gt;
Refer to http://thottingal.in/documents/MalayalamComputingChallenges.pdf&lt;br /&gt;
&lt;br /&gt;
=== Other Challenges ===&lt;br /&gt;
*Homophonic root words can have different inflections&lt;br /&gt;
*;മറക്കുക &amp;amp; മറയുക; പറയുക &amp;amp; പറക്കുക&lt;br /&gt;
*The same word can inflect differently in the same context (not common)&lt;br /&gt;
*; പോവുക, പോകുക&lt;br /&gt;
*Sandhi rules are complex.&lt;br /&gt;
&lt;br /&gt;
==Possible solutions==&lt;br /&gt;
===Hunspell===&lt;br /&gt;
Hunspell has an algorithm for figuring out agglutination. Need to figure out how to use it.&lt;br /&gt;
&lt;br /&gt;
===Implementation in other languages===&lt;br /&gt;
Spell Checking an Agglutinative Language: Quechua&lt;br /&gt;
http://www.zora.uzh.ch/52921/1/ltc-106-rios.pdf&lt;br /&gt;
Quechua doesn&#039;t seem to have the complexity of Malayalam sandhi. The automaton presented in the paper doesn&#039;t seem to work for Malayalam.&lt;br /&gt;
*;kachichasqa= kachi + cha +sqa&lt;br /&gt;
&lt;br /&gt;
http://www.cmpe.boun.edu.tr/~akin/papers/spelling_checking_in_Turkish.pdf&lt;br /&gt;
&lt;br /&gt;
http://arxiv.org/pdf/cmp-lg/9410004.pdf&lt;br /&gt;
&lt;br /&gt;
;Stemmer: For finding root words&lt;br /&gt;
http://www.ldcil.org/up/conferences/morph/presentations/Vijay%20[Compatibility%20Mode].pdf&lt;br /&gt;
http://www.cse.iitb.ac.in/~pb/papers/cicling12-stemming.pdf&lt;/div&gt;</summary>
		<author><name>Jaseem</name></author>
	</entry>
	<entry>
		<id>https://wiki.smc.org.in/index.php?title=User:Jaseem/spellcheck&amp;diff=4672</id>
		<title>User:Jaseem/spellcheck</title>
		<link rel="alternate" type="text/html" href="https://wiki.smc.org.in/index.php?title=User:Jaseem/spellcheck&amp;diff=4672"/>
		<updated>2014-03-06T14:13:24Z</updated>

		<summary type="html">&lt;p&gt;Jaseem: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Malayalam Spell-checker =&lt;br /&gt;
== Problem==&lt;br /&gt;
English dictionaries &amp;quot;rely on complete lists of full word forms, a requirement that&lt;br /&gt;
cannot be met for morphologically complex languages&amp;quot; like Malayalam.&lt;br /&gt;
Theoretically, an unlimited number of words can agglutinate in Malayalam; in practice the number is generally fewer than 10. Handling agglutinations and inflections in a spell-checker can be challenging.&lt;br /&gt;
&lt;br /&gt;
Refer to http://thottingal.in/documents/MalayalamComputingChallenges.pdf&lt;br /&gt;
&lt;br /&gt;
=== Other Challenges ===&lt;br /&gt;
*Homophonic root words can have different inflections&lt;br /&gt;
*;മറക്കുക &amp;amp; മറയുക; പറയുക &amp;amp; പറക്കുക&lt;br /&gt;
*The same word can inflect differently in the same context (not common)&lt;br /&gt;
*; പോവുക, പോകുക&lt;br /&gt;
*Sandhi rules are complex.&lt;br /&gt;
&lt;br /&gt;
==Possible solutions==&lt;br /&gt;
===Hunspell===&lt;br /&gt;
Hunspell has an algorithm for figuring out agglutination. Need to figure out how to use it.&lt;br /&gt;
&lt;br /&gt;
===Implementation in other languages===&lt;br /&gt;
Spell Checking an Agglutinative Language: Quechua&lt;br /&gt;
http://www.zora.uzh.ch/52921/1/ltc-106-rios.pdf&lt;br /&gt;
Quechua doesn&#039;t seem to have the complexity of Malayalam sandhi. The automaton presented in the paper doesn&#039;t seem to work for Malayalam.&lt;br /&gt;
*;kachichasqa= kachi + cha +sqa&lt;br /&gt;
&lt;br /&gt;
http://www.cmpe.boun.edu.tr/~akin/papers/spelling_checking_in_Turkish.pdf&lt;br /&gt;
&lt;br /&gt;
http://arxiv.org/pdf/cmp-lg/9410004.pdf&lt;br /&gt;
&lt;br /&gt;
;Stemmer: For finding root words&lt;/div&gt;</summary>
		<author><name>Jaseem</name></author>
	</entry>
	<entry>
		<id>https://wiki.smc.org.in/index.php?title=User:Jaseem/spellcheck&amp;diff=4664</id>
		<title>User:Jaseem/spellcheck</title>
		<link rel="alternate" type="text/html" href="https://wiki.smc.org.in/index.php?title=User:Jaseem/spellcheck&amp;diff=4664"/>
		<updated>2014-03-03T14:15:39Z</updated>

		<summary type="html">&lt;p&gt;Jaseem: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Malayalam Spell-checker =&lt;br /&gt;
== Problem==&lt;br /&gt;
English dictionaries &amp;quot;rely on complete lists of full word forms, a requirement that&lt;br /&gt;
cannot be met for morphologically complex languages&amp;quot; like Malayalam.&lt;br /&gt;
Theoretically, an unlimited number of words can agglutinate in Malayalam; in practice the number is generally fewer than 10. Handling agglutinations and inflections in a spell-checker can be challenging.&lt;br /&gt;
&lt;br /&gt;
Refer to http://thottingal.in/documents/MalayalamComputingChallenges.pdf&lt;br /&gt;
&lt;br /&gt;
=== Other Challenges ===&lt;br /&gt;
*Homophonic root words can have different inflections&lt;br /&gt;
*;മറക്കുക &amp;amp; മറയുക; പറയുക &amp;amp; പറക്കുക&lt;br /&gt;
*The same word can inflect differently in the same context (not common)&lt;br /&gt;
*; പോവുക, പോകുക&lt;br /&gt;
*Sandhi rules are complex.&lt;br /&gt;
&lt;br /&gt;
==Possible solutions==&lt;br /&gt;
===Hunspell===&lt;br /&gt;
Hunspell has an algorithm for figuring out agglutination. Need to figure out how to use it.&lt;br /&gt;
&lt;br /&gt;
===Implementation in other languages===&lt;br /&gt;
Spell Checking an Agglutinative Language: Quechua&lt;br /&gt;
http://www.zora.uzh.ch/52921/1/ltc-106-rios.pdf&lt;br /&gt;
Quechua doesn&#039;t seem to have the complexity of Malayalam sandhi. The automaton presented in the paper doesn&#039;t seem to work for Malayalam.&lt;br /&gt;
*;kachichasqa= kachi + cha +sqa&lt;br /&gt;
&lt;br /&gt;
http://www.cmpe.boun.edu.tr/~akin/papers/spelling_checking_in_Turkish.pdf&lt;br /&gt;
&lt;br /&gt;
http://arxiv.org/pdf/cmp-lg/9410004.pdf&lt;/div&gt;</summary>
		<author><name>Jaseem</name></author>
	</entry>
	<entry>
		<id>https://wiki.smc.org.in/index.php?title=User:Jaseem/spellcheck&amp;diff=4662</id>
		<title>User:Jaseem/spellcheck</title>
		<link rel="alternate" type="text/html" href="https://wiki.smc.org.in/index.php?title=User:Jaseem/spellcheck&amp;diff=4662"/>
		<updated>2014-03-03T11:31:12Z</updated>

		<summary type="html">&lt;p&gt;Jaseem: Created page with &amp;quot;= Malayalam Spell-checker = == Problem== English dictionaries &amp;quot;rely on complete lists of full word forms, a requirement that cannot be met for morphologically complex language...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;= Malayalam Spell-checker =&lt;br /&gt;
== Problem==&lt;br /&gt;
English dictionaries &amp;quot;rely on complete lists of full word forms, a requirement that&lt;br /&gt;
cannot be met for morphologically complex languages&amp;quot; like Malayalam.&lt;br /&gt;
Theoretically, an unlimited number of words can agglutinate in Malayalam; in practice the number is generally fewer than 10. Handling agglutinations and inflections in a spell-checker can be challenging.&lt;br /&gt;
&lt;br /&gt;
Refer to http://thottingal.in/documents/MalayalamComputingChallenges.pdf&lt;br /&gt;
&lt;br /&gt;
=== Other Challenges ===&lt;br /&gt;
*Homophonic root words can have different inflections&lt;br /&gt;
*;മറക്കുക &amp;amp; മറയുക; പറയുക &amp;amp; പറക്കുക&lt;br /&gt;
*The same word can inflect differently in the same context (not common)&lt;br /&gt;
*; പോവുക, പോകുക&lt;br /&gt;
*Sandhi rules are complex.&lt;br /&gt;
&lt;br /&gt;
==Possible solutions==&lt;br /&gt;
===Hunspell===&lt;br /&gt;
Hunspell has an algorithm for figuring out agglutination. Need to figure out how to use it.&lt;br /&gt;
&lt;br /&gt;
===Implementation in other languages===&lt;br /&gt;
Spell Checking an Agglutinative Language: Quechua&lt;br /&gt;
http://www.zora.uzh.ch/52921/1/ltc-106-rios.pdf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
http://www.cmpe.boun.edu.tr/~akin/papers/spelling_checking_in_Turkish.pdf&lt;br /&gt;
&lt;br /&gt;
http://arxiv.org/pdf/cmp-lg/9410004.pdf&lt;/div&gt;</summary>
		<author><name>Jaseem</name></author>
	</entry>
</feed>