Anonymous

Changes

From SMC Wiki

GSoC/2013/Project ideas

117 bytes added, 05:20, 26 January 2017
m
Reverted edits by Sperminator (talk) to last revision by Hrishikesh.kb
* Please Read the [http://wiki.smc.org.in/SoC/2013#FAQ FAQ]
===<big>'''Apart from the following ideas , you can propose your own ideas==='''<If you want to propose an idea, please do it in [http://lists.smc.org.in/listinfo.cgi/discuss-smc.org.in project mailing list]big>
== Internationalize SILPA project with Wikimedia jquery projects=='''Project''':  SILPA project has many Indic language applications, but as of now, if somebody If you want to input in Indian languagespropose an idea, there is no built in tool in please do it. Similarly, the application is not internationalized. Both of these can be achieved by using the [//github.com/wikimedia/jquery.ime jquery.ime] and [//github.com/wikimedia/jquery.ime jquery.i18n] libraries from Wikimedia. A sample implementation is avaliable in our [http://lists.smc.org.in website]. The i18n should be in the SILPA flask framework with a nice templating system. Similarly the interface should have webfonts using [https://githublistinfo.comcgi/wikimedia/jquery.webfonts jquery.webfonts] library. * '''[https://savannah.nongnudiscuss-smc.org/task/index.php?12557 Savannah Task]'''* '''Expertise required''': jquery, css, html5, python , flask* '''Mentor''' : Hrishikesh K B====More Details====* [https://github.com/wikimedia/jquery.i18n jquery.i18n]* [https://github.com/wikimedia/jquery.ime jquery.ime]* [https://github.com/wikimedia/jquery.webfonts jquery.webfontsin project mailing list]>
== A spell checker for Indic language that understands inflections ==
texexec --xetex <file.tex>
==Automated Rendering Testing=='''Project''': Automated Rendering Testing system for Indic languages. Currently there exists 3 main rendering engines in computing world - Uniscribe of Microsoft, CoreText (Apple Advance Typography - AAT) of Apple and Harfbuzz for *nix systems. The Opentype font specification is maintained by Microsoft and implemented in Uniscribe, which is used as baseline for Harfbuzz. At present, there is no automated mechanism to determine if Harfbuzz is rendering complex Indic text correctly or not - someone expert in relevant language has to manually inspect the output from hb-view. The project aim is to identify and implement an automated method to test the rendering.One method to do this might be to check the order of glyphs/glyph indices output by the rendering engine - this depends on the font too. A related topic is UTRRS  * '''[https://savannah.nongnu.org/task/index.php?12560 Savannah Task]'''* '''Expertise required''': Knowledge of Indic language rendering and Opentype specification.* '''Mentor''' : Rajeesh K Nambiar ====More Details====* https://fedorahosted.org/utrrs/ * http://tdil-dc.in/utrrs/home/about ==Flask Based SILPABASED==
===Port remaining modules to the new flask based Silpa===
'''Project''': Silpa is being re-written using flask framework. Core part is almost complete. But most of the sub modules written under old framework are need to be ported to [http://flasksilpa-indic.rhcloud.com/ new framework].
'''Mentor''' : Vasudev
== Converting indic processing modules currently in SILPA into Jquery library =='''Project''': Port some of the silpa algorithms to Jquery. Several modules, alogorithms in SILPA project is done in python now. But porting them to javascript(jQuery) helps developers. For example, cross language transliteration can be done jquery too if we port the algorithm and transliteration rules. Similarly the approximate search can be ported. A flexibile fuzzy search on the web pages will be possible if we have the algorithm in jquery. ALternatively these js libraries can be used with node.js using server side js concept too. '''Expertise required''': javascript,JQuery, python '''Mentor''' :Jishnu == Improving cross language transliteration system. ===
'''Project''':
'''Mentor''' : Vasudev/Jishnu
== = Internationalize SILPA project with Wikimedia jquery projects , Improve Improving the webfonts module in Silpa using jquery.webfonts and proving provide more Indic and complex fonts as part of it. ==='''Project''': '''Internationalize SILPA''' :- SILPA project has many Indic language applications, but as of now, if somebody want to input in Indian languages, there is no built in tool in it. Similarly, the application is not internationalized. Both of these can be achieved by using the [//github.com/wikimedia/jquery.ime jquery.ime] and [//github.com/wikimedia/jquery.ime jquery.i18n] libraries from Wikimedia. A sample implementation is avaliable in our [http://smc.org.in website]. The i18n should be in the SILPA flask framework with a nice templating system. Similarly the interface should have webfonts using [https://github.com/wikimedia/jquery.webfonts jquery.webfonts] library. '''Improve the webfonts ''' :-
* Currently Silpa provides 36 webfonts. add more fonts to this collection.
* Rewrote webfonts module to use the features of jquery.webfonts
* Provide font preview and download options
'''Expertise required'''====More Details====* [https:jQuery, Python , technical understanding about fonts //github.com/wikimedia/jquery.i18n jquery.i18n]* [https://github.com/wikimedia/jquery.ime jquery.ime]'''Mentor''' * [https: Jishnu ==Adding Indic Sript Rendering Support to GIS applications ==//github.com/wikimedia/jquery.webfonts jquery.webfonts]
===Add Indic / Malayalam rendering to MapServer + OpenLayers stack.===Both are OSGeo projects, and used in most of the WebGIS applications recently. MapServer is an open source development environment for building spatially enabled internet applications. OpenLayers is an open source JavaScript library for displaying map data in web browsers. OpenLayers is used by OpenStreetMap for its "slippy map" map interface.* '''[https://savannah.nongnu.org/task/index.php?12555 Savannah Task]Expertise required''': jQuery, css, html5, Python , flask , technical understanding about fonts
====More Details====* http'''Mentor''' :Hrishikesh K B //www.mapserver.org/* http://www.openlayers.org/* http://www.osgeo.org/Sajjad Anwar
==Building a system and API's for accessing and upadating Malayalamgrandham Bibligiography Data==
==Developing Malayalam Calendar Support ==
This project is to build Malayalam Calendar support to KDE. Korganiser have support for many calendars like Saka calender, Hijri, Jalali. The project is to build support for other calendars with a special focus on malayalam calendarMalayalam Calendar (Kolla Varsham)
Following tasks are part of this work .
* Building Malayalam (kollavarsham) Calendar support to KDE/KOrganiser .* Build All possible indian Calendars as a generic library/api Add holidays and other special times.
====More Details====
* https://github.com/smc/smc/tree/master/calendar/java/src/org/panchanga/indic
==== Starting Points ==== * [https://github.com/smc/smc/tree/master/calendar/kde Build KDE from source]* Familiarize with how Saka and other calendars are done. '''Mentor''': Praveen A, Mahesh M
==Language model and Acoustic model for Malayalam language for speech recognition system in CMU Sphinx==
CMU Sphinx is a large vocabulary, speaker independent speech recognition codebase and suite of tools, which can be used to develop speech recognition system in any language. To develop an automatic speech recognition system in a language, acoustic model and language model has to framed for that particular language. Acoustic models characterize how sound changes over time. It captures the characteristics of basic recognition units. The language model describes the likelihood, probability, or penalty taken when a sequence or collection of words is seen. It attempts to convey behavior of the language and tries to predict the occurrence of specific word sequences possible in the language. Once these two models are developed, it will be useful to every one doing research in speech processing. For Indian languages Hindi, Tamil, Telugu and Marati, ASR systems have been developed using sphinx engine. In this project work is aimed at developing acoustic model and language model for Malayalam.
'''Mentor''': Deepa P. Gopinath=== Background Reading ===  * [http://www.cs.cmu.edu/~gopalakr/publications/spdatabases_specom05.pdf 'Development of Indian Language Speech Databases for Large Vocabulary Speech Recognition Systems'], Gopalakrishna Anumanchipalli, Rahul Chitturi, Sachin Joshi, Rohit Kumar, Satinder Pal Singh, R.N.V. Sitaram, S P Kishore * [http://www.aclweb.org/anthology/W/W12/W12-5808.pdf "Automatic Pronunciation Evaluation And Mispronunciation Detection Using CMUSphinx"], Ronanki Srikanth, James Salsman * http://www.speech.cs.cmu.edu/* http://cmusphinx.sourceforge.net/wiki/tutorial * [http://www.ijarcsse.com "HTK Based Telugu Speech Recognition"], P. Vijai Bhaskar, AVNIET ,Hyderabad, Prof. Dr. S. Rama Mohan Rao, A.Gopi  * [http://www.cs.cmu.edu/~araza/Automatic_Speech_Recognition_System_for_Urdu.PDF "Design and Development of an Automatic Speech Recognition System for Urdu"], Agha Ali Raza, M.Sc. Thesis, FAST‐National University of Computer and Emerging Sciences  * [http://www.ccis2k.org/iajit/PDF/vol.6,no.2/11IASRUCSS186.pdf "Investigation Arabic Speech Recognition Using CMU Sphinx System"], Hassan Satori1, 2, Hussein Hiyassat3, Mostafa Harti1, 2, and Noureddine Chenfour * [http://www.try.idv.tw/static-resources/homework/pr/PR_Final_Report.pdf "Understanding the CMU Sphinx Speech Recognition System"], Chun-Feng Liao ==Enhancement of Indic Language Support in Scribus== ===Project Description=== Scribus is an Open Source program that brings professional page layout to Linux/UNIX. Find more about scribus [http://www.scribus.net here]. Indic Languages are not supported completely in Scribus as of now. As a result of the work done by a group of developers from India, there is a branch available in Scribus public git repo which supports Indic Languages. The aim of this idea is to Enhance the Indic language support solving the following issues: * Harfbuzz is being used as a thrid party code in Scribus. Maintaining thirdparty code is a tedious task, this should be added as a dependency instead.* Enhance hyphenation by adding hyphenation rules. '''Expertise Required:''' Good understanding of C, C++, Qt, Harfbuz etc, cmake. ===More information=== * Branch indic in scribus public git repository. Clone url: git clone git://git.scribus.net/scribus.git* [[Hyphenation]]* [http://wiki.scribus.net/canvas/Git#To_sum_it_up_:_how_to_get_and_compile_a_specific_branch_:_indic_of_scribus.git Compile scribus indic branch]  '''Mentor''' : Jain Basil Aliyas (jainbasil in IRC) '''Complexity''' : Advanced