User:Yash
Let's get started:9 Apr'13
Hello all!
I am Yash Sinha, currently a student of BITS Pilani, India. Starting my WikiPage today.
Setting Up Repo:9 Apr'13
Today I tried to set up the silpa git repo on my machine. I started early, because I knew there could be some difficulties. Initially, I cloned the old repository by mistake, which led to a lot of errors.
Later, I switched to the current in-progress version of silpa at (github.com/Project-SILPA/) and cloned the following modules:
- [Soundex] (github.com/Project-SILPA/Soundex)
- [ApproxSearch] (github.com/Project-SILPA/ApproxSearch)
- [Transliteration] (github.com/Project-SILPA/Transliteration)
- [Spellchecker] (github.com/Project-SILPA/spellchecker)
- [Hyphenation] (github.com/Project-SILPA/Hyphenation)
- [Chardetails] (github.com/Project-SILPA/chardetails)
- [Payyans] (github.com/Project-SILPA/payyans)
I also installed the following modules: Flask, Jinja2, Werkzeug and Virtualenv. That is where I made a mistake: instead of Flask I should have used flask. To track that mistake down I had to reinstall almost all of my modules.
Initially, I had used sudo python setup.py install to install the cloned modules. This was also an error (I suppose). Later I logged in as root and used python setup.py install.
What I learnt:
- Python is a case-sensitive language. (F/flask) Yeah it is :)
- Use the sudo command only when needed.
- I also learnt to log in as root both via the terminal and the GUI.
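The F/flask point can be checked with a small sketch. It assumes the mix-up showed up at import time, and it uses the standard-library module json as a stand-in so that it runs without Flask installed; the helper name module_exists is made up for this illustration.

```python
import importlib.util

def module_exists(name):
    """Return True if a top-level module with exactly this name can be found."""
    return importlib.util.find_spec(name) is not None

if __name__ == "__main__":
    # Same letters, different case: only the exact lowercase name is found.
    print("json:", module_exists("json"))
    print("JSON:", module_exists("JSON"))
```

The same check with "flask" and "Flask" should behave the same way once Flask is installed in the active environment.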
Unicode .. Devanagari.. Transliteration …10 Apr'13
Today, I had a hectic but good day.
I tried to learn what transliteration is all about and how it works.
It is done using CMUDict, a pronunciation dictionary that maps each English word to its sequence of phonemes. For example, the entry for the word BENGAL is:
BENGAL B EH NG AH L
Each phoneme in this entry is looked up to find its language-specific equivalent. The result is then adjusted according to the rules of the target language to form a proper transliterated word.
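The lookup step described above can be sketched roughly as follows. This is only an illustration, not the actual silpa code: the table PHONEME_TO_HI and both function names are hypothetical, and the table covers just the phonemes of this one example. The language-specific clean-up step (vowel signs, virama and so on) is deliberately left out.

```python
import re

# Hypothetical phoneme-to-Devanagari table covering only this example;
# a real table would need an entry for every CMUDict phoneme.
PHONEME_TO_HI = {"B": "ब", "EH": "ए", "NG": "ङ", "AH": "अ", "L": "ल"}

def parse_cmudict_line(line):
    """Split 'WORD PH1 PH2 ...' into the word and its list of phonemes."""
    word, *phonemes = line.split()
    # CMUDict marks vowel stress with a trailing digit (e.g. EH1); drop it.
    return word, [re.sub(r"\d$", "", p) for p in phonemes]

def map_phonemes(phonemes, table):
    """Look up each phoneme in the table; unknown phonemes are kept as-is."""
    return [table.get(p, p) for p in phonemes]

if __name__ == "__main__":
    word, phonemes = parse_cmudict_line("BENGAL B EH NG AH L")
    print(word, phonemes, map_phonemes(phonemes, PHONEME_TO_HI))
```

The output of map_phonemes is only a list of raw sound units; joining them into a well-formed word is the part that depends on the target language.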
I also made the ‘hindi_english_dict’, ‘hi_vowels’ and ‘hi_vowel_signs’ dictionaries, which are basically sound mappings from hi to en_IN.
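To give an idea of how three such dictionaries might work together, here is a minimal sketch. Only the dictionary names come from my notes above; their contents here are a tiny made-up sample, and the function hi_to_en and its rules (inherent 'a' after a consonant, dropped at the end of a word, anusvara written as 'n') are simplifying assumptions.

```python
# Tiny sample contents, assumed for illustration only.
hindi_english_dict = {"ब": "b", "ग": "g", "ल": "l", "न": "n", "म": "m", "क": "k"}
hi_vowels = {"अ": "a", "आ": "aa", "इ": "i", "ए": "e"}
hi_vowel_signs = {"\u093e": "aa", "\u093f": "i", "\u0947": "e"}

VIRAMA = "\u094d"    # suppresses the inherent vowel of a consonant
ANUSVARA = "\u0902"  # nasal sign, written here simply as 'n'

def hi_to_en(text):
    """Very rough Devanagari-to-Latin transliteration (hypothetical helper)."""
    out = []
    for i, ch in enumerate(text):
        nxt = text[i + 1] if i + 1 < len(text) else ""
        if ch in hindi_english_dict:
            out.append(hindi_english_dict[ch])
            # A consonant carries an inherent 'a' unless a vowel sign or a
            # virama follows; it is also dropped at the end of the text.
            if nxt and nxt not in hi_vowel_signs and nxt != VIRAMA:
                out.append("a")
        elif ch in hi_vowel_signs:
            out.append(hi_vowel_signs[ch])
        elif ch in hi_vowels:
            out.append(hi_vowels[ch])
        elif ch == ANUSVARA:
            out.append("n")
        elif ch != VIRAMA:
            out.append(ch)  # anything unknown is passed through unchanged
    return "".join(out)

if __name__ == "__main__":
    print(hi_to_en("नमक"))
    print(hi_to_en("बंगाल"))
```

Keeping vowel signs in their own dictionary is what lets the code decide whether a consonant keeps its inherent vowel; real Hindi needs more rules than this (for example, schwa deletion inside words).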
I also learnt about various Unicode signs used in Hindi (Devanagari), like CANDRABINDU, ANUSVARA, VISARGA and VIRAMA, and their counterparts in Kannada, Telugu and Malayalam.
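These signs can be looked up by their official Unicode names with Python's standard unicodedata module. The sketch below is just one way to list them; the function name sign_table is made up, and CANDRABINDU is shown only for Devanagari because the other scripts' candrabindu signs may be missing from older Unicode databases.

```python
import unicodedata

SCRIPTS = ["DEVANAGARI", "KANNADA", "TELUGU", "MALAYALAM"]
SIGNS = ["ANUSVARA", "VISARGA", "VIRAMA"]

def sign_table():
    """Map (script, sign) to the code point of '<SCRIPT> SIGN <SIGN>'."""
    table = {}
    for script in SCRIPTS:
        for sign in SIGNS:
            ch = unicodedata.lookup(f"{script} SIGN {sign}")
            table[(script, sign)] = ord(ch)
    return table

if __name__ == "__main__":
    for (script, sign), code in sign_table().items():
        print(f"{script:<11} {sign:<9} U+{code:04X}")
    # The Devanagari candrabindu sits at U+0901.
    print(unicodedata.name("\u0901"))
```

The printed code points show that each sign sits at the same offset inside its script's block, which is why the counterparts are easy to find across scripts.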
Thanks to my friends for helping me here.
These websites also helped me a lot:
- people.w3.org/rishida/scripts/uniview.fr/chars-devanagari.html
- en.wiktionary.org/wiki/Appendix:Unicode/Devanagari
- www.infowebservices.in/hindi/
- en.wikipedia.org/wiki/Devanagari
- jrgraphix.net/r/Unicode/0900-097F
There is also a nice list of Devanagari characters in PDF format: www.unicode.org/charts/PDF/U0900.pdf
Finally, सभी को धन्यवाद! (Thank you, everyone!)