Part-of-Speech Tagging with Hidden Markov Models

  Part-of-Speech tagging is a common sequence-tagging problem in natural language processing. It is the process of assigning a single word class label to each token in the input sentence. For example, for input: इराक सीमाबाट सेना हटाइने।, the output of the tagger is इराक-कुवेत/NN सीमा/NN बाट/II सेना/NN हटाइने/NN ।/YF.…

0 Comments

Processing Unicode(Devnagari) in Python

Source: Devanagari (Unicode block)     Unicode is a standard for representing characters in different languages using four digit hexadecimal number called code points. Each character is associated with a unique code point. In python, these code points are represented as \uXXXX, where \u indicates Unicode and XXXX is the four…

0 Comments

Iterative Rule-based Stemming in Nepali

Nepali, being a highly inflectional and derivational language, a single word can represent various grammatical forms and meanings. For example a verb root लेख्(lekh) can show different forms such as: लेख्छु(lekh-chu), लेख्छस्(lekh-chas), लेखछेस्(lekh-ches), लेख्छ(lekh-cha), लेखी(lekh-i), लेख्यो(lekh-yo), लेखे(lekh-e). Stemming is the process of reducing inflectional(or sometimes derivational) forms of words to…

0 Comments

Verbal Inflections in Nepali

  Like nominal inflections, verbal inflections in Nepali occur due to suffixation. Verbs in Nepali are highly inflected. Inflectional suffixes operate on verb roots to represent different grammatical categories of verb. In this blog post, we'll discuss verbal inflections in Nepali.   Different grammatical categories encoded by verbal inflections(suffixes) in…

0 Comments

Word Formation and Nominal Inflections in Nepali

Figure 1: Word Formation in Nepali   The process of word formations(शब्दनिर्माण / शब्दरचना) in Nepali can be categorized into: inflection(रुपायन) and derivation(व्युत्पादन). Types of word formation process in Nepali are given in Figure 1.   In this blog post, we'll discuss nominal(noun, pronoun and adjective) inflections in Nepali.  …

0 Comments

Scrapping Nepali News using Beautiful Soup

The increasing amount of information being shared over the web makes it a huge source of data/information. To extract this data for analysis you need web scrapping. It is a popular technique to get data from web page in whatever format suitable for your analysis. Also, make sure that you…

0 Comments