Welcome!

Nepali NLP Group conducts research and development activities in the field of natural language processing. Our research combines findings from linguistics with methods in machine leaning to develop efficient algorithms to process texts in Nepali.


Broadly, we work in the following areas:


  • Nepali NLP, morphology, parsing

  • Information extraction, data mining

  • Text analytics, social medial analytics

  • Linguistics resource development: corpora, lexicons




  • Applications of Regular Expression in Text Analysis

    by Ingroj Shrestha on Sept. 4, 2017


    Text analysis applications require frequent pattern matching and searching. For this reason, regular expressions play an important role in text analysis. Regular expressions are special sequence of characters that are useful for searching in texts. They can be used to ...

    Read More

    Tag: Text Analysis , Python , Regular Expressions


    Processing Unicode(Devnagari) in Python

    by Ingroj Shrestha on Sept. 14, 2017

    Source: Devanagari (Unicode block)


    Unicode is a standard for representing characters in different languages using four digit hexadecimal number called code points. Each character is associated with a unique code point. In python, these code points are represented as \uXXXX ...

    Read More

    Tag: Python , Regular Expressions