Welcome!

The Nepali NLP Group conducts research and development in the field of natural language processing. Our research combines findings from linguistics with methods from machine learning to develop efficient algorithms for processing Nepali text.


Broadly, we work in the following areas:


  • Nepali NLP, morphology, parsing

  • Information extraction, data mining

  • Text analytics, social media analytics

  • Linguistic resource development: corpora, lexicons




  • Data Acquisition and Preparation

    by Shreeya Singh Dhakal on July 8, 2017


    Data are key to any natural language processing and machine learning application. Machine learning algorithms learn from a predefined set of data. So, it is important that you feed the algorithms the right data. Also, it is equally important that ...

    Read More

    Tag: Data Mining


    Scraping Nepali News using Beautiful Soup

    by Ingroj Shrestha on July 9, 2017


    The increasing amount of information shared over the web makes it a huge source of data. To extract this data for analysis, you need web scraping. It is a popular technique to get data from web pages in ...

    Read More

    Tag: Data Mining


    Clustering Text Documents: TF-IDF Weighting

    by Ingroj Shrestha on Dec. 13, 2017


    This is the first post in the series "Clustering Text Documents". In it, we'll define the TF-IDF algorithm mathematically, work through an example, and give a Python implementation.


    TF-IDF ...

    Read More

    Tag: Data Mining, Information Retrieval, Machine Learning