NLP library for Turkish

NLP library for Turkish

We address the problem of Part of Speech tagging for noisy texts in Turkish language. The popular social media network Twitter has gained so much attention in Turkey recent years and for this reason we considered Twitter as a perfect candidate for noisy text source for Turkish language and consequently have decided to supply our data set from Twitter. We used Hidden Markov Models (HMMs) for modeling our problem.We have created a data set in order to train and test our model which constitutes around 500 tweets. Considering the fact that no open-source data and tool has been available for POS tagging task in noisy texts for Turkish language we will make our data and tools available for the research community with a view of enabling further contributions to the problem.

Project Poster: 

Project Members: 

Murat Akif Dumlu

Project Advisor: 

Arzucan Özgür

Project Status: 

Project Year: 

2015
  • Fall

Bize Ulaşın

Bilgisayar Mühendisliği Bölümü, Boğaziçi Üniversitesi,
34342 Bebek, İstanbul, Türkiye

  • Telefon: +90 212 359 45 23/24
  • Faks: +90 212 2872461
 

Bizi takip edin

Sosyal Medya hesaplarımızı izleyerek bölümdeki gelişmeleri takip edebilirsiniz