Testability of robotic applications is difficult due to several reasons including the battery limitations of mobile robots, hardware wear and tear, inadequate number of the robots, etc.
Collecting data through a game with a purpose (GWAP) mechanism has become a popular approach due to its numerous benefits. We developed previously a game playing application related to the identification of some properties of Turkish verbs. The current project will be a continuation of the previous project. We aim at extending this research and improving the application by developing a new mechanism. We will collect data, measure the success rate of the mechanism, and show the usefullness of the new mechanism.
Read the following papers related to the previous project:
A multi-word expression (MWE) is a lexeme (term) made up of a sequence of two or more lexemes that has properties that are not predictable from the properties of the individual lexemes or their normal mode of combination. In short, a MWE is like an idiom. In several languages, there are built-in dictionaries of MWEs. In this project, we aim at forming a dictionary (lexicon) of MWEs for Turkish.
Text summarization is the task of forming a summary of a given document. There are two types of summaries: extracts (just selecting some of the important sentences in the document as the summary) and abstracts (generating new sentences from scratch to form the summary). In this project, we deal only with extract type of summarization. The classical metric used to evaluate the success of a summarization system is the Rouge metric.
Given a text document and a predetermined set of classes, text classification (categorization) is the task of finding the correct class of the document. Centroid-based classification is a simple type of classification in which the given document is assigned to a class based on the similarity between the document and the centroids of the classes. In this project, we begin with reading the paper “A New Centroid Based Classification Model for Text Categorization”, Liu, C., Wang, W., Tu, G., Xiang, Y. and Konan, M., 2016.
Comparative analysis is the task of identifying comparison sentences in the documents, analyzing the word groups and structures in these sentences, and extracting the compared objects and their properties. For instance, in the comparison sentence “the sound quality of CD player X is better than that of CD player Y”, the objects CD players X and Y are compared and the result of the comparison is that X is better than Y in terms of sound quality. In this project, we aim at doing comparative analysis for Turkish sentences.
In this two-terms project, we aim at identifying some architectural properties of buildings using machine learning techniques. The properties may be the architect of the building, the era of the building, the architectural type of the building, etc. First a number of photographs of the buildings will be compiled. Then they will be analyzed using the OpenCV (Open Source Computer Vision) library and important features will be extracted. Then a machine learning method (artificial neural network) will be applied to determine the desired properties.