Options
Influence of language morphological complexity on information retrieval
Auteur(s)
Dolamic, Ljiljana
Date de parution
2010
Résumé
In this dissertation two aspects of information retrieval are elaborated. The frst involves the creation and evaluation of various linguistic tools for languages less studied than English, and in our case we have chosen to work with the two Slavic languages Czech and Russian, and three languages widely spoken on the Indian subcontinent, Hindi, Marathi and Bengali. To do so we compare various indexing strategies and IR models most likely to obtain the best possible performance. The second part involves an evaluation of the effectiveness of queries written in different languages when searching collections written in either English or French. To cross the language barriers we apply publicly available machine translation services, analyze the results and then explain the poor performances obtained by the translated queries.
Notes
Thèse de doctorat : Université de Neuchâtel, 2010
Identifiants
Type de publication
doctoral thesis
Dossier(s) à télécharger