Searching strategies for the Hungarian language

Author(s)
Savoy, Jacques  
Institut d'informatique  
Date issued
2008
In
Information Processing & Management
Vol
44
No
1
From page
310
To page
324
Subjects
Hungarian information retrieval; Hungarian language; CLEF evaluation; decompounding; n-gram indexing; TEXT RETRIEVAL; PROBABILISTIC MODELS; INFORMATION; CLEF-2003; ALGORITHM
Abstract
This paper reports on the underlying IR problems encountered when dealing with the complex morphology and compound constructions found in the Hungarian language. It describes evaluations carried out on two general stemming strategies for this language, and also demonstrates that a light stemming approach could be quite effective. Based on searches done on the CLEF test collection, we find that a more aggressive suffix-stripping approach may produce better mean average precision (MAP). When compared to an IR scheme without stemming or one based on only a light stemmer, we find the differences to be statistically significant. When compared with probabilistic, vector-space and language models, we find that the Okapi model results in the best retrieval effectiveness. The resulting MAP is found to be about 35% better than the classical tf idf approach, particularly for very short requests. Finally, we demonstrate that applying an automatic decompounding procedure for both queries and documents significantly improves IR performance (+10%), compared to word-based indexing strategies. (c) 2007 Elsevier Ltd. All rights reserved.
Publication type
journal article
Identifiers
https://libra.unine.ch/handle/20.500.14713/53187