Recherche documentaire en langues arabe et asiatique
Project responsable | Jacques Savoy |
Abstract |
This research proposal will focus on retrieval models for Asian
languages (monolingual IR) as well as effective merging strategies
for both data fusion and collection fusion problems (merging
various search results achieved by different IR models). In this
research proposal we only consider textual information and exclude
any spoken text, images or video sequences. There are three objectives in carrying out this research: (1) Obtain better knowledge of the relative retrieval effectiveness of various probabilistic models (Okapi), Divergence from Randomness or language models) when using different indexing schemes for Asian languages (monolingual IR). (2) Design and implement effective data fusion (when searching the same document collection using different search systems) and collection merging procedures (searching into distinct document collections). 3) Propose a simple and effective query translation strategy capable of effectively crossing language barriers (bilingual IR). In this case, the request will be written in one given language and document collections in another. This research is based in part on our participation on two NTCIR evaluation campaigns (http://research.nii.ac.jp/ntcir/) where we obtain very good IR performance. |
Keywords |
Information retrieval (IR), cross-lingual IR (CLIR), multilingual IR (MLIR), Asian languageprocessing (Chinese, Korean, Information retrieval, Cross-language information retrieval, natural language processing, Chinese IR, Japanese IR, Korean IR, Asian language processing, Chinese, Japanes |
Type of project | Fundamental research project |
Research area | Informatique |
Method of financing | FNS - Encouragement de projets (Div. I-III) |
Status | Completed |
Start of project | 1-5-2007 |
End of project | 31-8-2007 |
Overall budget | 47'838.00 |
Contact | Jacques Savoy |