Showing items 1 - 5 of 5
  • Publication
    Open access
    Frequency Dictionary French / Dictionnaire de fréquence du français
    (Leipzig: Leipziger Universitätsverlag, 2013)
    Eckart, Thomas; Quasthoff, Uwe
    The Frequency Dictionaries series aims at producing dictionaries with comparable frequency data for a large number of different languages. For many of the languages featured in this collection, this series is the first comprehensive compilation to use a large-scale empirical base. The dictionaries are available in both print and electronic versions. Each dictionary provides the 1,000 most frequent word forms in order of frequency and the 10,000 most frequent word forms in alphabetical order. They provide an introductory description of the data and the methodological approach used. In addition, language-specific statistical information is provided with regard to letters, word structure and structural changes. The enclosed CD-ROM contains a more comprehensive version of the dictionary as an e-book. This includes data on the relative frequency of up to 1,000,000 word forms presented in alphabetical order. The number of word forms for a particular language depends on the size and composition of the corpus used. This list of words (with frequency classes) is also available as a plain text file on the CD-ROM and is ordered both alphabetically and by frequency. Using this file, word lists for various applications can be generated easily. The word forms in the printed part of the dictionary have been checked carefully by hand to identify incorrect forms or words that are spelled according to the new spelling rules. In contrast, the more comprehensive list on the CD-ROM has been inspected by means of automatic plausibility criteria alone. For the compilation, comprehensive electronically available sources of the Leipzig Corpora Collection were used consistently. The corpora on which the individual frequency dictionaries are based include newspaper texts, Wikipedia articles and other randomly collected texts available on the Internet. They can be accessed online at http://corpora.informatik.uni-leipzig.de/. 
This series of dictionaries provides the opportunity to explore comparative linguistic topics and such monolingual issues as studies on word formation and frequency-based examinations of lexical areas for use in dictionaries or language teaching. The statistical results presented here can offer initial suggestions for several areas of research.
  • Publication
    Metadata only
    Einleitung
    (Neuchâtel: Institut des sciences du langage et de la communication, 2011)
  • Publication
    Metadata only
    Europäisch eingestellt – Valenzforschung mit Parallelkorpora
    The aim of this research is to demonstrate, by means of a case study, the significance of corpus linguistics for the study of verb valency and for bilingual lexicography. Specifically, we introduce a corpus-based process that determines context-sensitive translations of polysemous word forms. Three steps are considered in detail. First, textual evidence for the verb einstellen in the monolingual Deutsches Referenzkorpus (DeReKo) is examined by means of a collocation analysis. With the help of the analysis tool COSMAS II, the collocation profiles are then summarized in a typology (senses and subsenses, valency structures and typical collocations). In a further step, the senses identified are matched to the corresponding translations of the word form einstellen in other languages (English, French and Italian) using the multilingual parallel corpus Europarl (Open Source Parallel Corpus OPUS). Finally, the results are compared with the codifications of commonly used bilingual dictionaries.