Options
Institut des sciences du langage
Pour une politique de la langue: la Délégation à la langue française de Suisse romande (DLF)
1999-2-11, Béguelin, Marie-José, de Pietro, Jean-François, Zahnd, Gabriela
Diagnostic des erreurs dans un corpus d'écrits techniques
2000, Béguelin, Marie-José
Présentation du corpus OFROM - corpus oral de français de Suisse romande
2012-2-12, Avanzi, Mathieu, Béguelin, Marie-José, Diémoz, Federica
Anotación multicamada externa e o enriquecemento de edicións dixitais
2018, Bermúdez Sabel, Helena
From a technological point of view, there are no limits to the amount of information an editor can add to a text. However, the encoding of the kind of multiple and miscellaneous data that is prone to be embedded in a digital edition might not be straightforward for inexperienced editors. Most beginners apply the most common method, that is, inline encoding. This technique entails the direct introduction of elements into the primary document which, together with their attributes, contain the structural and analytical information of the text. Such a method is perfectly adequate for simple ecdotic models. However, it becomes less efficient when we want to introduce multiple (and overlapping) layers of information. n this contribution, we present a stand-off annotation method which allows multi-layered and hierarchical annotation with the goal of designing a complex, flexible and processable critical apparatus.
Communication entre soignants et patients migrants: quels moyens pour quels services?
2001, Gajo, Laurent
This article aims at a better understanding of the communication between care-givers and migrant patients in relation to several services (medicine, surgery, obstetrics, emergency, geriatrics and psychiatry). Efficiency of different communication strategies is examined through a linguistic analysis of interviews and effective interactions. Among these strategies, the practical and theoretical issues of the cultural mediation are particularly focused.
Louise Behe, Marion Carel, Corentin Denuc & Julio Cesar Machado (Dir.). 2021. Cours de sémantique argumentative
2022-3-21, Pamuksaç, Aylin
Ce compte rendu expose les diverses leçons qui composent "Le cours de sémantique argumentative" dirigé par Louise Behe, Marion Carel, Corentin Denuc & Julio Cesar Machado. Long de 520 pages, cet ouvrage présente de nombreux avantages et comble un vide indéniable dans le domaine de la sémantique argumentative.
Routines macro-syntaxiques et grammaticalisation: le cas des clauses en n'importe
2002, Béguelin, Marie-José, Leth Andersen, Anne, Nølke, Henning
The Diachronic Spanish Sonnet Corpus: TEI and linked open data encoding, data distribution, and metrical findings
2020, Ruiz Fabo, Pablo, Bermúdez Sabel, Helena, Martínez Cantón, Clara, González-Blanco, Elena
How has the sonnet form in Spanish evolved over the centuries? What is the distribution of metrical patterns and combinations thereof, considering diachronic, geographical, and social factors? What rhyme schemes are favoured in different periods and regions? How is enjambment distributed within the sonnet? Providing quantitative answers to such questions requires a corpus spanning several centuries, annotated for the relevant literary features and containing author metadata. The absence of appropriate digital resources to undertake a macroanalytic study of the evolution of the sonnet in Spanish led us to create the Diachronic Spanish Sonnet Corpus. This article presents how the corpus was designed for providing quantitative evidence on the evolution of sonnets in Spanish, and our findings regarding metrics and enjambment. The corpus contains 4,085 sonnets by 1,204 Spanish and Latin American authors (15th to 19th centuries), encoded in TEI, with RDFa attributes. The corpus aims at breadth, including many peripheral authors besides some major ones. Author metadata were encoded (dates, origin, gender). Scansion and enjambment were annotated automatically, with the ADSO and ANJA tools. The range of authors and periods, the use of TEI and RDFa for interoperability, and the combination of metrical and enjambment annotations goes beyond previously available digital resources. The corpus allowed us to examine the evolution of metrical patterns and their combinations after the Golden Age, complementing earlier studies. We also observed an increase in enjambment across the tercets in the 19th century, which may indicate increased variety in the discourse organization of sonnets in the period.
Setting Up Bilingual Comparable Corpora with Non-Contemporary Languages
2022, Bermúdez Sabel, Helena, Dell'Oro, Francesca, Montrichard, Cyrielle, Rossari, Corinne
This paper presents the project “Les corpora latins et français: une fabrique pour l’accès à la représentation des connaissances” (Latin and French Corpora: a Factory For Accessing Knowledge Representation) whose focus is the study of modality in both Latin and French by means of multi-genre, diachronic comparable corpora. The setting up of such corpora involves a number of conceptualisation challenges, in particular with regard to how to compare two asynchronous textual productions corresponding to different cultural frameworks. In this paper we outline the rationale of designing comparable corpora to explore our research questions and then focus on some of the issues that arise when comparing different diachronic spans of Latin and French. We also explain how these issues were dealt with, thus providing some grounds upon which other projects could build their methodology.
Décalages. Les niveaux de l'analyse linguistique
1989-2-11, Berrendonner, Alain, Béguelin, Marie-José