Publications

(in press). One lexeme, many classes: inflection class systems as lattices. One-to-Many Relations in Morphology, Syntax and Semantics. PDF
Descriptions of inflection classes usually take the form of broad or fine-grained (Stump & Finkel 2013) partitions of the set of lexeme, or link both in a hierarchic system of classes (Corbett & Fraser 1993; Dressler & Thornton 1996). Recent efforts to infer those automatically (Brown & Hippisley 2012; Lee & Goldsmith 2013; Bonami 2014) all rely on the assumption that the …
(2020). Opening the Romance Verbal Inflection Dataset 2.0: A CLDF lexicon. Proceedings of The 12th Language Resources and Evaluation Conference. PDF
We introduce the Romance Verbal Inflection Dataset 2.0, a multilingual lexicon of Romance inflection covering 73 varieties. The lexicon provide verbal paradigm forms in broad IPA phonemic notation. Both lexemes and paradigm cells are organized to reflect cognacy. Such multi-lingual inflected lexicons annotated for two dimensions of cognacy are necessary to study the evolution of inflectional …
(2020). Automated Parsing of Interlinear Glossed Text from Page Images of Grammatical Descriptions. Proceedings of The 12th Language Resources and Evaluation Conference. PDF
Linguists seek insight from all human languages, however accessing information from most of the full store of extant global linguistic descriptions is not easy. One of the most common kinds of information that linguists have documented is vernacular sentences, as recorded in descriptive grammars. Typically these sentences are formatted as interlinear glossed text (IGT). Most descriptive grammars, …
(2018). Classifications flexionnelles: Étude quantitative des structures de paradigmes. Université Sorbonne Paris Cité - Université Paris Diderot (Paris 7), PhD thesis under the supervision of Olivier Bonami. PDF
This dissertation adopts the Word and Paradigm approach and elaborates computationaltools to investigate precisely the similarity structure of inflection class systems based on in-flectional lexicon. We study Arabic, Yaitepec Chatino, Zenzontepec Chatino, English, French,Navajo and European Portuguese verbs as well as Russian nouns.
(2017). When segmentation helps. Implicative structure and morph boundaries in the Navajo verb. First International Symposium on Morphology (ISMo). PDF Slides
Recent work in Word and Paradigm morphology argues that the implicative structure of paradigms is expressed in terms of relations between surface words, and that studying the structure of paradigms in terms of sub-word units is misleading if not outright impossible (Ackerman et al, 2009; Blevins, 2006, 2016; Bonami & Beniamine, 2016). The argument typically rests on the observation that a word …
(2017). Inferring Inflection Classes with Description Length. Journal of Language Modelling. PDF DOI
We discuss the notion of an inflection class system, a traditional ingredient of the description of inflection systems of nontrivial complexity. We distinguish systems of microclasses, which partition a set of lexemes in classes with identical behavior, and systems of macroclasses,which group lexemes that are similar enough in a few larger classes. On the basis of the intuition that macroclasses …
(2017). Un algorithme universel pour l'abstraction automatique d'alternances morphophonologiques. 24e Conférence sur le Traitement Automatique des Langues Naturelles (TALN). PDF
We present an implemented algorithm for the inference of morphophonological alternation patterns between word-forms. It is universal in that it leads to comparable classifications across languages without expectations on the shape of the alternations it searches for. Alternation patterns are anecessary first step for the quantitative study of morphology in the Word and Paradigm framework.
(2016). Joint predictiveness in inflectional paradigms. Word Structure. PDF DOI
This paper contributes to addressing the Paradigm Cell Filling Problem (PCFP) in inflectional paradigms, as defined by Ackerman et al. (2009). We define a method for extending the use of conditional entropy to address the PCFP to prediction based on multiple paradigm cells. We apply this method to French and European Portugese and show that, on average, knowledge of multiple paradigm cells is …
(2015). Implicative structure and joint predictiveness. Proceedings of the NetWordS Final Conference on Word Knowledge and Word Usage: Representations and Processes in the Mental Lexicon. PDF
Please cite my first name as “Sacha” in references to this talk.