Using token-based semantic vector spaces for corpus-linguistic analyses: From practical applications to tests of theoretical claims
Date issued
2017
In
Corpus Linguistics and Linguistic Theory, De Gruyter, 2017///1-32
Subjects
semantic vector spaces token-based word sense disambiguation asymmetric priming
Abstract
This paper presents token-based semantic vector spaces as a tool that can be applied in corpus-linguistic analyses such as word sense comparisons, comparisons of synonymous lexical items, and matching of concordance lines with a given text. We demonstrate how token-based semantic vector spaces are created, and we illustrate the kinds of result that can be obtained with this approach. Our main argument is that token-based semantic vector spaces are not only useful for practical corpus-linguistic applications but also for the investigation of theory-driven questions. We illustrate this point with a discussion of the asymmetric priming hypothesis (Jäger and Rosenbach 2008). The asymmetric priming hypothesis, which states that grammaticalizing constructions will be primed by their lexical sources but not vice versa, makes a number of empirically testable predictions. We operationalize and test these predictions, concluding that token-based semantic vector spaces yield conclusions that are relevant for linguistic theory-building.
Publication type
journal article
File(s)![Thumbnail Image]()
Loading...
Name
Hilpert_Martin_-_Using_token-based_semantic_vector_spaces_for_corpus-linguistic_20181126.pdf
Type
Main Article
Size
1.87 MB
Format
Adobe PDF
