Authorship Attribution Based on Specific Vocabulary

Author(s)
Savoy, Jacques  
Institut d'informatique  
Date issued
2012
In
ACM Transactions on Information Systems (TOIS)
Vol
30
No
3
From page
Art. 1
Subjects
Performance, Experimentation
Abstract
In this article we propose a technique for computing a standardized Z score capable of defining the specific vocabulary found in a text (or part thereof) compared to that of an entire corpus. Assuming that the term occurrence follows a binomial distribution, this method is then applied to weight terms (words and punctuation symbols in the current study), representing the lexical specificity of the underlying text. In a final stage, to define an author profile we suggest averaging these text representations and then applying them along with a distance measure to derive a simple and efficient authorship attribution scheme. To evaluate this algorithm and demonstrate its effectiveness, we develop two experiments, the first based on 5,408 newspaper articles (Glasgow Herald) written in English by 20 distinct authors and the second on 4,326 newspaper articles (La Stampa) written in Italian by 20 distinct authors. These experiments demonstrate that the suggested classification scheme tends to perform better than the Delta rule method based on the most frequent words, better than the chi-square distance based on word profiles and punctuation marks, better than the KLD scheme based on a predefined set of words, and better than the naïve Bayes approach.
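The pipeline outlined in the abstract (Z-score term weighting, profile averaging, distance-based attribution) could be sketched roughly as follows. This is a minimal illustration, not the article's implementation: the abstract gives no formulas, so the sketch assumes the usual standardization of a binomial count, Z = (tf - n*p) / sqrt(n*p*(1-p)), with p estimated from the whole corpus, and it assumes a mean squared difference as the distance measure. All function and parameter names (`z_scores`, `author_profile`, `distance`, `attribute`) are hypothetical.

```python
import math
from collections import Counter


def z_scores(doc_tokens, corpus_counts, corpus_size, vocabulary):
    """Standardized Z score of each term in a text relative to the corpus.

    Assumption: term occurrence is binomial with p = corpus frequency of the
    term divided by corpus size, and n = length of the text.
    """
    n = len(doc_tokens)
    tf = Counter(doc_tokens)
    scores = {}
    for term in vocabulary:
        p = corpus_counts.get(term, 0) / corpus_size
        expected = n * p
        std = math.sqrt(n * p * (1.0 - p))
        # Terms with zero variance (p == 0 or p == 1) carry no signal here.
        scores[term] = (tf[term] - expected) / std if std > 0 else 0.0
    return scores


def author_profile(text_scores, vocabulary):
    """Average the Z-score representations of an author's texts."""
    count = len(text_scores)
    return {t: sum(s[t] for s in text_scores) / count for t in vocabulary}


def distance(a, b, vocabulary):
    """Mean squared difference between two Z-score vectors (an assumed choice)."""
    return sum((a[t] - b[t]) ** 2 for t in vocabulary) / len(vocabulary)


def attribute(query_scores, profiles, vocabulary):
    """Return the author whose profile is closest to the query text."""
    return min(
        profiles,
        key=lambda author: distance(query_scores, profiles[author], vocabulary),
    )
```

As a quick check of the weighting step: with a corpus of 100 tokens in which "the" occurs 50 times, a four-token text consisting only of "the" has an expected count of 2 and a standard deviation of 1, so `z_scores(["the"] * 4, {"the": 50, "cat": 50}, 100, ["the", "cat"])` gives 2.0 for "the" and -2.0 for "cat". Tokens would include punctuation symbols as well as words, as in the study.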
Publication type
journal article
Identifiers
https://libra.unine.ch/handle/20.500.14713/65142
DOI
10.1145/2180868.2180874
File(s)
Name
Savoy_Jacques-Authorship_attribution_based_on_specific_vocabulary-20130108.pdf
Type
Main Article
Size
1.64 MB
Format
Adobe PDF
