Logo du site
  • English
  • Français
  • Se connecter
Logo du site
  • English
  • Français
  • Se connecter
  1. Accueil
  2. Université de Neuchâtel
  3. Publications
  4. Authorship Attribution Based on Specific Vocabulary
 
  • Details
Options
Vignette d'image

Authorship Attribution Based on Specific Vocabulary

Auteur(s)
Savoy, Jacques 
Institut d'informatique 
In
ACM Transactions on Information Systems (TOIS), 2012/30/3/Art. 12
Mots-clés
  • Performance
  • Experimentation
  • Performance

  • Experimentation

Résumé
In this article we propose a technique for computing a standardized Z score capable of defining the specific vocabulary found in a text (or part thereof) compared to that of an entire corpus. Assuming that the term occurrence follows a binomial distribution, this method is then applied to weight terms (words and punctuation symbols in the current study), representing the lexical specificity of the underlying text. In a final stage, to define an author profile we suggest averaging these text representations and then applying them along with a distance measure to derive a simple and efficient authorship attribution scheme. To evaluate this algorithm and demonstrate its effectiveness, we develop two experiments, the first based on 5,408 newspaper articles (<i>Glasgow Herald</i>) written in English by 20 distinct authors and the second on 4,326 newspaper articles (<i>La Stampa</i>) written in Italian by 20 distinct authors. These experiments demonstrate that the suggested classification scheme tends to perform better than the Delta rule method based on the most frequent words, better than the chi-square distance based on word profiles and punctuation marks, better than the KLD scheme based on a predefined set of words, and better than the naïve Bayes approach.
Identifiants
https://libra.unine.ch/handle/123456789/9568
_
10.1145/2180868.2180874
Type de publication
journal article
Dossier(s) à télécharger
 main article: Savoy_Jacques-Authorship_attribution_based_on_specific_vocabulary-20130108.pdf (1.64 MB)
google-scholar
Présentation du portailGuide d'utilisationStratégie Open AccessDirective Open Access La recherche à l'UniNE Open Access ORCIDNouveautés

Service information scientifique & bibliothèques
Rue Emile-Argand 11
2000 Neuchâtel
contact.libra@unine.ch

Propulsé par DSpace, DSpace-CRIS & 4Science | v2022.02.00