Search for Meaning Through the Study of Co-occurrences in Texts - Université Paris 1 Panthéon-Sorbonne
Conference paper. Year: 2015

Search for Meaning Through the Study of Co-occurrences in Texts

Abstract

In this paper, we combine several text-mining tools in order to study both the lexicon and the semantic structure of a set of medieval texts. On the one hand, the study of occurrences (Principal Component Analysis, Topic Models, Self-Organizing Maps, Hierarchical Cluster Analysis) offers a wide range of tools for extracting and displaying information from big data. On the other hand, the study of co-occurrences (words belonging to the same sentence or paragraph) makes it possible to keep track of the structure of each text, but it is more tedious to handle and often leads to messy visualizations. Here we use the Self-Organizing Map (SOM) algorithm to reduce the size of the data (clustering, removal of fickle information) while preserving the semantic structure; we can then rely on classical but slower algorithms (Hierarchical Cluster Analysis (HCA), graph representation) to propose data visualizations.
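As a rough illustration of the first two steps described in the abstract, the sketch below counts sentence-level co-occurrences and then compresses the resulting word vectors with a small one-dimensional SOM. It is not the authors' implementation: the toy corpus, the function names (`cooccurrence_vectors`, `train_som`, `best_unit`), the sentence-sized window, and the learning-rate and neighbourhood schedules are all assumptions made for the example. The later HCA or graph step, which would run on the SOM prototypes, is not shown.

```python
import math
import random
from collections import Counter
from itertools import combinations


def cooccurrence_vectors(sentences):
    """For each word, count how often it shares a sentence with every other word."""
    vocab = sorted({w for s in sentences for w in s})
    counts = Counter()
    for s in sentences:
        # Each unordered pair of distinct words in a sentence counts once.
        for a, b in combinations(sorted(set(s)), 2):
            counts[(a, b)] += 1
            counts[(b, a)] += 1
    vectors = {}
    for w in vocab:
        row = [float(counts[(w, v)]) for v in vocab]
        norm = math.sqrt(sum(x * x for x in row)) or 1.0
        vectors[w] = [x / norm for x in row]  # unit length, so frequency matters less
    return vocab, vectors


def best_unit(protos, x):
    """Index of the prototype closest to x (squared Euclidean distance)."""
    return min(
        range(len(protos)),
        key=lambda j: sum((a - b) ** 2 for a, b in zip(protos[j], x)),
    )


def train_som(data, n_units=2, n_steps=200, seed=0):
    """Train a one-dimensional SOM: a string of n_units prototypes (assumed topology)."""
    rng = random.Random(seed)
    protos = [list(rng.choice(data)) for _ in range(n_units)]
    for t in range(n_steps):
        frac = t / n_steps
        lr = 0.5 * (1.0 - frac)                         # learning rate decays linearly
        radius = max(n_units / 2 * (1.0 - frac), 0.5)   # neighbourhood shrinks over time
        x = rng.choice(data)
        bmu = best_unit(protos, x)
        for j, p in enumerate(protos):
            # Units close to the winner on the string are pulled more strongly.
            h = math.exp(-((j - bmu) ** 2) / (2 * radius ** 2))
            for k in range(len(p)):
                p[k] += lr * h * (x[k] - p[k])
    return protos


# Hypothetical toy corpus: each sentence is a list of tokens.
sentences = [
    ["number", "sum", "product"],
    ["sum", "product", "number"],
    ["king", "crown", "court"],
    ["court", "king", "crown"],
]
vocab, vectors = cooccurrence_vectors(sentences)
protos = train_som([vectors[w] for w in vocab], n_units=2)
clusters = {w: best_unit(protos, vectors[w]) for w in vocab}
```

Here `clusters` maps each word to the index of its nearest prototype. On a real corpus the number of prototypes would be much smaller than the vocabulary, which is what would make a slower method such as HCA affordable afterwards.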
Main file
papier_iwann_2015_revised.pdf (758.56 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01519217, version 1 (06-05-2017)

Licence

Copyright (All rights reserved)

Identifiers

  • HAL Id: hal-01519217, version 1

Cite

Nicolas Bourgeois, Marie Cottrell, Stéphane Lamasse, Madalina Olteanu. Search for Meaning Through the Study of Co-occurrences in Texts. International Work-Conference on Artificial Neural Networks, Jun 2015, Palma de Mallorca, Spain. ⟨hal-01519217⟩
204 views
976 downloads
