Geo-linguistic fingerprint and the evolution of languages in Twitter - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2012

Geo-linguistic fingerprint and the evolution of languages in Twitter

Résumé

Having access to content of messages sent by some given group of subscribers of a social network may be used to identify (and quantify) some features of that group. The feature can stand for the level of interest in some event or product, or for the popularity of some idea, or a musical hit or of a political figure. The feature can also stand for the way the written language is used and transformed, the way words are spelled and grammer is used. In this paper we shall be interested in identifying features of groups of subscribers that have their geographic location and their language in common. We develop a methodology that allows one to perform such a study using a statistical tool which is freely available, and which makes use of a part of all tweets which twitter makes available for free over the Internet. The methodology is based on the fact that one can differentiate among some geographic areas according to the activity pattern of tweets during the time of the day. We present an application of this methodology to the study of new spellings or of new words created in twitter messages
Fichier principal
Vignette du fichier
c6.pdf (2.33 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00674853 , version 1 (28-02-2012)
hal-00674853 , version 2 (24-04-2012)

Identifiants

  • HAL Id : hal-00674853 , version 2

Citer

Eitan Altman, Yonathan Portilla. Geo-linguistic fingerprint and the evolution of languages in Twitter. [Research Report] 2012, pp.14. ⟨hal-00674853v2⟩
373 Consultations
623 Téléchargements

Partager

Gmail Facebook X LinkedIn More