FamilyID: A Hybrid Approach to Identify Family Information from Microblogs - Archive ouverte HAL Access content directly
Conference Papers Year : 2015

FamilyID: A Hybrid Approach to Identify Family Information from Microblogs

(1) , (2) , (3)
1
2
3
Shu Huang
  • Function : Author
  • PersonId : 1029892
Bo Luo
  • Function : Author
  • PersonId : 1029893

Abstract

With the growing popularity of social networks, extremely large amount of users routinely post messages about their daily life to online social networking services. In particular, we have observed that family related information, including some very sensitive information, are freely available and easily extracted from Twitter. In this paper, we present a hybrid information retrieval mechanism, namely FamilyID, to identify and extract family related information of a user from his/her microblogs (tweets). The proposed model takes into account part-of-speech tagging, pattern matching, lexical similarity, and semantic similarity of the tweets. Experiment results show that FamilyID provides both high precision and recall. We expect the project to serve as a warning to users that they may have accidentally revealed too much personal/family information to the public. It could also help microblog users to evaluate the amount of information that they have already revealed.
Fichier principal
Vignette du fichier
340025_1_En_14_Chapter.pdf (339.78 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-01745822 , version 1 (28-03-2018)

Licence

Attribution - CC BY 4.0

Identifiers

Cite

Jamuna Gopal, Shu Huang, Bo Luo. FamilyID: A Hybrid Approach to Identify Family Information from Microblogs. 29th IFIP Annual Conference on Data and Applications Security and Privacy (DBSEC), Jul 2015, Fairfax, VA, United States. pp.215-222, ⟨10.1007/978-3-319-20810-7_14⟩. ⟨hal-01745822⟩
232 View
91 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More