FamilyID: A Hybrid Approach to Identify Family Information from Microblogs

Abstract : With the growing popularity of social networks, an extremely large number of users routinely post messages about their daily lives to online social networking services. In particular, we have observed that family-related information, including some very sensitive information, is freely available and easily extracted from Twitter. In this paper, we present a hybrid information retrieval mechanism, namely FamilyID, to identify and extract family-related information about a user from his/her microblogs (tweets). The proposed model takes into account part-of-speech tagging, pattern matching, lexical similarity, and semantic similarity of the tweets. Experimental results show that FamilyID provides both high precision and high recall. We expect the project to serve as a warning to users that they may have accidentally revealed too much personal/family information to the public. It could also help microblog users evaluate the amount of information that they have already revealed.
Document type :
Conference papers


https://hal.inria.fr/hal-01745822
Contributor : Hal Ifip
Submitted on : Wednesday, March 28, 2018 - 3:57:44 PM
Last modification on : Wednesday, March 28, 2018 - 3:59:29 PM
Long-term archiving on : Thursday, September 13, 2018 - 11:43:11 AM

File

340025_1_En_14_Chapter.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License


Citation

Jamuna Gopal, Shu Huang, Bo Luo. FamilyID: A Hybrid Approach to Identify Family Information from Microblogs. 29th IFIP Annual Conference on Data and Applications Security and Privacy (DBSEC), Jul 2015, Fairfax, VA, United States. pp.215-222, ⟨10.1007/978-3-319-20810-7_14⟩. ⟨hal-01745822⟩

