Skip to Main content Skip to Navigation
Conference papers

GenPR: Generative PageRank Framework for Semi-supervised Learning on Citation Graphs

Abstract : Nowadays, Semi-Supervised Learning (SSL) on citation graph data sets is a rapidly growing area of research. However, the recently proposed graph-based SSL algorithms use a default adjacency matrix with binary weights on edges (citations), that causes a loss of the nodes (papers) similarity information. In this work, therefore, we propose a framework focused on embedding PageRank SSL in a generative model. This framework allows one to do joint training of nodes latent space representation and label spreading through the reweighted adjacency matrix by node similarities in the latent space. We explain that a generative model can improve accuracy and reduce the number of iteration steps for PageRank SSL. Moreover, we show that our framework outperforms the best graph-based SSL algorithms on four public citation graph data sets and improves the interpretability of classification results.
Complete list of metadata

Cited literature [9 references]  Display  Hide  Download
Contributor : Konstantin Avrachenkov Connect in order to contact the contributor
Submitted on : Saturday, October 24, 2020 - 4:45:05 PM
Last modification on : Thursday, December 10, 2020 - 4:23:02 PM
Long-term archiving on: : Monday, January 25, 2021 - 6:08:24 PM


Files produced by the author(s)




Mikhail Kamalov, Konstantin Avrachenkov. GenPR: Generative PageRank Framework for Semi-supervised Learning on Citation Graphs. INL 2020 - 9th Conference on Artificial Intelligence and Natural Language, Oct 2020, Helsinki, Finland. pp.158-165, ⟨10.1007/978-3-030-59082-6_12⟩. ⟨hal-02977308⟩



Record views


Files downloads