Skip to Main content Skip to Navigation
Conference papers

GenPR: Generative PageRank Framework for Semi-supervised Learning on Citation Graphs

Abstract : Nowadays, Semi-Supervised Learning (SSL) on citation graph data sets is a rapidly growing area of research. However, the recently proposed graph-based SSL algorithms use a default adjacency matrix with binary weights on edges (citations), that causes a loss of the nodes (papers) similarity information. In this work, therefore, we propose a framework focused on embedding PageRank SSL in a generative model. This framework allows one to do joint training of nodes latent space representation and label spreading through the reweighted adjacency matrix by node similarities in the latent space. We explain that a generative model can improve accuracy and reduce the number of iteration steps for PageRank SSL. Moreover, we show that our framework outperforms the best graph-based SSL algorithms on four public citation graph data sets and improves the interpretability of classification results.
Complete list of metadatas

Cited literature [9 references]  Display  Hide  Download
Contributor : Konstantin Avrachenkov <>
Submitted on : Saturday, October 24, 2020 - 4:45:05 PM
Last modification on : Tuesday, November 17, 2020 - 12:10:13 PM


Files produced by the author(s)




Mikhail Kamalov, Konstantin Avrachenkov. GenPR: Generative PageRank Framework for Semi-supervised Learning on Citation Graphs. The 9th Conference on Artificial Intelligence and Natural Language (AINL 2020), Oct 2020, Helsinki, Finland. pp.158-165, ⟨10.1007/978-3-030-59082-6_12⟩. ⟨hal-02977308⟩



Record views


Files downloads