Abstract
The effects of social influence and homophily suggest that both network structure and node-attribute information should inform the tasks of link prediction and node-attribute inference. Recently, Yin et al. [2010a, 2010b] proposed an attribute-augmented social network model, which we call the Social-Attribute Network (SAN), to integrate network structure and node attributes to perform both link prediction and attribute inference. They focused on generalizing the random walk with restart algorithm to the SAN framework and showed improved performance. In this article, we extend the SAN framework with several leading supervised and unsupervised link-prediction algorithms and demonstrate performance improvement for each algorithm on both link prediction and attribute inference. Moreover, we make the novel observation that attribute inference can help inform link prediction; that is, link-prediction accuracy is further improved by first inferring missing attributes. We comprehensively evaluate these algorithms and compare them with other existing algorithms using a novel, large-scale Google+ dataset, which we make publicly available (http://www.cs.berkeley.edu/~stevgong/gplus.html).
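To make the idea of an attribute-augmented network concrete, the following is a minimal sketch, not the authors' implementation. It assumes that each attribute value becomes an extra node linked to the users who hold it, and it uses a plain shared-neighbor count as a stand-in for the link-prediction algorithms the abstract refers to. The function names (`build_san`, `shared_neighbors`) and the toy data are hypothetical. Treating attribute inference as scoring a missing user-attribute link is likewise an assumption of this sketch.

```python
from collections import defaultdict


def build_san(social_links, node_attributes):
    """Build an undirected adjacency map over user nodes and attribute nodes.

    Nodes are tagged tuples, ("user", name) or ("attr", value), so the two
    kinds of node can never collide.
    """
    adj = defaultdict(set)
    for u, v in social_links:
        adj[("user", u)].add(("user", v))
        adj[("user", v)].add(("user", u))
    for u, attrs in node_attributes.items():
        for a in attrs:
            adj[("user", u)].add(("attr", a))
            adj[("attr", a)].add(("user", u))
    return adj


def shared_neighbors(adj, x, y):
    """Count the nodes (users or attributes) adjacent to both x and y."""
    return len(adj[x] & adj[y])


# Toy data (hypothetical): a path of four users plus a few attributes.
social_links = [("alice", "bob"), ("bob", "carol"), ("carol", "dave")]
node_attributes = {
    "alice": {"employer:Acme", "city:Berkeley"},
    "carol": {"employer:Acme"},
    "dave": {"city:Berkeley"},
}

san = build_san(social_links, node_attributes)
social_only = build_san(social_links, {})

# Link prediction: alice and dave share no friends, but share one attribute.
link_score_social = shared_neighbors(social_only, ("user", "alice"), ("user", "dave"))
link_score_san = shared_neighbors(san, ("user", "alice"), ("user", "dave"))

# Attribute inference: score the missing link between bob and an attribute node
# by counting bob's friends who hold that attribute.
attr_score = shared_neighbors(san, ("user", "bob"), ("attr", "employer:Acme"))
```

In this toy graph the structure-only score for alice and dave is zero, while the augmented graph gives them a nonzero score through the shared attribute node; the same scoring function also ranks candidate attributes for bob, which is the sense in which one network supports both tasks.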
- L. A. Adamic and E. Adar. 2003. Friends and neighbors on the web. Social Netw. 25, 3, 211--230.
- L. Backstrom and J. Leskovec. 2011. Supervised random walks: Predicting and recommending links in social networks. In Proceedings of the WSDM Conference.
- A.-L. Barabási and R. Albert. 1999. Emergence of scaling in random networks. Science 286, 509--512.
- S. Bartunov, A. Korshunov, S.-T. Park, W. Ryu, and H. Lee. 2012. Joint link-attribute user identity resolution in online social networks. In Proceedings of the Workshop on Social Network Mining and Analysis (SNA-KDD).
- M. Bilgic, G. Namata, and L. Getoor. 2007. Combining collective classification and link prediction. In Proceedings of the ICDM Workshops. IEEE Computer Society, 381--386.
- S. Brin and L. Page. 1998. The anatomy of a large-scale hypertextual Web search engine. Comput. Netw. ISDN Syst. 30, 1--7, 107--117.
- A. Clauset, C. Moore, and M. E. J. Newman. 2008. Hierarchical structure and the prediction of missing links in networks. Nature 453, 7191, 98--101.
- J. R. Doppa, J. Yu, P. Tadepalli, and L. Getoor. 2010. Learning algorithms for link prediction based on chance constraints. In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD). 344--360.
- T. L. Fond and J. Neville. 2011. Randomization tests for distinguishing social influence and homophily effects. In Proceedings of the World Wide Web Conference (WWW). ACM, New York, NY, 601--610.
- N. Z. Gong, A. Talwalkar, L. Mackey, L. Huang, E. C. R. Shin, E. Stefanov, E. Shi, and D. Song. 2012a. Jointly predicting links and inferring attributes using a social-attribute network (SAN). In Proceedings of the Workshop on Social Network Mining and Analysis (SNA-KDD).
- N. Z. Gong, W. Xu, L. Huang, P. Mittal, E. Stefanov, V. Sekar, and D. Song. 2012b. Evolution of social-attribute networks: Measurements, modeling, and implications using Google+. In Proceedings of the Internet Measurement Conference (IMC).
- D. J. Hand and R. J. Till. 2001. A simple generalisation of the area under the ROC curve for multiple class classification problems. Machine Learn. 45, 171--186.
- M. A. Hasan, V. Chaoji, S. Salem, and M. Zaki. 2006. Link prediction using supervised learning. In Proceedings of the SIAM Workshop on Link Analysis, Counterterrorism and Security.
- T. Joachims. 1999. Making large-scale SVM learning practical. In Advances in Kernel Methods - Support Vector Learning, MIT Press, 169--184.
- M. Kim and J. Leskovec. 2011. Modeling social networks with node attributes using the multiplicative attribute graph model. In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence (UAI).
- G. Kossinets. 2006. Effects of missing data in social networks. Social Netw. 28, 247--268.
- G. Kossinets and D. Watts. 2006. Empirical analysis of an evolving social network. Science 311, 5757, 88--90.
- R. Kumar, J. Novak, P. Raghavan, and A. Tomkins. 2004. Structure and evolution of blogspace. Commun. ACM 47, 12, 35--39.
- J. Leskovec, L. Backstrom, R. Kumar, and A. Tomkins. 2008. Microscopic evolution of social networks. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD). ACM, 462--470.
- R.-H. Li, J. X. Yu, and J. Liu. 2011. Link prediction: The power of maximal entropy random walk. In Proceedings of the ACM Conference on Information and Knowledge Management (CIKM).
- D. Liben-Nowell and J. Kleinberg. 2003. The link prediction problem for social networks. In Proceedings of the ACM Conference on Information and Knowledge Management (CIKM). 556--559.
- R. N. Lichtenwalter, J. T. Lussier, and N. V. Chawla. 2010. New perspectives and methods in link prediction. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD).
- F. McSherry and M. Najork. 2008. Computing information retrieval performance measures efficiently in the presence of tied scores. In Proceedings of the European Conference on Information Retrieval (ECIR).
- P. Melville and V. Sindhwani. 2010. Recommender systems. In Encyclopedia of Machine Learning. Springer.
- A. K. Menon and C. Elkan. 2011. Link prediction via matrix factorization. In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD).
- K. T. Miller, T. L. Griffiths, and M. I. Jordan. 2009. Nonparametric latent feature models for link prediction. In Proceedings of the Neural Information Processing Systems Conference (NIPS).
- J.-Y. Pan, H.-J. Yang, C. Faloutsos, and P. Duygulu. 2003. Automatic multimedia cross-modal correlation discovery. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD).
- D. Rao, M. Paul, C. Fink, D. Yarowsky, T. Oates, and G. Coppersmith. 2011. Hierarchical Bayesian models for latent attribute detection in social networks. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM).
- D. Rao, D. Yarowsky, A. Shreevats, and M. Gupta. 2010. Classifying latent user attributes in Twitter. In Proceedings of the 2nd International Workshop on Search and Mining User-Generated Contents (SMUC). ACM, New York, NY, 37--44.
- J. Scripps, P.-N. Tan, F. Chen, and A.-H. Esfahanian. 2009. A matrix alignment approach for collective classification. In Proceedings of the International Conference on Advances in Social Networks Analysis and Mining (ASONAM).
- A. P. Singh and G. J. Gordon. 2008. Relational learning via collective matrix factorization. In Proceedings of the KDD.
- P. Symeonidis, E. Tiakas, and Y. Manolopoulos. 2010. Transitive node similarity for link prediction in social networks with positive and negative links. In Proceedings of the ACM Recommender System Conference (RecSys).
- A. Talwalkar, S. Kumar, and H. Rowley. 2008. Large-scale manifold learning. In Proceedings of the CVPR. 273--297.
- B. Taskar, M.-F. Wong, P. Abbeel, and D. Koller. 2003. Link prediction in relational data. In Proceedings of the NIPS.
- H. Tong, C. Faloutsos, and J.-Y. Pan. 2006. Fast random walk with restart and its applications. In Proceedings of the ICDM.
- S. H. Yang, B. Long, A. Smola, N. Sadagopan, Z. Zheng, and H. Zha. 2011. Like like alike: Joint friendship and interest propagation in social networks. In Proceedings of the WWW. 537--546.
- Z. Yin, M. Gupta, T. Weninger, and J. Han. 2010a. LINKREC: A unified framework for link recommendation with user attributes and graph structure. In Proceedings of the WWW. 1211--1212.
- Z. Yin, M. Gupta, T. Weninger, and J. Han. 2010b. A unified framework for link recommendation using random walks. In Proceedings of the International Conference on Advances in Social Networks Analysis and Mining (ASONAM).
- K. Yu, W. Chu, S. Yu, V. Tresp, and Z. Xu. 2006. Stochastic relational models for discriminative link prediction. In Proceedings of the NIPS.
- E. Zheleva and L. Getoor. 2009. To join or not to join: The illusion of privacy in social networks with mixed public and private user profiles. In Proceedings of the WWW.
Index Terms
- Joint Link Prediction and Attribute Inference Using a Social-Attribute Network