Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Joint Link Prediction and Attribute Inference Using a Social-Attribute Network

Published:30 April 2014Publication History
Skip Abstract Section

Abstract

The effects of social influence and homophily suggest that both network structure and node-attribute information should inform the tasks of link prediction and node-attribute inference. Recently, Yin et al. [2010a, 2010b] proposed an attribute-augmented social network model, which we call Social-Attribute Network (SAN), to integrate network structure and node attributes to perform both link prediction and attribute inference. They focused on generalizing the random walk with a restart algorithm to the SAN framework and showed improved performance. In this article, we extend the SAN framework with several leading supervised and unsupervised link-prediction algorithms and demonstrate performance improvement for each algorithm on both link prediction and attribute inference. Moreover, we make the novel observation that attribute inference can help inform link prediction, that is, link-prediction accuracy is further improved by first inferring missing attributes. We comprehensively evaluate these algorithms and compare them with other existing algorithms using a novel, large-scale Google+ dataset, which we make publicly available (http://www.cs.berkeley.edu/~stevgong/gplus.html).

References

  1. L. A. Adamic and E. Adar. 2003. Friends and neighbors on the web. Social Netw. 25, 3, 211--230. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. L. Backstrom and J. Leskovec. 2011. Supervised random walks: Predicting and recommending links in social networks. In Proceedings of the WSDM Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. A.-L. Barabási and R. Albert. 1999. Emergence of scaling in random networks. Science 286, 509--512.Google ScholarGoogle ScholarCross RefCross Ref
  4. S. Bartunov, A. Korshunov, S.-T. Park, W. Ryu, and H. Lee. 2012. Joint link-attribute user identity resolution in online social networks. In Proceedings of the Workshop on Social Network Mining and Analysis (SNA-KDD). Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Bilgic, G. Namata, and L. Getoor. 2007. Combining collective classification and link prediction. In Proceedings of the ICDM Workshops. IEEE Computer Society, 381--386. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Brin and L. Page. 1998. The anatomy of a large-scale hypertextual Web search engine. Comput. Netw. ISDN Syst. 30, 1--7, 107--117. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Clauset, C. Moore, and M. E. J. Newman. 2008. Hierarchical structure and the prediction of missing links in networks. Nature 453, 7191, 98--101. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. R. Doppa, J. Yu, P. Tadepalli, and L. Getoor. 2010. Learning algorithms for link prediction based on chance constraints. In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD). 344--360. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. T. L. Fond and J. Neville 2011. Randomization tests for distinguishing social influence and homophily effects. In Proceedings of the World Wide Web Conference (WWW). ACM, New York, NY, 601--610. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. N. Z. Gong, A. Talwalkar, L. Mackey, L. Huang, E. C. R. Shin, E. Stefanov, E. Shi, and D. Song. 2012a. Jointly predicting links and inferring attributes using a social-attribute network (san). In Proceedings of the Workshop on Social Network Mining and Analysis (SNA-KDD).Google ScholarGoogle Scholar
  11. N. Z. Gong, W. Xu, L. Huang, P. Mittal, E. Stefanov, V. Sekar, and D. Song. 2012b. Evolution of social-attribute networks: Measurements, modeling, and implications using google+. In Proceedings of the Internet Measurement Conference (IMC). Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. J. Hand and R. J. Till. 2001. A simple generalisation of the area under the ROC curve for multiple class classification problems. Machine Learn. 45, 171--186. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M. A. Hasan, V. Chaoji, S. Salem, and M. Zaki. 2006. Link prediction using supervised learning. In Proceedings of the SIAM Workshop on Link Analysis, Counterterrorism and Security.Google ScholarGoogle Scholar
  14. T. Joachims. 1999. Making large-scale SVM learning practical. In Advances in Kernel Methods - Support Vector Learning, MIT Press, 169--184. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Kim and J. Leskovec. 2011. Modeling social networks with node attributes using the multiplicative attribute graph model. In Proceedings of the 27th Conference on Uncertainty in Artificial Intelligence (UAI).Google ScholarGoogle Scholar
  16. G. Kossinets. 2006. Effects of missing data in social networks. Social Netw. 28, 247--268. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. G. Kossinets and D. Watts. 2006. Empirical analysis of an evolving social network. Science 311, 5757, 88--90.Google ScholarGoogle Scholar
  18. R. Kumar, J. Novak, P. Raghavan, and A. Tomkin. 2004. Structure and evolution of blogspace. Communi. ACM 47, 12, 35--39. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Leskovec, L. Backstrom, R. Kumar, and A. Tomkins. 2008. Microscopic evolution of social networks. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD). ACM, 462--470. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. R.-H. Li, J. X. Yu, and J. Liu. 2011. Link prediction: The Power of maximal entropy random walk. In Proceedings of the ACM Conference on Information and Knowledge Management (CIKM). Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. D. Liben-Nowell and J. Kleinberg. 2003. The link prediction problem for social networks. In Proceedings of the ACM Conference on Information and Knowledge Management (CIKM). 556--559. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. R. N. Lichtenwalter, J. T. Lussier, and N. V. Chawla. 2010. New perspectives and methods in link prediction. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD). Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. F. McSherry and M. Najork. 2008. Computing information retrieval performance measures efficiently in the presence of tied scores. In Proceedings of the European Conference on Information Retrieval (ECIR). Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. P. Melville and V. Sindhwani. 2010. Recommender systems. In Encyclopedia of Machine Learning. Springer.Google ScholarGoogle Scholar
  25. A. K. Menon and C. Elkan. 2011. Link prediction via matrix factorization. In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML/PKDD). Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. K. T. Miller, T. L. Griffiths, and M. I. Jordan. 2009. Nonparametric latent feature models for link prediction. In Proceedings of the Neural Information Processing Systems Conference (NIPS).Google ScholarGoogle Scholar
  27. J.-Y. Pan, H.-J. Yang, C. Faloutsos, and P. Duygulu. 2003. Automatic multimedia cross-modal correlation discovery. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD). Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. D. Rao, M. Paul, C. Fink, D. Yarowsky, T. Oates, and G. Coppersmith. 2011. Hierarchical bayesian models for latent attribute detection in social networks. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM).Google ScholarGoogle Scholar
  29. D. Rao, D. Yarowsky, A. Shreevats, and M. Gupta. 2010. Classifying latent user attributes in twitter. In Proceedings of the 2nd International Workshop on Search and Mining User-Generated Contents (SMUC). ACM, New York, NY, 37--44. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. J. Scripps, P.-N. Tan, F. Chen, and A.-H. Esfahanian. 2009. A matrix alignment approach for collective classification. In Proceedings of the Intenational Conference on Advances in Social Networks Analysis and Mining (ASONAM). Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. A. P. Singh and G. J. Gordon. 2008. Relational learning via collective matrix factorization. In Proceedings of the KDD. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. P. Symeonidis, E. Tiakas, and Y. Manolopoulos. 2010. Transitive node similarity for link prediction in social networks with positive and negative links. In Proceedings of the ACM Recommender System Conference (RecSys). Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. A. Talwalkar, S. Kumar, and H. Rowley. 2008. Large-scale manifold learning. In Proceedings of the CVPR. 273--297. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. B. Taskar, M.-F. Wong, P. Abbeel, and D. Koller. 2003. Link prediction in relational data. In Proceedings of the NIPS. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. H. Tong, C. Faloutsos, and J.-Y. Pan. 2006. Fast random walk with restart and its applications. In Proceedings of the ICDM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. S. H. Yang, B. Long, A. Smola, N. Sadagopan, Z. Zheng, and H. Zha. 2011. Like like alike—joint friendship and interest propagation in social networks. In Proceedings of the WWW. 537--546. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Z. Yin, M. Gupta, T. Weninger, and J. Han. 2010a. LINKREC: A unified framework for link recommendation with user attributes and graph structure. In Proceedings of the WWW. 1211--1212. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Z. Yin, M. Gupta, T. Weninger, and J. Han. 2010b. A unified framework for link recommendation using random walks. In Proceedings of the International Conference on Advances in Social Networks Analysis and Mining (ASONAM). Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. K. Yu, W. Chu, S. Yu, V. Tresp, and Z. Xu. 2006. Stochastic relational models for discriminative link prediction. In Proceedings of the NIPS.Google ScholarGoogle Scholar
  40. E. Zheleva and L. Getoor. 2009. To join or not to join: The illusion of privacy in social networks with mixed public and private user profiles. In Proceedings of the WWW. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Joint Link Prediction and Attribute Inference Using a Social-Attribute Network

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Intelligent Systems and Technology
          ACM Transactions on Intelligent Systems and Technology  Volume 5, Issue 2
          Special Issue on Linking Social Granularity and Functions
          April 2014
          347 pages
          ISSN:2157-6904
          EISSN:2157-6912
          DOI:10.1145/2611448
          Issue’s Table of Contents

          Copyright © 2014 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 30 April 2014
          • Accepted: 1 March 2013
          • Revised: 1 February 2013
          • Received: 1 October 2012
          Published in tist Volume 5, Issue 2

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader