Abstract
The User Identity Resolution (UIR) problem is concerned with recognizing the same person with multiple virtual profiles in different online social networks (OSNs). Most of the existing methods focus only on the similarity of profile attributes or simply combine the profile attributes and linkages of friends. In this paper, we propose a novel Greedy-based Cross-Matching (GCM) algorithm, which combines profile attributes with linkage information of both friend and non-friend users. In the GCM algorithm, we first propose a greedy strategy for detecting candidate matching users using Profile Attributes Similarity (PAS) and User Surrounding Score (USS). We then define the User Matching Score (UMS), which combines PAS with network structures, to greedily determine matched users for the candidate ones. Finally, we utilize a novel cross-matching process inspired by Stable Marriage Problem (SMP) to further improve the matching accuracy. Experiments on Twitter and Facebook demonstrate that our method significantly improves the matching performance and outperforms the state-of-the-art algorithms.
This work was supported by National Science Foundation of China (No. 61272374,61300190), Specialized Research Fund for the Doctoral Program of Higher Education (No.20120041110046) and Key Project of Chinese Ministry of Education (No. 313011).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bartunov, S., Korshunov, A., Park, S.T., Ryu, W., Lee, H.: Joint link-attribute user identity resolution in online social networks. In: Proc. of the Sixth SNA-KDD Workshop at KDD (2012)
Cortis, K., Scerri, S., Rivera, I., Handschuh, S.: An ontology-based technique for online profile resolution. In: Jatowt, A., et al. (eds.) SocInfo 2013. LNCS, vol. 8238, pp. 284–298. Springer, Heidelberg (2013)
Gale, D., Shapley, L.S.: College admissions and the stability of marriage. American Mathematical Monthly, 9–15 (1962)
Goga, O., Perito, D., Lei, H., Teixeira, R., Sommer, R.: Large-scale correlation of accounts across social networks. Tech. rep. (2013)
Irani, D., Webb, S., Li, K., Pu, C.: Large online social footprints–an emerging threat. In: International Conference on Computational Science and Engineering, CSE 2009, vol. 3, pp. 271–276. IEEE (2009)
Jain, P., Kumaraguru, P., Joshi, A.: @ i seek ’fb.me’: identifying users across multiple online social networks. In: Proceedings of the 22nd International Conference on World Wide Web, pp. 1259–1268 (2013)
Joshi, A., Sodhi, J.: Attributes similarities supports profile matching in social network
Kong, X., Zhang, J., Yu, P.S.: Inferring anchor links across multiple heterogeneous social networks. In: Proceedings of the 22nd ACM International Conference on Information & Knowledge Management, pp. 179–188. ACM (2013)
Lu, C.T., Shuai, H.H., Yu, P.S.: Identifying your customers in social networks. In: Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, pp. 391–400. ACM (2014)
Malhotra, A., Totti, L., Meira Jr., W., Kumaraguru, P., Almeida, V.: Studying user footprints in different online social networks. In: Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012), pp. 1065–1070. IEEE Computer Society (2012)
Motoyama, M., Varghese, G.: I seek you: searching and matching individuals in social networks. In: Proceedings of the Eleventh International Workshop on Web Information and Data Management, pp. 67–75. ACM (2009)
Narayanan, A., Shmatikov, V.: De-anonymizing social networks. In: 2009 30th IEEE Symposium on Security and Privacy, pp. 173–187. IEEE (2009)
Quinlan, J.R.: C4.5: programs for machine learning, vol. 1. Morgan Kaufmann (1993)
Raad, E., Chbeir, R., Dipanda, A.: User profile matching in social networks. In: 2010 13th International Conference on Network-Based Information Systems (NBiS), pp. 297–304. IEEE (2010)
Shehab, M., Ko, M.N., Touati, H.: Social networks profile mapping using games. In: Proceedings of the 3rd USENIX Conference on Web Application Development, WebApps (2012)
Vosecky, J., Hong, D., Shen, V.Y.: User identification across multiple social networks. In: First International Conference on Networked Digital Technologies, NDT 2009, pp. 360–365. IEEE (2009)
Webb, G.I.: Multiboosting: A technique for combining boosting and wagging. Machine Learning 40(2), 159–196 (2000)
You, G.W., Hwang, S.W., Nie, Z., Wen, J.-R.: Socialsearch: enhancing entity search with social network matching. In: Proceedings of the 14th International Conference on Extending Database Technology, pp. 515–519. ACM (2011)
Zafarani, R., Liu, H.: Connecting users across social media sites: a behavioral-modeling approach. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 41–49. ACM (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Liang, W., Meng, B., He, X., Zhang, X. (2015). GCM: A Greedy-Based Cross-Matching Algorithm for Identifying Users Across Multiple Online Social Networks. In: Chau, M., Wang, G., Chen, H. (eds) Intelligence and Security Informatics. PAISI 2015. Lecture Notes in Computer Science(), vol 9074. Springer, Cham. https://doi.org/10.1007/978-3-319-18455-5_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-18455-5_4
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18454-8
Online ISBN: 978-3-319-18455-5
eBook Packages: Computer ScienceComputer Science (R0)