[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2597008.2597145acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
Article

How the evolution of emerging collaborations relates to code changes: an empirical study

Published: 02 June 2014 Publication History

Abstract

Developers contributing to open source projects spontaneously group into "emerging'' teams, reflected by messages exchanged over mailing lists, issue trackers and other communication means. Previous studies suggested that such teams somewhat mirror the software modularity. This paper empirically investigates how, when a project evolves, emerging teams re-organize themselves-e.g., by splitting or merging. We relate the evolution of teams to the files they change, to investigate whether teams split to work on cohesive groups of files. Results of this study-conducted on the evolution history of four open source projects, namely Apache httpd, Eclipse JDT, Netbeans, and Samba-provide indications of what happens in the project when teams reorganize. Specifically, we found that emerging team splits imply working on more cohesive groups of files and emerging team merges imply working on groups of files that are cohesive from structural perspective. Such indications serve to better understand the evolution of software projects. More important, the observation of how emerging teams change can serve to suggest software remodularization actions.

References

[1]
J. Aranda and G. Venolia. The secret life of bugs: Going past the errors and omissions in software repositories. In 31st International Conference on Software Engineering, ICSE 2009, May 16-24, 2009, Vancouver, Canada, Proceedings, pages 298–308, 2009.
[2]
R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. Addison-Wesley, 1999.
[3]
G. Bavota, B. Dit, R. Oliveto, M. Di Penta, D. Poshyvanyk, and A. De Lucia. An empirical study on the developers’ perception of software coupling. In 35th International Conference on Software Engineering, ICSE ’13, San Francisco, CA, USA, May 18-26, 2013, pages 692–701. IEEE / ACM, 2013.
[4]
A. Begel, Y. P. Khoo, and T. Zimmermann. Codebook: discovering and exploiting relationships in software repositories. In Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 1, pages 125–134. ACM.
[5]
J. Bezdek. Pattern recognition with fuzzy objective function algorithms. New York: Plenum, 1981.
[6]
C. Bird, A. Gourley, P. T. Devanbu, M. Gertz, and A. Swaminathan. Mining email social networks. In Proceedings of the 2006 International Workshop on Mining Software Repositories, MSR 2006, Shanghai, China, May 22-23, 2006, pages 137–143, 2006.
[7]
C. Bird, D. S. Pattison, R. M. D’Souza, V. Filkov, and P. T. Devanbu. Latent social structure in open source projects. In Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2008, Atlanta, Georgia, USA, November 9-14, 2008, pages 24–35. ACM, 2008.
[8]
I. T. Bowman, R. C. Holt, and N. V. Brewster. Linux as a case study: Its extracted software architecture. In Proceedings of the 1999 International Conference on Software Engineering, ICSE’ 99, Los Angeles, CA, USA, May 16-22, 1999, pages 555–563. ACM, 1999.
[9]
G. Canfora, L. Cerulo, M. Cimitile, and M. Di Penta. Social interactions around cross-system bug fixings: the case of FreeBSD and OpenBSD. In Proceedings of the 8th International Working Conference on Mining Software Repositories, MSR 2011, Waikiki, Honolulu, HI, USA, May 21-28, 2011, pages 143–152, 2011.
[10]
G. Canfora, M. Di Penta, R. Oliveto, and S. Panichella. Who is going to mentor newcomers in open source projects? In Proceedings of the 20th ACM SIGSOFT Symposium on the Foundations of Software Engineering, pages 44:1–44:11, Cary, NC, USA, 2012.
[11]
E. Capra and A. I. Wasserman. A framework for evaluating managerial styles in open source projects. In Open Source Development, Communities and Quality, IFIP 20th World Computer Congress, Working Group 2.3 on Open Source Software, OSS 2008, September 7-10, 2008, Milano, Italy, pages 1–14. Springer, 2008.
[12]
M. Cataldo, J. D. Herbsleb, and K. M. Carley. Socio-technical congruence: a framework for assessing the impact of technical and work dependencies on software development productivity. In Proceedings of the Second International Symposium on Empirical Software Engineering and Measurement, ESEM 2008, October 9-10, 2008, Kaiserslautern, Germany, pages 2–11. ACM, 2008.
[13]
M. Cataldo, P. Wagstrom, J. D. Herbsleb, and K. M. Carley. Identification of coordination requirements: implications for the design of collaboration and awareness tools. In Proceedings of the 2006 ACM Conference on Computer Supported Cooperative Work, CSCW 2006, Banff, Alberta, Canada, November 4-8, 2006, pages 353–362, 2006.
[14]
J. Cohen. Statistical power analysis for the behavioral sciences. Lawrence Earlbaum Associates, 2nd edition, 1988.
[15]
W. J. Conover. Practical Nonparametric Statistics. Wiley, 3rd edition edition, 1998.
[16]
K. Crowston and J. Howison. The social structure of free and open source software development. First Monday, 10(2), 2005.
[17]
B. Dagenais, H. Ossher, R. K. E. Bellamy, M. P. Robillard, and J. de Vries. Moving into a new software project landscape. In Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 1, ICSE 2010, Cape Town, South Africa, 1-8 May 2010, pages 275–284. ACM, 2010.
[18]
A. Gordon. Classification (2nd edition). CRC Press, 1988.
[19]
R. J. Grissom and J. J. Kim. Effect sizes for research: A broad practical approach. Lawrence Earlbaum Associates, 2nd edition edition, 2005.
[20]
A. Guzzi, A. Bacchelli, M. Lanza, M. Pinzger, and A. van Deursen. Communication in open source software development mailing lists. In Proceedings of the 10th Working Conference on Mining Software Repositories, MSR ’13, San Francisco, CA, USA, May 18-19, 2013, pages 277–286. IEEE / ACM, 2013.
[21]
Q. Hong, S. Kim, S. C. Cheung, and C. Bird. Understanding a developer social network and its evolution. In IEEE 27th International Conference on Software Maintenance, ICSM 2011, Williamsburg, VA, USA, September 25-30, 2011, pages 323–332. IEEE, 2011.
[22]
L. Kaufman and P. J. Rousseeuw. Finding Groups in Data: An Introduction to Cluster Analysis. Wiley-Interscience, 2005.
[23]
L. Lopez, J. M. Gonzalez-Barahona, and G. Robles. Applying social network analysis to the information in cvs repositories. In Proceedings of the International Workshop on Mining Software Repositories. ACM Press, 2004.
[24]
S. Mancoridis, B. S. Mitchell, C. Rorres, Y.-F. Chen, and E. R. Gansner. Using automatic clustering to produce high-level system organizations of source code. In IWPC, pages 45–. IEEE Computer Society.
[25]
B. S. Mitchell and S. Mancoridis. On the automatic modularization of software systems using the bunch tool. IEEE Trans. Software Eng., 32(3):193–208, 2006.
[26]
D. Poshyvanyk, A. Marcus, R. Ferenc, and T. Gyimóthy. Using information retrieval based coupling measures for impact analysis. Empirical Software Engineering, 14(1):5–32, 2009.
[27]
R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria, 2012. ISBN 3-900051-07-0.
[28]
E. S. Raymond. The Cathedral and the Bazaar. O’Reilly & Associates, Inc., Sebastopol, CA, USA, 1st edition, 1999.
[29]
P. V. Singh. The small-world effect: The influence of macro-level properties of developer collaboration networks on open-source project success. ACM Trans. Softw. Eng. Methodol., 20(2), 2010.
[30]
D. Surian, D. Lo, and E.-P. Lim. Mining collaboration patterns from a large developer network. Reverse Engineering, Working Conference on, pages 269–273, 2010.
[31]
P. Wagstrom, J. Herbsleb, and K. Carley. A social network approach to free/open source software simulation. In Proceedings of the 1st International Conference on Open Source Systems, Genova, Italy, 2005.
[32]
J. Xu, Y. Gao, S. Christley, and G. Madey. A topological analysis of the open source software development community. In Proceedings of the 38th Annual Hawaii International Conference on System Sciences, pages 198.1–. IEEE Computer Society, 2005.
[33]
M. Zhou and A. Mockus. Does the initial environment impact the future of developers. In Proceedings of the 33rd International Conference on Software Engineering, ICSE 2011, Waikiki, Honolulu, HI, USA, May 21-28, 2011, pages 271–280. ACM, 2011.

Cited By

View all
  • (2024)Vulnerabilities Introduced by LLMs Through Code SuggestionsLarge Language Models in Cybersecurity10.1007/978-3-031-54827-7_9(87-97)Online publication date: 12-Apr-2024
  • (2023)Research on code entity recognition matching technologyEighth International Conference on Electronic Technology and Information Science (ICETIS 2023)10.1117/12.2682370(29)Online publication date: 20-Jun-2023
  • (2022)Quantifying community evolution in developer social networksProceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering10.1145/3540250.3549106(157-169)Online publication date: 7-Nov-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ICPC 2014: Proceedings of the 22nd International Conference on Program Comprehension
June 2014
325 pages
ISBN:9781450328791
DOI:10.1145/2597008
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

  • TCSE: IEEE Computer Society's Tech. Council on Software Engin.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 June 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Developers' Communication
  2. Empirical Studies
  3. Mining Software Repositories
  4. Open Source Projects

Qualifiers

  • Article

Conference

ICSE '14
Sponsor:

Upcoming Conference

ICSE 2025

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)1
Reflects downloads up to 13 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Vulnerabilities Introduced by LLMs Through Code SuggestionsLarge Language Models in Cybersecurity10.1007/978-3-031-54827-7_9(87-97)Online publication date: 12-Apr-2024
  • (2023)Research on code entity recognition matching technologyEighth International Conference on Electronic Technology and Information Science (ICETIS 2023)10.1117/12.2682370(29)Online publication date: 20-Jun-2023
  • (2022)Quantifying community evolution in developer social networksProceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering10.1145/3540250.3549106(157-169)Online publication date: 7-Nov-2022
  • (2021)Exploiting Natural Language Structures in Software Informal DocumentationIEEE Transactions on Software Engineering10.1109/TSE.2019.293051947:8(1587-1604)Online publication date: 1-Aug-2021
  • (2021)Do Communities in Developer Interaction Networks align with Subsystem Developer Teams? An Empirical Study of Open Source Systems2021 IEEE/ACM Joint 15th International Conference on Software and System Processes (ICSSP) and 16th ACM/IEEE International Conference on Global Software Engineering (ICGSE)10.1109/ICSSP-ICGSE52873.2021.00016(61-71)Online publication date: May-2021
  • (2020)Automating Intention MiningIEEE Transactions on Software Engineering10.1109/TSE.2018.287634046:10(1098-1119)Online publication date: 1-Oct-2020
  • (2019)Understanding Effects of Collaborations in Developing Mobile Computing Systems: Popularity, Efficiency, and QualityIEEE Access10.1109/ACCESS.2019.29043337(33380-33392)Online publication date: 2019
  • (2019)Measuring and analyzing code authorship in 1 + 118 open source projectsScience of Computer Programming10.1016/j.scico.2019.03.001Online publication date: Mar-2019
  • (2019)The life-cycle of merge conflicts: processes, barriers, and strategiesEmpirical Software Engineering10.1007/s10664-018-9674-xOnline publication date: 5-Feb-2019
  • (2018)How do developers resolve merge conflicts? an investigation into the processes, tools, and improvementsProceedings of the 2018 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering10.1145/3236024.3275430(952-955)Online publication date: 26-Oct-2018
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media