[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2652524.2652564acmconferencesArticle/Chapter ViewAbstractPublication PagesesemConference Proceedingsconference-collections
research-article

Patterns of folder use and project popularity: a case study of github repositories

Published: 18 September 2014 Publication History

Abstract

Context: Every software development project uses folders to organize software artifacts. Goal: We would like to understand how folders are used and what ramifications different uses may have. Method: In this paper we study the frequency of folders used by 140k Github projects and use regression analysis to model how folder use is related to project popularity, i.e., the extent of forking. Results: We find that the standard folders, such as document, testing, and examples, are not only among the most frequently used, but their presence in a project is associated with increased chances that a project's code will be forked (i.e., used by others) and an increased number of forks. Conclusions: This preliminary study of folder use suggests opportunities to quantify (and improve) file organization practices based on folder use patterns of large collections of repositories.

References

[1]
D. Barreau and B. A. Nardi. Finding and reminding: File organization from the desktop. SIGCHI Bull., 27(3):39--43, July 1995.
[2]
O. Bergman, S. Whittaker, M. Sanderson, R. Nachmias, and A. Ramamoorthy. The effect of folder structure on personal file navigation. Journal of the American Society for Information Science and Technology, 61(12):2426--2441, 2010.
[3]
A. Capiluppi, M. Morisio, and J. F. Ramil. The evolution of source folder structure in actively evolved open source systems. In 10th International Symposium on Software Metrics, pages 2--13, 2004.
[4]
L. Dabbish, C. Stuart, J. Tsay, and J. Herbsleb. Leveraging transparency. Software, IEEE, 30(1):37--43, 2013.
[5]
K. H. Fung, A. Aurum, and D. Tang. Social forking in open source software: An empirical study. In CAiSE Forum, pages 50--57, 2012.
[6]
S. Henderson. How do people manage their documents?: an empirical investigation into personal document management practices among knowledge workers. PhD thesis, ResearchSpace@ Auckland, 2009.
[7]
W. Jones, A. J. Phuwanartnurak, R. Gill, and H. Bruce. Don't take my folders away!: Organizing personal information to get things done. In CHI '05 Extended Abstracts on Human Factors in Computing Systems, pages 1505--1508, New York, NY, USA, 2005.
[8]
N. McDonald and S. Goggins. Performance and participation in open source software on github. In CHI'13 Extended Abstracts on Human Factors in Computing Systems, pages 139--144. ACM, 2013.
[9]
A. Mockus. Software support tools and experimental work. In V. Basili and et al, editors, Empirical Software Engineering Issues: Critical Assessments and Future Directions, volume LNCS 4336, pages 91--99. Springer, 2007.
[10]
M. Nagappan, T. Zimmermann, and C. Bird. Diversity in software engineering research. In Proceedings of the 2013 9th Joint Meeting on Foundations of Software Engineering, pages 466--476. ACM, 2013.
[11]
K. Peterson. The github open source development process. Technical report, Technical report, Mayo Clinic, 2013.
[12]
M. Zhou, A. Mockus, and D. Weiss. Learning in offshored and legacy software projects: How product structure shapes organization. In ICSE Workshop on Socio-Technical Congruence, Vancouver, Canada, May 19 2009.

Cited By

View all
  • (2024)The Impact of Code Ownership of DevOps Artefacts on the Outcome of DevOps CI BuildsProceedings of the 21st International Conference on Mining Software Repositories10.1145/3643991.3644924(543-555)Online publication date: 15-Apr-2024
  • (2024)COLARE: Commit Classification via Fine-grained Context-aware Representation of Code Changes2024 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)10.1109/SANER60148.2024.00082(752-763)Online publication date: 12-Mar-2024
  • (2024)Is There a Correlation Between Readme Content and Project Meta‐Characteristics?Software: Practice and Experience10.1002/spe.339055:3(589-609)Online publication date: 18-Nov-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ESEM '14: Proceedings of the 8th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement
September 2014
461 pages
ISBN:9781450327749
DOI:10.1145/2652524
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 September 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. folder use
  2. mining software repository
  3. project popularity

Qualifiers

  • Research-article

Funding Sources

Conference

ESEM '14
Sponsor:

Acceptance Rates

ESEM '14 Paper Acceptance Rate 23 of 123 submissions, 19%;
Overall Acceptance Rate 130 of 594 submissions, 22%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)29
  • Downloads (Last 6 weeks)4
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)The Impact of Code Ownership of DevOps Artefacts on the Outcome of DevOps CI BuildsProceedings of the 21st International Conference on Mining Software Repositories10.1145/3643991.3644924(543-555)Online publication date: 15-Apr-2024
  • (2024)COLARE: Commit Classification via Fine-grained Context-aware Representation of Code Changes2024 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER)10.1109/SANER60148.2024.00082(752-763)Online publication date: 12-Mar-2024
  • (2024)Is There a Correlation Between Readme Content and Project Meta‐Characteristics?Software: Practice and Experience10.1002/spe.339055:3(589-609)Online publication date: 18-Nov-2024
  • (2023)Approach to Formalizing Software Projects for Solving Design Automation and Project Management TasksSoftware10.3390/software20100062:1(133-162)Online publication date: 8-Mar-2023
  • (2023)Evaluating Code Metrics in GitHub Repositories Related to Fake News and Misinformation2023 IEEE/ACIS 21st International Conference on Software Engineering Research, Management and Applications (SERA)10.1109/SERA57763.2023.10197739(182-188)Online publication date: 23-May-2023
  • (2023)Exploring the Characteristics of Popular Deep Learning GitHub Repositories*2023 International Conference on Intelligent Computing and Next Generation Networks(ICNGN)10.1109/ICNGN59831.2023.10396759(1-6)Online publication date: 17-Nov-2023
  • (2023)A Comparative Study of Software Secrets Reporting by Secret Detection Tools2023 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (ESEM)10.1109/ESEM56168.2023.10304853(1-12)Online publication date: 26-Oct-2023
  • (2023)Study the correlation between the readme file of GitHub projects and their popularityJournal of Systems and Software10.1016/j.jss.2023.111806205(111806)Online publication date: Nov-2023
  • (2022)On the application of machine learning models to assess and predict software reusabilityProceedings of the 6th International Workshop on Machine Learning Techniques for Software Quality Evaluation10.1145/3549034.3561177(17-22)Online publication date: 7-Nov-2022
  • (2022)Toward Using Package Centrality Trend to Identify Packages in DeclineIEEE Transactions on Engineering Management10.1109/TEM.2021.312201269:6(3618-3632)Online publication date: Dec-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media