4.3.1 Reasons for Company Withdrawal.
The respondents mentioned diverse and comprehensive reasons as to why their companies withdrew from OpenStack. The thematic analysis of the e-mail responses reveals eight reasons classified into four categories. Note that respondents (six in this case) may cite multiple reasons. We synthesize all the reasons that emerged from the survey in Table
2.
The first category includes the reasons for withdrawal from the Company side, pointed out by 21 different companies. More specifically, one of the most mentioned reasons is “Commercial goal achieved” (i.e., 11 responses). For example, one respondent says “When we started using OpenStack, it needed more work to be usable. As it became more mature, this was no longer necessary, and we were able to effectively use it without dedicating scarce personnel time to bug fixes or feature development…”. The other most mentioned reason is “Commercial goal failed” with 11 responses. For example, one respondent says “Our commercial efforts to make a public cloud were ultimately unsuccessful…”. The last two reasons from the company aspect are being “acquired” (three companies belong to this type) and “closed” (only one company belongs to this type).
The second category indicates the reasons for withdrawal from the Community side. More specifically, three respondents complain that OpenStack is dominated by other companies. For example, one respondent says: “… We no longer believed we could compete in the IAAS space using OpenStack given the direction these large contributors were taking it. … More focused on pleasing traditional hardware and appliance vendors than simplicity… so we decided to get rid of it and as a result, leave the IT infrastructure market…”
The third category contains the withdrawal reasons from the Developer side. Three respondents indicated that their companies’ withdrawals are due to job-hopping of the core or solo employee(s), who were responsible for OpenStack-related business. For example, “I moved to Canonical and another employee to Red Hat, and the rest were not OpenStack savvy and eventually dropped it.”
The last category includes two withdrawal reasons from the Project side. One is a complaint about OpenStack’s maintenance difficulties. Three respondents mentioned this reason. For example, “The major takeaway for leadership was staggering difficulty maintaining a production-grade OpenStack cluster. OpenStack did a very poor job abstracting complexity away. Upgrading it every six months became daunting busywork until the stack was just frozen and eventually replaced with a new setup.” The other is the roadmap conflict between companies and the project of OpenStack. Two companies mentioned these reasons. For instance, one company simply tells that “It was deprioritized by the company because it was not aligned with the product roadmap.”
4.3.2 Results of Survival Analysis.
We identify the factors that may indicate companies’ withdrawal from the answers for RQ2 and RQ3.1. As the answers for RQ2 suggest, the contribution intensity of the withdrawn companies is less than that of the companies that joined in the same version and are still involved. Thus, we hypothesize that
H1: companies that make more contributions have higher survival rates. Companies hold different commercial goals when participating in OSS ecosystems [
74,
80]. In addition, two reasons for company withdrawal, i.e, goal achieved or failed, indicate that
H2: how companies use OpenStack to achieve different goals may relate to their survival rates. Two respondents mentioned the relationship of companies’ scale and their withdrawal, e.g., “
We are no longer going to use OpenStack as it proved too hard for a smallIT team to maintain.” Therefore, we also consider the scale of the company as a factor and hypothesize that
H3: companies of a larger scale tend to have higher survival rates. Since companies also complain that their withdrawals are because of other companies’ domination in OpenStack, we take domination as a factor and assume that
H4: the degree of a project being dominated has a negative impact on the survival rates of the companies participating in it. As a result, we identify four factors that may indicate company withdrawal.
10 It has been found that turnover is detrimental to OSS projects [
17,
34,
77]. Therefore, it is of interest to investigate the possibility of using explanatory factors (i.e.,
contribution intensity, commercial goals, scale, and domination) to predict company withdrawal in advance.
For each version, we measure a company’s contribution intensity following the equation
CI defined in Section
3.3. For the domination factor, we follow the previous work [
73], including the following two steps. (1) For each repository in a specific version, we measure its degree of domination by the ratio of the contributions made by the company with the most contributions to the total contributions received by the repository. (2) For each company in a specific version, we choose the domination value of the repository that the company has contributed the most commits to as the value of the domination factor. As pointed out by existing studies [
74,
75], the repository with the most commits for each company can present the company’s interest in achieving its goals. So we deem the domination in a company’s most interested repository may have the biggest impact on its withdrawal.
To identify the commercial goal of a company, we follow the method used in our previous studies [
74,
75], where the goals of some companies have been categorized by using thematic analysis [
7]. First, we search the Internet (using “OpenStack” and the company’s name as keywords) and collect the first 20 results. We also collect documents from the marketplace page on the official OpenStack website [
19] regarding the products, services, or solutions offered by companies. Then, the first two authors independently perform deductive coding [
16], i.e., apply the existing codes (from [
74,
75]) to the collected records to identify the goal of a company toward OpenStack. We find a high level of agreement between the two coders with a Cohen’s kappa coefficient [
39] of 0.87, which shows high inter-rater reliability. After coding, the two authors discussed their disagreements to reach a consensus. We synthesize all the commercial goals that emerged from the 120 companies (i.e., 60 withdrawn companies plus 60 sustaining companies) in Table
3.
We take the number of employees to represent each company’s scale, which is obtained by visiting the
About page of companies’ official websites or searching companies’ names on Linkedin and Crunchbase.
11 Since some companies only offer a scale range, we manually define five degrees by combining the existing criteria [
2] of determining company scale and the distribution of company employees collected in this study: one refers to “# < 10”; two refers to “10 <= # <100”; three refers to “100 <= # <1,000”; four refers to “1,000 <= # < 10,000”; and five refers to “# >= 10,000” (# refers to the number of employees of a company).
Before constructing the Cox model (as introduced in Section
3.4.2), we have investigated the distribution of the numeric variables, i.e.,
CI and
Domination. For
CI variable, which is detected with skewed distributions, we simply remove the top 1% of values (10 in 987 records and 977 remains) as high-leverage outliers by applying the method described by Patel [
51] and applied by Valiev et al. [
64]. Then we log-transformed
CI to satisfy the modeling assumptions. We also apply the
variance inflation factor (
VIF) to detect multicollinearity problems [
42] for the reliability and stability of the fitted model. The final regression equation is
Surv(tstart, tstop Status) \(\sim\) Log(CI)+ Goal + Scale + Domination,
where
tstart and
tstop in the response are used to set the time interval for observing each company, and we treat each version as a time interval.
Status in the response is a company’s survival status in the time range.
CI,
Goal,
Scale, and
Domination are predictors, and we treat
Goal and
Scale as categorical variables in the model. The results of the fitted model are shown in Table
4. We follow Johnson’s recommendation to use a p-value of 0.005 for statistical evidence instead of the commonly used value of 0.05 because using the latter value often leads to unreproducible results [
36].
As expected,
CI is significantly associated with increased survival rates, meaning that companies with more contributions tend to have higher possibilities of becoming long-term contributors when other factors are held constant. The
integrating business goal and
development infrastructure vendors have a higher possibility of withdrawing compared with the full solution-oriented companies. However, companies
being partial solution vendors tend to have a higher survival rate. As for the scale predictor, the results show that two categories of company scale are significant (i.e., p-values are close to zero). Specifically, large companies with 1,000 to 10,000 (i.e., scale: four) employees and more than 10,000 (scale: five) employees are associated with higher survival rates. For example, holding the other factors constant, having more than 10,000 employees reduces the hazard ratio of withdrawing by a factor of exp(coef) = 0.17, or 83%, when compared with the small ones with no more than ten employees. It may suggest that large companies have higher risk tolerance when involved in OSS ecosystems. Surprisingly,
Domination, as a key factor in our model and derived from our survey results, does not show statistical significance. The reason might be that its potential good effects on OSS, i.e., positively associated with the productivity of contributors and the quality of issue reports [
73], may offset the company’s rejection of domination. Finally, the p-values for all three overall tests (likelihood, Wald, and score) are less than 2e-16, indicating that the model is significant. Besides,
Concordanc12 of the model is 0.86, indicating that the model explains the observed data well and has a good predictive ability.