1 Introduction

Over the last decade, reverse logistics has attracted the attention of many researchers due to the increasing importance of saving raw materials, reducing environmental impacts and government regulations (Kapukaya et al. 2019). Disposal of industrial waste is always one of the most important issues in urban life. With the advent of the industrial revolution, these problems emerged as a result of waste introduction in the environment and its impacts on human health (Krumwiede and Sheu 2002). Recently, with the advent of the coronavirus, waste management of health masks and gloves is a global challenge for managers and government officials.

Since medicines contain complex chemical constituents, medical wastes are classified in the group of the most dangerous waste (Weraikat et al. 2016a). Inefficient methods in the treatment of pharmaceutical waste and lack of reverse logistics initiatives lead to unauthorized disposal and crucial environmental problems including water pollution, soil contamination, and production of toxic and non-toxic wastes (Harijani et al. 2017). Therefore, forming a proper reverse logistics network is essential for PSCs. However, studies on reverse PSCs are still at the conceptual level and do not cover all various return flows of expired medicines (Viegas et al. 2019). In other words, there is still a clear trend toward the development of waste management models in the field of PSCs.

Medicines have a definite lifetime/shelf-life and will be expired/discarded thereafter. These medicines can be stored in pharmacies, or distribution centers (DCs) for a limited time. Nevertheless, they may expire in pharmacies before being delivered to customers. So, there is a serious risk of selling outdated medicines in pharmacies (Narayana et al. 2019). A proper reverse logistics is therefore required to collect outdated medicines to recycle, remanufacture or destroy them by using specific methods. Since the performance of forward and reverse supply chains (SC) are highly correlated, appropriate integration of both forward and reverse flows is imperative (Vahdani et al. 2012). This way, a CLPSC has been introduced to merge forward and reverse flows. An efficient CLPSC structure helps to reduce the negative environmental impacts by adopting eco-friendly transportation systems and collecting industrial wastes (Amin and Zhang 2013). Moreover, in recent years, governmental laws and regulatory requirements for the disposal of pharmaceutical products on one hand, and social responsibilities, economic benefits, and gaining public image, on the other hand, have forced companies in competitive markets to focus on environmental and social issues as well as promoting their sustainable behavior (Kumar et al. 2019).

In this study, a scenario-based MOMILP model is proposed to design a sustainable multi-echelon CLPSC network. While there is a broad range of studies dedicated to PSC management, to the best of our knowledge, the problem of managing a sustainable closed-loop PSC considering some main real-world features such as different manufacturers’ brands, different reverse flows, the dependence of demand on quality and price, and competition in the market is rare. The proposed model aims to address the aforementioned features while maximizing the total profit, minimizing environmental damages and optimizing the social impacts of the CLPSC. This research applies a scenario-based game theory approach to deal with demand uncertainty, which is significant and can stabilize the competitive position of chain members. The proposed multi-objective model is then converted to a single-objective form using the LP-metrics method. Due to the complexity of the proposed model, a heuristic approach is applied to solve the model, especially in large scales. The framework of the research problem is depicted in Fig. 1.

Fig. 1
figure 1

Framework of the present research

The remainder of the paper is organized as follows. In Sect. 2 review of relevant studies is provided. Section 3 explains the research framework, assumptions, and the proposed model. Section 4 describes the solution methodology. In Sect. 5, a real case study and the numerical results are presented. Finally, Sect. 6 presents conclusions about the findings of the research and future directions.

2 Literature review

This section dedicates to investigate some related previous researches in the literature along with research gaps in two separate sub-sections.

2.1 Related studies

The PSC is one of the most sophisticated and critical SCs which is directly related to the life and health of people. PSC faces many challenges, such as inventory management, waste management, insufficient and uncertain information on demand, inventory shortage, and the expiration of medicines (Chen et al. 2019). Many studies have addressed these challenges in PSC models to some extent. Zandieh et al. (2018) studied a four-echelon PSC designed to minimize total cost and environmental gas emissions as well as maximize social welfare by considering the unemployment ratio in local and main DCs as regional factors. Zahiri et al. (2017) designed a PSC network under uncertainty, in which many backup technologies were utilized to manufacture products. The model aimed to minimize total cost, maximize social effects and minimize environmental impacts as well as non-resiliency. By considering a deterministic demand function, Timajchi et al. (2019) introduced an inventory-routing model for a PSC, which consisted of several hospitals and a radio-pharmacy center. Some main features embedded in the Timajchi et al. (2019)’s model are backordered demand, lost sales, accident risk, and product’s deterioration. Franco and Alfonso-Lizarazo (2019) proposed two mixed-integer programming models that the first one considered PSC costs as well as the products expiration date. While the second model tried to achieve the best expiration dates of required products to minimize the number of expired medicines. In Mousazadeh et al. (2015), TH and ɛ-constraint methods were applied to gain strategic and tactical results. Minimizing total cost and unmet demand were aimed for a pharmaceutical case study with a high degree of epistemic uncertainty.

However, the mentioned articles are mainly targeted sustainability issues, they have less paid attention to suitable planning for treatment of the pharmaceutical wastes. As well, a few numbers of researches have studied the issue of sustainability in the context of CLPSCs. Despite the importance of pharmaceutical products recycling, the issues of reverse logistics and closed-loop of pharmaceutics have rarely been noticed by researchers. In other words, early studies on reverse logistics and environmental issues in PSCs have been more at the conceptual level. Abbas and Farooquie (2018) worked conceptually on returning unsold/unwanted medicines from retailers to producers. These medicines included items that either did not match prescription or they had expired. Susarla and Karimi (2012) presented a Mixed-Integer Linear Programming (MILP) model for a globally integrated network of multinational pharmaceutical enterprises, in which the international tax coefficient and waste disposal were embedded. Narayana et al. (2019) investigated the Taguchi method in a dynamic model of two brands with reverse processes. They derived that contrary to the improvement of reverse characteristics, safety stock cannot affect market flooding alleviating. Taleizadeh et al. (2019) mentioned that customer engagement in the collection of unwanted pharmaceuticals, reselling them with a lower price or transferring them to developing countries can increase SC’s profitability and sustainability. Alizadeh et al. (2020) designed a forward and reverse network for medical disposable supplies to maximize total profit as well as minimize biological risks. Liu et al. (2020) developed a bi-level programming model for recycling pharmaceutical products with governmental encourages considering sustainability aspects. In another study, a multi-mode capacitated transportation system was considered in a forward and backward relief supply chain to minimize total costs and maximize the whole reliability (Madani et al. 2020). Ahlaqqach et al. (2018) designed a CLPSC network to optimize the goals of economics, social responsibility and transportation risk of End Of Life (EOL) pharmaceutical products.

The reverse logistics policies in real-world PSCs are highly sensitive to the product’s characteristics. However, most studies on PSCs’ reverse logistics have neglected this issue and considered return flows as a percentage of the forward flows. Regardless of the returned product’s characteristics, they also assumed that a percentage of the total return flows could be recycled. Weraikat et al. (2016b) investigated a PSC in which the decentralized negotiation process was used for collecting unwanted medicines in customer areas. A constant percentage was considered for products’ recycling in their model. Weraikat et al. (2016b) proposed a reverse PSC to find the effect of customer's incentives on facilitating the return of leftover medicines and improve the sustainability of a drug SC. They divided the return flows into three categories and assigned a percentage of the total return flow to each category to determine the size of each category. To increase the reliability of a four-layer green SC, Moslemi et al. (2017) concentrated on the pharmaceutical products priority for each hospital. As well, the raw materials used to produce medicines could be supplied from either recycled material (with lower prices) or normal ones.

In today’s business environment, the competitive advantages of companies are determined by indicators such as speed and accuracy in service, customer satisfaction and loyalty, lower supply chain costs and ultimately lower finished product costs (Grigoroudis et al. 2013). The goal of pharmacies in the PSC is to fulfill customers’ requirements in a productive way.

However, accurately estimating the demand or flow of a product in a PSC is a complex issue since the demand for medicines is highly sensitive to the price and quality of the product. In today's uncertain world, miscalculation of these parameters can lead to inventories surpluses or shortages in pharmacies. This can not only cause financial losses and a high volume of expired products but also have a significant impact on the patient's health. However, previous studies have rarely attempted to deal with uncertainty in CLPSCs.

Table 1 summarizes some main studies on PSC management. Inspire from real-world and considering Table 1, this paper deals with the problem of “designing and planning a sustainable CLPSC in an uncertain and competitive environment” and by considering some real-world requirements such as (1) sensitivity of customer demand to price and quality, (2) brand diversity of pharmaceutical product, (3) medicine waste management and, (4) integration of forward and backward products’ flows by Simultaneous Pickup and Delivery (SPD) which is rarely addressed in the literature.

Table 1 Comparison of the proposed study with closely related studies

2.2 Research gap analysis and contributions

The literature shows that there are a few papers which studied pharmaceutical closed-loop supply chains, especially by quantitative approaches. To the best of our knowledge, the CLPSCs are rarely studied from a waste management perspective too. In the current study to concentrate on the waste management requirements in CLPSCs, medicines and their wastes are classified by their characteristics and components in three classes.

Furthermore, to the best of our knowledge, there is no study, which investigates the impact of price and quality of medicines on the PSC network design and performance. This research applies a novel scenario-based game theory approach to estimate the amount of demand due to competition between price and quality. Also, this paper is among the first attempts that design and analyze a CLPSC in an uncertain and competitive market. This way, a novel scenario-based MOMILP model is proposed to integrate sustainable CLPSC network designing and Vehicle Routing Problem with Simultaneous Pickup and Delivery (VRPSPD) under uncertainty.

Based on the literature and Table 1, some main features that distinguish this paper from existing research are as follows:

  • Designing a new multi-product, multi-period sustainable CLPSC network, which is considered both strategic decisions (i.e., determining numbers and optimized locations of remanufacturers and disposal centers) and tactical decisions (i.e., the assignment of pharmacies to DCs, and optimum flow of products in the network).

  • Designing a sustainable CLPSC for medicines with expiration date and by integrating requirements of pharmaceutical waste management. For this purpose, all possible reverse flows are classified into three categories according to the product type: disposal, remanufacturing, and recycling flows.

  • Designing and analyzing a CLPSC under competitive conditions by taking into account different brands of manufacturers. This way, the sensitivity of customer’s demand to the product’s price and quality is considered by applying a scenario-based game theory approach.

  • Integrating concepts of CLPSC and capacitated VRPSPD inspired by the real world.

3 Problem description

3.1 Modeling framework

In this study, a multi-product multi-period sustainable CLPSC network is configured that consists of multiple manufacturers, DCs and pharmacies in the forward network, addition to disposal centers and remanufacturers in the reverse network. Medicines, with a specified lifetime, are transferred from manufacturers to DCs. The medicines can be sent to pharmacies at the same period or be stored in DCs for several periods before their expiration date and then be transferred to pharmacies. Vehicles in each period transfer medicines from DCs to the pharmacies assigned to them, and simultaneously collect expired products from pharmacies. Then, depending on the type of outdated medicines, decisions are made to transfer them to the related centers. Three categories of reverse flows are taken into consideration in the proposed model.

The first category of return flows includes expired medicines that are hazardous to the environment, humans, and animals. This kind of return flow can contaminate water or be ingested by animals and harm them. Therefore, they should be sent to the disposal centers and their proper disposal is of significant importance. The second class of disposals can be remanufactured and brought back to the forward PSC. Since the reprocessing cycle of these products can be much shorter and cheaper than the initial production cycle, remanufacturing of these products is very beneficial, they should, therefore, be sent to remanufactures. Finally, the last category is related to the products that are recyclable and can be used in non-pharmaceutical SCs. They can be transferred to DCs to be sold in secondary markets. Moreover, considering the uncertainty of medicines’ demand which is sensitive to price and quality, a scenario-based competition has been taken into account. This way, each brand’s demand is determined according to its price and quality. The competition is examined under three scenarios; optimistic, most likely and pessimistic.

The proposed model aims at maximizing total profit and social impacts of the PSC, in addition to minimizing carbon emissions, through determining the optimum routes and flows of medicines from DCs to pharmacies as well as the number and location of remanufacturers and disposal centers. Figure 2 illustrates a schematic view of the considered CLPSC.

Fig. 2
figure 2

Schematic view of the proposed CLPSC

The main assumptions considered are as follows:

  • The production capacity of manufacturers and remanufacturers is unlimited.

  • The total vehicle load at any point of a route cannot exceed the vehicle capacity.

  • Each pharmacy has a time window and can be served by only one DC.

  • The demand of each pharmacy is a function of customer’s willingness to buy a medicine considering its price and quality.

  • Perishability is just allowed in pharmacies.

  • Each medicine has a definite shelf-life, which will expire thereafter, and should be immediately collected after expiration.

  • Each medicine is entered into a DC with its full lifetime and leaves DC (to deliver to a pharmacy) with a remaining shelf-life.

The conceptual outline of the proposed CLPSC is depicted in Fig. 3.

Fig. 3
figure 3

Conceptual outline of the proposed CLPSC

3.2 Proposed MOMILP

In this section, notations and then the proposed mathematical model are presented.

Sets

\(m\in \{1,\dots ,M\}\):

Set of manufacturers.

\(j\in \{1,\dots ,J\}\):

Set of DCs.

\(r\in \{1,\dots ,R\}\):

Set of pharmacies.

\(d\in \{1,\dots ,D\}\):

Set of disposal centers.

\(c\in \{1,\dots ,C\}\):

Set of remanufacturers.

\(n\in \{M,J,D,R,C\}\):

Set of nodes.

\(p\in \{1,\dots ,P\}\):

Set of products.

\(g\in \{1,\dots ,G\}\):

Set of vehicles.

\(t,k\in \{1,\dots ,T\}\):

Set of periods of time.

\({i}_{p}\in \{1,\dots ,I={SL}_{p}\}\):

Set of remaining lifetimes of products.

\(s\in S\):

Set of scenarios.

Parameters

\({FTC}_{g}\):

Transportation cost of vehicle g per unit of distance.

\({IDC}_{jt}\):

Unit inventory cost in DC j in period t.

\({IRC}_{rt}\):

Unit inventory cost in pharmacy r in period t.

\({EXC}_{pmt}\):

Unit expiration cost of product p with brand of manufacturer m in period t.

\({SHC}_{pmrt}\):

Unit lost sale cost of product p with brand of manufacturer m in pharmacy r in period t.

\({CRM}_{c}\):

Fixed opening cost of remanufacturer c.

\({CD}_{d}\):

Fixed opening cost of disposal center d.

\({RMCE}_{pm}\):

Unit remanufacturing cost of product p with brand of manufacturer m.

\({DCE}_{pm}\):

Unit disposal cost of product p with brand of manufacturer m.

\({CTW}_{r}\):

Cost of time window violation in pharmacy r.

\({Tt}_{n{n}^{{\prime}}}\):

Travel time from node n to \({n}^{{{\prime}}}\)

\({VC}_{g}\):

Maximum capacity of vehicle g.Maximum capacity of vehicle g.

\({ID}_{j}^{max}\):

Maximum allowable inventory level in DC j.

\({IR}_{r}^{max}\):

Maximum allowable inventory level in pharmacy r.

\({CEV}_{g}\):

Amount of carbon emitted per unit of distance by vehicle g.

\({CEM}_{mp}\):

Carbon emitted by producing a unit of product p in manufacturer m.

\({CERM}_{cp}\):

Carbon emitted by remanufacturing a unit of product p in remanufacturer c.

\({CED}_{dp}\):

Carbon emitted by disposing of a unit of product p in disposal center d.

\({EBRM}_{p}\):

Environmental benefit results from remanufacturing a unit of product p.

\({EBD}_{p}\):

Environmental benefit results from disposal of a unit of product p.

\({FJO}_{d}\):

Number of fixed jobs created by opening disposal center d.

\({FJO}_{c}\):

Number of fixed jobs created by opening remanufacturer c.

\({VJO}_{d}\):

Number of variable jobs required for disposing of a unit of product in disposal center d.

\({VJO}_{c}\):

Number of variable jobs required for processing a unit of product in remanufacturer c.

\({SL}_{p}\):

Shelf life of product p.

\({DS}_{n{n}^{{\prime}}}\):

Distance between node n and \({n}^{{{\prime}}}\)

\({D}_{pmrts}^{*}\):

Equilibrium demand for product p with brand of manufacturer m in pharmacy r in period t under scenario s.

\({V}_{pmrs}^{*}\):

Equilibrium price of product p with brand of manufacturer m in pharmacy r under scenario s.

\({ISEX}_{p}\):

Sale price of a unit of product p in the secondary market.

\({xy}_{p}\):

1 if product p must be disposed of; 0, otherwise.

\({xz}_{p}\):

1 if product p is recyclable; 0, otherwise.

\({yz}_{p}\):

1 if product p is reproducible; 0, otherwise.

L:

A large number.

\({P}_{S}\):

Occurrence probability of scenario s.

Decision variables

\({Y}_{jn{n}^{{\prime}}gts}\):

1 if in the path of vehicle g assigned to DC j, node \({n}^{{{\prime}}}\) is met after node n under scenario s; 0, otherwise.

\({Z}_{ds}\):

1 if disposal center d is selected under scenario s; 0, otherwise.

\({H}_{cs}\):

1 if remanufacturer c is selected under scenario s; 0, otherwise.

\({FPM}_{pmjts}^{{i}_{p}}\):

Flow of product p with remaining lifetime \({i}_{p}\) that is sent from manufacturer m to DC j in period t under scenario s.

\({FPR}_{pmjrkts}^{{i}_{p}}\):

Flow of product p of manufacturer m that is received by DC j in period k and then sent to pharmacy r in period t with remaining lifetime \({i}_{p}\) under scenario s.

\({NVM}_{gmjts}\):

Number of vehicle g transferring products from manufacturer m to DC j in period t under scenario s.

\({NVJ}_{gjts}\):

Number of vehicle g transferring products from DC j to pharmacies in period t under scenario s.

\({ID}_{pmjts}\):

Inventory level of product p with brand of manufacturer m in DC j in period t under scenario s.

\({IR}_{pmrts}\):

Inventory level of product p with brand of manufacturer m in pharmacy r in period t under scenario s.

\({AT}_{rts}\):

Product’s delivery time to pharmacy r in period t under scenario s.

\({LS}_{pmrts}\):

Lost sale amount of product p with brand of manufacturer m in pharmacy r in period t under scenario s.

\({REX}_{pmrkts}\):

Amount of product p with brand of manufacturer m received by pharmacy r in period k and expired in period t under scenario s.

\({SP}_{pmrkts}\):

Amount of product p with brand of manufacturer m received by pharmacy r in period k and sold in period t under scenario s.

\({Q}_{jnts}\):

Total load of vehicle g assigned to DC j at node n in period t under scenario s.

\({ETW}_{rts}\):

Earliness in pharmacy r in period t under scenario s.

\({LTW}_{rts}\):

Tardiness in pharmacy r in period t under scenario s.

Considering above notations, the research problem is formulated as follows:

Objective functions

$$\begin{aligned} &\mathrm{Max} \sum_{S}\left(\left(\left(\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{r=1}^{R}\sum_{k=1}^{t}\sum_{t=k}^{T}{V}_{pmrs}^{*}\times {SP}_{pmrkts}+\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{r=1}^{R}\sum_{k=1}^{t}\sum_{t=k}^{T}{ISEX}_{p}\times {REX}_{pmrkts}\times {xz}_{p}\right)\right.\right.\\ &\left.\left.\quad -\left(\sum_{c=1}^{C}{CRM}_{c}\times {H}_{cs}+\sum_{d=1}^{D}{CD}_{d}\times {Z}_{ds}+\sum_{m=1}^{M}\sum_{j=1}^{J}\sum_{t=1}^{T}\sum_{g=1}^{G}{FTC}_{g}\times {DS}_{mj}\times {NVM}_{gmjts}\right.\right.\right.\\ &\left.\left.\left.\quad +\sum_{j=1}^{J}\sum_{r=1}^{R}\sum_{d=1}^{D}\sum_{c=1}^{C}\sum_{t=1}^{T}{FTC}_{g}\times {DS}_{n{n}^{{\prime}}}\times {NVJ}_{gjts}+\sum_{j=1}^{J}\sum_{m=1}^{M}\sum_{t=1}^{T}\sum_{p=1}^{P}{IDC}_{jt}\times {ID}_{pmjts}\right.\right.\right.\\ &\left.\left.\left.\quad+\,\sum_{m=1}^{M}\sum_{r=1}^{R}\sum_{t=1}^{T}\sum_{p=1}^{P}{IRC}_{rt}\times {IR}_{pmrts}+\sum_{m=1}^{M}\sum_{r=1}^{R}\sum_{t=1}^{T}\sum_{p=1}^{P}{SHC}_{pmrt}\times {LS}_{pmrts}\right.\right.\right.\\ &\left.\left.\left.\quad +\sum_{r=1}^{R}\sum_{t=1}^{T}\left({ETW}_{rts}+{LTW}_{rts}\right)\times {CTW}_{r}+\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{r=1}^{R}\sum_{k=1}^{t}\sum_{t=k}^{T}{EXC}_{pmt}\times {REX}_{pmrkts}\right.\right.\right.\\ &\quad \left.\left.\left.+\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{r=1}^{R}\sum_{k=1}^{t}\sum_{t=k}^{T}{DCE}_{pm}\times {REX}_{pmrkts}\right.\right.\right.\\&\left.\left.\left.\quad \times\,{xy}_{p}+\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{r=1}^{R}\sum_{k=1}^{t}\sum_{t=k}^{T}{RMCE}_{pm}\times {REX}_{pmrkts}\times {yz}_{p}\right)\right)\times {P}_{S}\right).\end{aligned}$$
(1)

The first objective function (1) maximizes the expected value of the total PSC profit calculated from average revenues minus costs. The revenues consist of selling medicines to customers and recycled products to secondary markets. The costs include opening cost of remanufacturers and disposal centers, transportation costs of products and expired products, inventory costs, the penalty of unsatisfied demand, time-window violation costs, expiration costs, and costs of disposing and remanufacturing of outdated products.

$$\begin{aligned}&\mathrm{Min}\sum_{S}\left(\left(\sum_{m=1}^{M}\sum_{j=1}^{J}\sum_{t=1}^{T}\sum_{g=1}^{G}{CEV}_{g}\times {DS}_{mj}\times {NVM}_{gmjts}+\sum_{j=1}^{J}\sum_{r=1}^{R}\sum_{d=1}^{D}\sum_{c=1}^{C}\sum_{t=1}^{T}{CEV}_{g}\times {DS}_{n{n}^{{\prime}}}\times {NVJ}_{gjts}\right.\right.\\ &\left.\left.\quad +\sum_{j=1}^{J}\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{{i}_{p}=1}^{{SL}_{p}}\sum_{t=1}^{T}{CEM}_{mp}\times {FPM}_{pmjts}^{{i}_{p}}\right.\right.\\ &\left.\left.\quad +\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{d=1}^{D}\sum_{r=1}^{R}\sum_{k=1}^{t}\sum_{t=k}^{T}{(CED}_{dp}-{EBD}_{p})\times {xy}_{p}\times {REX}_{pmrkts}\right.\right.\\ &\left.\left.\quad +\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{c=1}^{C}\sum_{r=1}^{R}\sum_{k=1}^{t}\sum_{t=k}^{T}{(CERM}_{cp}-{EBRM}_{p})\times {xz}_{p}\times {REX}_{pmrkts}\right)\times {P}_{S}\right).\end{aligned}$$
(2)

The second objective function (2) minimizes the expected value of the environmental impacts of the PSC. The first to third terms indicate carbon emissions due to the transportation and manufacturing of pharmaceutical products. Besides, the fourth and fifth terms outline the environmental impacts of disposal and remanufacturing of medicines, expressed as the difference between carbon emission and environmental benefits.

$$\begin{aligned}&\mathrm{Max}\sum_{S}\left(\left(\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{r=1}^{R}\sum_{k=1}^{t}\sum_{t=k}^{T}{JO}_{d}\times {Z}_{ds}\right.\right.\\ &\left.\left.\quad +\,\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{r=1}^{R}\sum_{k=1}^{t}\sum_{t=k}^{T}{JO}_{c}\times {H}_{cs}\right.\right.\\ &\left.\left.\quad +\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{c=1}^{C}\sum_{r=1}^{R}\sum_{k=1}^{t}\sum_{t=k}^{T}{VJO}_{c}\times {xz}_{p}\times {REX}_{pmrkts}\right.\right.\\ &\left.\left.\quad +\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{d=1}^{D}\sum_{r=1}^{R}\sum_{k=1}^{t}\sum_{t=k}^{T}{VJO}_{d}\times {xy}_{p}\times {REX}_{pmrkts}\right)\times {P}_{S}\right). \end{aligned}$$
(3)

The third objective function (3) states the social goal, which maximizes the expected value of job opportunities resulting from the opening of disposal centers and remanufacturers.

Constraints

$$\sum_{p=1}^{P}{FPM}_{pmjts}^{{i}_{p}={SL}_{p}}\le \sum_{g=1}^{G}{NVM}_{gmjts} \times {VC}_{g} \quad \forall m,j,t,s$$
(4)
$${FPM}_{pmjks}^{{i}_{p}={SL}_{p}}+{FPRM}_{pmcjks}^{{i}_{p}={SL}_{p}}=\sum_{r=1}^{R}\sum_{t=k}^{{SL}_{p}+k-1}{FPR}_{pmjrkts}^{{i}_{p}={SL}_{p}-t+k} \quad \forall p,m,j,k,s$$
(5)
$$\sum_{r=1}^{R}\sum_{g=1}^{G}{Y}_{jjrgts}\ge 1 \quad \forall j,t,s$$
(6)
$$\sum_{n=1}^{N}\sum_{\begin{array}{c}{j}^{{\prime}}=1\\ {j}^{{\prime}}\ne j\end{array}}^{J}{Y}_{jn{j}^{{\prime}}ts}=0 \quad \forall j,t,s$$
(7)
$$\sum_{g=1}^{G}{Y}_{jnngts}=0 \quad \forall j,n,t,s$$
(8)
$$\sum_{n=1}^{N}{Y}_{jn{n}^{{\prime}}ts}=\sum_{n=1}^{N}{Y}_{j{n}^{{\prime}}nts} \quad \forall j,{n}^{{{\prime}}},t,s$$
(9)
$$\sum_{j=1}^{J}\sum_{g=1}^{G}{Y}_{jjrgts}\le 1 \quad \forall r,t,s$$
(10)
$$\sum_{g=1}^{G}{Y}_{jjngts}=0 \quad \forall \mathrm{ j},n\in \left\{d,c,{j}^{{\prime}}\right\},t,s$$
(11)
$$\sum_{r=1}^{R}\sum_{g=1}^{G}{Y}_{jnrgts}=0 \quad \forall \mathrm{ j},n\in \left\{d,c\right\},t,s$$
(12)
$$\sum_{m=1}^{M}\sum_{p=1}^{p}\sum_{k=1}^{t}{FPR}_{pmjrkts}^{{i}_{p}={SL}_{p}-t+k}\le L \times \sum_{n=1}^{N}\sum_{g=1}^{G}{Y}_{jnrgts} \quad \forall j,r,t,s$$
(13)
$${REX}_{pmrkts}=\sum_{j=1}^{J}\sum_{b=1}^{k}{FPR}_{pmjrbks}^{{i}_{p}=t-k}-\sum_{b=k}^{t}{SP}_{pmrkbs} \quad \forall p,m,r,t, k\le t,s$$
(14)
$$\sum_{m=1}^{M}{I}_{pmjts}=\sum_{m=1}^{M}{I}_{pmjt-1s}+\sum_{m=1}^{M}{FPM}_{pmjts}^{{i}_{p}={SL}_{p}}-\sum_{r=1}^{R}\sum_{m=1}^{M}\sum_{k=1}^{t}{FPR}_{pmjrkts}^{{i}_{p}={SL}_{p}-t+k} \quad \forall p,j,t,s$$
(15)
$$\sum_{m=1}^{M}{I}_{pmrts}=\sum_{m=1}^{M}{I}_{pmrt-1s}+\sum_{m=1}^{M}\sum_{k=1}^{t}{FPR}_{pmjrkts}^{{i}_{p}={SL}_{p}-t+k}-\sum_{m=1}^{M}\sum_{k=1}^{t}{SP}_{pmrkts}-\sum_{m=1}^{M}\sum_{k=1}^{t}{REX}_{pmrkts} \quad \forall p,r,t,s$$
(16)
$$\sum_{p=1}^{P}\sum_{m=1}^{M}{I}_{pmets} \le {I}_{n}^{max} \quad \forall n\in \left\{r,j\right\},t,s$$
(17)
$${D}_{pmrts}^{*}-\sum_{k=1}^{t}{SP}_{pmrkts}={LS}_{pmrts} \quad \forall p,m,r,t,s$$
(18)
$${AT}_{{r}^{{\prime}}ts}\ge {AT}_{rts}+{Tt}_{r{r}^{{\prime}}}-L\times \left(1-\sum_{j=1}^{J}{Y}_{jr{r}^{{\prime}}gts}\right)\quad \forall r,{r}^{{{\prime}}},t,s$$
(19)
$${AT}_{{r}^{{\prime}}ts}\le {AT}_{rts}+{Tt}_{r{r}^{{\prime}}}+L\times \left(1-\sum_{j=1}^{J}{Y}_{jr{r}^{{\prime}}gts}\right)\quad \forall r,{r}^{{{\prime}}},t,s$$
(20)
$${AT}_{rts}\ge {Tt}_{jr}-L\times \left(1-\sum_{{j}^{{\prime}}=1}^{J}{Y}_{{j}^{{\prime}}jrgts}\right)\quad \forall j,r,t,s$$
(21)
$${AT}_{rts}\le {Tt}_{jr}+L\times \left(1-\sum_{{j}^{{\prime}}=1}^{J}{Y}_{{j}^{{\prime}}jrgts}\right)\quad \forall j,r,t,s$$
(22)
$${AT}_{rts}+{ETW}_{rts}\ge {LW}_{r} \quad \forall r,s$$
(23)
$${AT}_{rts}-{LTW}_{rts}\le {UW}_{r} \quad \forall r,s$$
(24)
$${Q}_{jjts}=\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{r=1}^{R}\sum_{k=1}^{t}{FPR}_{pmjrkts}^{{i}_{p}} \quad \forall j,t,s$$
(25)
$${Q}_{{jr}^{{\prime}}ts}\ge {Q}_{jnts}-\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{k=1}^{t}{FPR}_{pmj{r}^{{\prime}}kts}^{{i}_{p}}+\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{k=1}^{t-1}{REX}_{pm{r}^{{\prime}}kt-1s}- M\times \left(1-{Y}_{jn{r}^{{\prime}}gts}\right)\quad \forall j,n,r\in \left\{j,r\right\},t,s$$
(26)
$${Q}_{jnts}\le \sum_{g=1}^{G}{NVJ}_{gjts}\times {VC}_{g} \quad \forall j,n\in \left\{j,r\right\},t,s$$
(27)
$$\sum_{r=1}^{R}\sum_{j=1}^{J}\sum_{g=1}^{G}{Y}_{jrdgts}\le {Z}_{ds} \quad \forall d,s,t$$
(28)
$$\sum_{n\in \left\{r,d\right\}}\sum_{j=1}^{J}\sum_{g=1}^{G}{Y}_{jncgts}\le {H}_{cs} \quad \forall d,s,t$$
(29)
$$\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{r=1}^{R}\sum_{k=1}^{t-1}{(REX}_{pmrkt-1s}\times {xy}_{p})\le L\times \sum_{r=1}^{R}\sum_{j=1}^{J}\sum_{g=1}^{G}{Y}_{jrdgts} \quad \forall d,t,s$$
(30)
$$\sum_{m=1}^{M}\sum_{p=1}^{P}\sum_{r=1}^{R}\left(\left(\sum_{k=1}^{t-1}{REX}_{pmrkt-1s}+\sum_{k=1}^{t-2}{REX}_{pmrkt-2s}\right)\times {yz}_{p}\right)\le L\times \sum_{j=1}^{J}\sum_{n\in \left\{r,d\right\}}\sum_{g=1}^{G}{Y}_{jncgts} \quad \forall c,t,s$$
(31)
$${FPRM}_{pmcjts}^{{i}_{p}={SL}_{p}}\ge \sum_{r=1}^{R}{REX}_{pmrkt-2s}\times {yz}_{p}- M\times \left(1-\sum_{n\in \left\{r,d\right\}}\sum_{g=1}^{G}{Y}_{jncgts}\right)\quad \forall p,m,c,j,t,s$$
(32)
$${FPRM}_{pmcjts}^{{i}_{p}={SL}_{p}}\le \sum_{r=1}^{R}{REX}_{pmrkt-2s}\times {yz}_{p}+ M\times \left(1-\sum_{n\in \left\{r,d\right\}}\sum_{g=1}^{G}{Y}_{jncgts}\right)\quad \forall p,m,c,j,t,s$$
(33)
$$\begin{aligned}&{Y}_{jn{n}^{{\prime}}gts} , {Z}_{ds} , {H}_{cs}\in \left\{\mathrm{0,1}\right\} and\\ & {FPM}_{pmjts}^{{i}_{p}} , {FPR}_{pmjrkts}^{{i}_{p}} , {NVM}_{gmjts} , {NVJ}_{gjts} , {ID}_{pmjts} , {IR}_{pmrts} ,\\ & {AT}_{rts} ,{LS}_{pmrts} , {REX}_{pmrkts} , {SP}_{pmrkts} ,{Q}_{jnts} , {ETW}_{rts} ,{LTW}_{rts}\ge 0 \quad \forall m,t,i,j,k,s. \end{aligned}$$
(34)

Constraint (4) states vehicles capacity limitation. Constraint (5) addresses the relation between input and output flows to DCs. Constraints (6) and (7) ensure that a route starts from a DC and returns to it too. Constraints (79) confirm a tour construction for DCs. Since the starting and ending point of each tour is one DC, there is no flow from each node to itself, and each vehicle enters a node, should go out as well. Constraint (10) grantees that each pharmacy is allocated exactly to one DC. Constraint (11) bans direct movements from DCs to disposal centers, remanufacturers, and other DCs. Constraint (12) prohibits movements from disposal centers or remanufacturers to pharmacies. Constraint (13) indicates that the pharmacy needs to be visited if there is a flow in a given period. Constraint (14) shows the expiration of products in pharmacies. Constraints (15) and (16) express inventory balance equations in DCs and pharmacies respectively. Constraint (17) acknowledges that the inventory amount cannot exceed the maximum possible inventory level. Constraint (18) calculates the number of lost sales. Constraints (1922) represent the product’s arrival time to each pharmacy. Constraints (23) and (24) demonstrate time-window violations. Constraints (25) and (26) state total vehicle load after visiting each node. Constraint (27) guarantees that the whole load does not exceed the vehicle capacity. Constraint (28) shows that the flow to disposal centers can be established if they are opened. Constraint (29) is similarly defined for remanufacturers. Constraint (30) acknowledges that perishable products should be transferred to disposal centers if they are dangerous. Constraint (31) assures that remanufacturer should be visited either there are reproducible products in a given period or there are products that were remanufactured in the previous period. Constraints (32) and (33) show the number of reproducible products that return to the PSC. Finally, Constraint (34) displays the type of decision variables.

3.2.1 Demands based on a scenario-based game theory

The demand for a specific medicine is usually a function of utility that customers perceive based on its price and quality. Among various brands of manufacturers, customers choose the one which brings them the most utility. A pharmacy must carefully consider their customers’ requirements to properly meet their demands and subsequently maximize its profit. In this study, competition on price and quality among members of the PSC is considered to estimate the demand of each brand.

It is assumed that pharmacy r supplies product p from two types of manufacturers; high-quality manufacturers (\(H\)) with quality level \({\alpha }_{pH}\), and low-quality manufacturers (\(L\)) with quality level \({\alpha }_{pL},\) where \(0<{\alpha }_{pL}<{\alpha }_{pH}< 1\). In this structure, manufacturer \(m\) incurs unit production cost \({C}_{pm}\) and sets wholesale price \({W}_{pm}\). Then, the pharmacy r sets retail price \({V}_{pmr}\) for product p of manufacturer \(m\), where \(m=H, L\) and \({C}_{pH}>{C}_{pL}\). The demand functions for each brand are therefore as follows:

$${D}_{pHr}={a}_{pr} -{b}_{vr}({V}_{pHr}-{s}_{vr}{V}_{pLr})+ {b}_{\alpha r}({\alpha }_{pH}-{s}_{\alpha r}{\alpha }_{pL})$$
(35)
$${\mathrm{D}}_{\mathrm{pLr}}={\mathrm{a}}_{\mathrm{pr}} -{\mathrm{b}}_{\mathrm{vr}}({\mathrm{V}}_{\mathrm{pLr}}-{\mathrm{s}}_{\mathrm{vr}}{\mathrm{V}}_{\mathrm{pHr}})+ {\mathrm{b}}_{\alpha \mathrm{r}}({{\alpha }}_{\mathrm{pL}}-{\mathrm{s}}_{\alpha \mathrm{r}}{{\alpha }}_{\mathrm{pH}})$$
(36)

where \({a}_{pr}\) is the market potential of pharmacy r for product p. In the customer zone where pharmacy r is located, \({b}_{vr}\) and \({b}_{\alpha r}\) are demand sensitivity coefficients for price and quality respectively. \({s}_{vr}\)(\({s}_{\alpha r})\) is the substitutability coefficient related to the price (quality) of pharmacy rs competitors. This way, the profit functions of manufacturers and pharmacies are respectively as follows:

$${\pi }_{pm}=\left({W}_{pm} - {C}_{pm}\right)\sum_{r=1}^{R}{D}_{pmr} \quad \forall p,m=H,L$$
(37)
$${\pi }_{pr}=\sum_{m=1}^{M}\left( {V}_{pmr}-{W}_{pm} \right){D}_{pmr}\quad \forall p,m=H,L.$$
(38)

To apply the scenario-based game theory approach, a bi-level programming model is provided which includes two sub-models; upper and lower-level problems. The upper-level problem consists of objectives and decisions of upper-level decision-makers (DMs) that are leaders of the SC. The lower-level problem consists of decisions of lower-level DMs or followers. This way, leaders first make their decisions according to their goals and conditions. Then, followers make their decisions in response to the top level. Consequently, the variables of the two sub-models depend on each other. Since PSCs’ manufacturers mostly have a more prevailing position and market power than downstream members, they are the leaders and pharmacies are the followers. So, to obtain the equilibrium solution, using the backward deduction method, we first should solve the lower-level problem (Eq. 38) and find the optimal retail prices. Then, the results are applied to solve the upper-level problem (Eq. 37) and derive optimal wholesale prices. As a result, optimal prices for each manufacturer can be calculated from the following equations:

$${W}_{pm}^{*}=\frac{\sum_{r=1}^{R}{a}_{pr}\left(2+{s}_{vr}\right)+{b}_{\alpha r}{s}_{vr}\left({\alpha }_{p{m}^{{\prime}}}-{s}_{\alpha r}{\alpha }_{pm}\right)+2{b}_{\alpha r}\left({\alpha }_{pm}-{s}_{\alpha r}{\alpha }_{p{m}^{{\prime}}}\right)+{b}_{vr}(2{C}_{pm}+{s}_{vr}{C}_{p{m}^{{\prime}}})}{\sum_{r=1}^{R}{b}_{vr}(4-{s}_{vr}^{2})}$$
(39)
$${V}_{pmr}^{*}=\frac{{a}_{pr}\left(1+{s}_{vr}\right)+{b}_{\alpha r}{s}_{vr}\left({\alpha }_{p{m}^{{\prime}}}-{s}_{\alpha r}{\alpha }_{pm}\right)+{b}_{\alpha r}\left({\alpha }_{pm}-{s}_{\alpha r}{\alpha }_{p{m}^{{\prime}}}\right)+{b}_{vr}{W}_{pm}^{*}(1-{s}_{vr}^{2})}{2{b}_{vr}(1-{s}_{vr}^{2})}.$$
(40)

After obtaining optimal prices, demands for each brand can be derived as follows:

$${D}_{pmr}^{*}={a}_{pr} -{b}_{vr}\left({V}_{pmr}^{*}-{s}_{vr}{V}_{p{m}^{{\prime}}r}^{*}\right)+ {b}_{\alpha r}\left({\alpha }_{pm}-{s}_{\alpha r}{\alpha }_{p{m}^{{\prime}}}\right)\quad \forall m\ne {m}^{{\prime}}.$$
(41)

Customer’s needs are constantly changing and their tendencies may not remain fixed over time. Therefore, the precise determination of parameters (e.g., customer’s sensitivity to price and quality) is not possible. To deal with this challenge, the competition takes place under three scenarios; optimistic, most likely and pessimistic. In the optimistic scenario, the coefficients of demand sensitivity are low and substitutability coefficients are high. Consequently, customers pay less attention to the brand, quality, and price and buy a specific medicine from any brand that is available at a pharmacy. In the most likely scenario, the coefficients take probable values. Under the pessimistic scenario, sensitivity to price and quality is very high and substitutability coefficients are low. Hence, the slightest mismatch between the price/quality of medicines and the taste of customers makes customers refuse to buy. This way, the optimal demand under scenario s is obtained as follows:

$${D}_{pmrs}^{*}={a}_{pr} -{b}_{vrs}\left({V}_{pmrs}^{*}-{s}_{vrs}{V}_{p{m}^{{\prime}}rs}^{*}\right)+ {b}_{\alpha rs}\left({\alpha }_{pm}-{s}_{\alpha rs}{\alpha }_{p{m}^{{\prime}}}\right)\quad \forall m, {m}^{{\prime}}=H,L.$$
(42)

4 Solution approach

The mathematical model developed to design the proposed sustainable CLPSC is a multi-objective programming model with three conflicting objective functions. Different approaches have been developed to solve the multi-objective programming models including priori, interactive and posteriori ones (Sazvar et al. 2014). One of the most used approaches is the LP-metrics method, which is more perceptible for executives compared to other methods, and in many cases of using this method such as Manopiniwes and Irohara (2017) and Abdelaziz et al. (2018), acceptable results have been provided.

Based on Dethloff (2001), the VRPSPD is an NP-hard problem. Since our problem is extended form of the VRPSPD (the integration of sustainable CLPSC and the VRPSPD), the proposed model is NP-Hard, too. This way, a two-phase hybrid approach is provided to solve the proposed MOMILP model especially in large scales. In the first phase, the multi-objective model is converted to a single-objective model using the LP-metrics method. Then, in the second phase, a heuristic approach is suggested to be able to obtain optimal/near-optimal solutions.

Given the first and third objective functions are maximization and the second is minimization, the LP-metrics objective function is formulated as follows:

$${Z}_{total}=\frac{{{w}_{1}.({\mathrm{Z}}_{1}^{max}-Z}_{1})}{{\mathrm{Z}}_{1}^{max}-{\mathrm{Z}}_{1}^{min}}+\frac{{{w}_{2}.(Z}_{2}-{\mathrm{Z}}_{2}^{min})}{{\mathrm{Z}}_{2}^{max}-{\mathrm{Z}}_{2}^{min}}+\frac{{{w}_{3}.({\mathrm{Z}}_{3}^{max}-Z}_{3})}{{\mathrm{Z}}_{3}^{max}-{\mathrm{Z}}_{3}^{min}}$$
(43)

where \({w}_{i}\) denotes the weight of the ith objective function (Zi), which is determined based on DMs’ opinion. \({\mathrm{Z}}_{i}^{max}\) and \({\mathrm{Z}}_{i}^{min}\) respectively show the maximum and minimum values of the ith objective function. The steps of the heuristic algorithm applied are as follows (Kaur and Singh 2018):

First step:

Consider the relaxed MILP model in which the binary variables \({Y}_{jn{n}^{{{\prime}}}gt}\) are considered as continuous positive variables (\({Y}_{jn{n}^{{{\prime}}}gt}\ge 0\)).

Second step::

Solve the relaxed model optimally.

Third step::

Record all non-zero quantities of \({Y}_{jn{n}^{{{\prime}}}gt}\) which are obtained by solving the relaxed MILP model.

Fourth step::

Equalize each non-zero quantity of \({Y}_{jn{n}^{{{\prime}}}gt}\) to 1, and set them as constraints to the main MILP model.

Fifth step::

Solve the created model optimally.

The efficiency of the proposed two-phase hybrid approach is reported in Supplementary material.

5 Computational experiment

5.1 Case study

In this section, an Iranian medical SC is investigated as a real case study to validate the performance of the proposed MOMILP model. A PCS in Noor city, located in the north of Iran, is concentrated. This supply chain comprises 7 pharmacies (located in Noor city), a local DC (which is one of the branches of Darou Pakhsh Co., in Sari city) and a number of different brands of manufacturers. According to the expert’s suggestions, two potential locations for disposal sites and three potential remanufacturer centers are considered in the suburbs of the Noor city. The network configuration including pharmacies and potential points is shown in Fig. 4. Three high-consumption pharmaceutical products namely Tetracycline (1st product), Imatinib (2nd product), and Midazolam (3rd product) are taken into account. We suppose that medicines have six months’ lifetime. The expired antibiotic Tetracycline can be classified as one of the hazardous medicines. The toxicity is caused by tetracycline degradation products (epi-anhydrotetracycline or anhydrotetracycline), which can lead to a dangerous syndrome. Therefore, waste disposal of this product should follow the guidelines recommended by the ministry of health and the national environment agency (Beery et al. 2019; Toh and chew 2017). Imatinib is one of the medicines used to treat cancer, which can be remanufactured at a lower cost by extracting Active Pharmaceutical Ingredients (APIs) from expired Imatinib (Basha et al. 2015). We consider expired Midazolam recyclable, which can be sold to secondary markets. It can properly perform as an additive in Nickel electrodeposition from the Watts bath due to pure compounds in its pharmaceutical formulation (Duca et al. 2016).

Fig. 4
figure 4

Location of pharmacies, remanufacturers, and disposal centers in the Noor city

The demands of products for a four-year time horizon split into 24 periods (each period is two months) are brought for each pharmacy in detail. Customers in each particular region have different welfare and cultural conditions. In this regard and according to the customer zones, the demand sensitivity coefficients for price and quality, for each pharmacy, are estimated. The data related to each brand and its quality as well as production costs are available too.

The three different fleet of vehicles is employed for transferring products. All the data required for the case study is reported in “Appendix” in detail. Also, it should be noted that based on the experts’ opinions, weights of the objective functions considered as follows: \({w}_{1}=0.4, {w}_{2}=0.3\) and \({w}_{3}=0.3\).

5.2 Experimental results

In this section, the obtained results of solving the proposed MOMILP problem for the aforementioned case study are discussed. The transferred flows from manufacturers to pharmacies in the first three periods are reported in Tables 2, 3 and 4. According to these tables, the competition on price and quality ensures that products of each brand are supplied to pharmacies according to their customers’ willingness to pay. As well, pharmacies stock products in a way that the desired products of their customers be available as much as possible. Regarding Table 7, the pharmacies whose customers are more price-sensitive (e.g., pharmacies 4, 5 and 6), receive a supply of pharmaceuticals, most of which includes economic low-quality alternative brands. While, the pharmacies with quality-sensitive customers (e.g., pharmacies 1, 2 and 7), receive a supply of pharmaceuticals, which mostly consist of high-quality international brands. However, in the absence of competition, products are transferred to pharmacies without any special regulation (Fig. 5).

Table 2 Optimal flow of Tetracycline to pharmacies (t = 1, 2, 3)
Table 3 Optimal flow of Imatinib to pharmacies (t = 1, 2, 3)
Table 4 Optimal flow of Midazolam to pharmacies (t = 1, 2, 3)
Fig. 5
figure 5

Demand of each brand a without competition; b with competition

According to the results, disposal center 2 and remanufacturer 3 are selected for opening. The routes of transferring medicines (expired medicines) among pharmacies (disposal and remanufacturer centers) are outlined in Fig. 6 (Fig. 7).

Fig. 6
figure 6

Routes of transferring medicines among pharmacies

Fig. 7
figure 7

Routes of transferring expired medicines among centers

The waste flows of each product category are reported in Table 5. As can be seen, the proposed model makes it possible to collect expired medicines from pharmacies immediately after expiration. This prevents patients to access to the expired products which can have irreparable effects on patients' bodies and harm their health and life.

Table 5 Collected expired medicines of each brand

As Fig. 8 shows, 30% of the total collected waste is hazardous, that can cause serious environmental damages, and needs to be disposed of. This way, there is no need to dispose of other waste categories because they can be used for other purposes. If products were not classified, all waste flows would have to be disposed of. By categorizing wastes, the disposal costs have therefore been considerably saved. 36% of total waste is sold to secondary markets to be recycled and be used as raw materials in non-PSCs. It, therefore, causes to generate revenue through waste management. Finally, 34% of collected waste is remanufactured and returned to the PSC. This amount of return flow is equivalent to 12.88% of the total flow of Imatinib which is sent to pharmacies through the PSC. It leads to a considerable reduction in production and process times as well as raw materials usage.

Fig. 8
figure 8

Scheme of waste collection network

5.3 Sensitivity analysis

In this section, sensitivity analysis for the main parameters is done to verify the developed model and discover its applications. Figure 9 illustrates the fluctuation of the first Objective Function Value (OFV1) and OFV2 due to the variation of the vehicle’s capacity. A glance at this figure displays that both OFVs are getting better values by increasing the vehicle’s capacity. The OFVs initially have a significant improvement due to the reduction in the number of vehicles utilized. As Fig. 9 shows, when the vehicle’s capacity is increased by 30% from the base case, the OFV2 (amount of carbon emission) is decreased by 31.12%. While from some point onwards, the impact of capacity changes on the number of vehicles required, and subsequently the OFVs, is greatly reduced. For instance, whenever the vehicle’s capacity is increased by 80%, OFV2 is decreased by 57.75% (compared to the base case), indicating a decrease in the OFV2s slope.

Fig. 9
figure 9

OFV1 and OFV2 versus the vehicle’s capacity alteration

Figure 10 illustrates the impacts of lost sale cost (\({SHC}_{pmrt}\)) on inventory level and lost sale amounts in pharmacies. As can be seen, the total level of inventory grows by lost sale costs increasing. In other words, pharmacies prefer to consolidate their position in the competitive market by keeping higher levels of inventory. Also, as Fig. 10 shows, when the lost sale cost is increased by 30% and 75%, the inventory level is increased by 9.6% and 58%, respectively. So, the higher lost sale cost leads to a higher rate of increase in the inventory level. Besides, it is logical that increment in lost sale costs results in lost sales reduction. When the lost sale cost is increased by 60%, the lost sale amount is decreased by 82.35%.

Fig. 10
figure 10

Impact of the unit lost-sale cost on lost-sales and total inventory level (r = 4, t = 2)

Figure 11 is devoted to an analysis of the time window violation cost (\({CTW}_{r}\)). The diagram demonstrates that with a constant lost-sale cost, the OFV1 decreases as the cost of time violation rises. In such a situation, the model may incur costs of routing, unmet demand, or even growth in vehicle numbers to satisfy demands in a predetermined time. Initially, to prevent the deviation from time windows, the amount of unmet demands is increased significantly. However, after a point (about 50% increase in cost), a sharp decline in the increasing rate of unsatisfied demand can be observed. For example, when the time violation cost is one and a half times the base case, the unmet demand is increased by 65%, while, when the violation cost doubles, the unmet demand is increased by 75%. The reason for these changes is that satisfying demands are more important than the response time for PSCs, even if it breaks the deadline.

Fig. 11
figure 11

OFV1 and unmet demand versus the time violation cost

Figure 12 represents OFV1 and OFV3 changes arising from the variation of the expiration cost (\({EXC}_{pmt}\)). As it is evident, both objective functions are getting worse values by increasing the expiration cost of the medicines. By an increase in expiration cost, the model seeks to reduce the number of expired medicines in all pharmacies. Accordingly, the less the expired product, the lower the number of variable occupations, and the value of OFV3 diminishes. Additionally, the negligible reduction in the OFV1 is probably caused by a decrease in revenue from expired medicines.

Fig. 12
figure 12

OFV1 and OFV3 versus the expiration cost

The change in customer’s demand for each brand against products’ price and quality is shown in Fig. 13. By keeping the price sensitivity constant, Fig. 13a demonstrates that high demand sensitivity to quality leads to a high level of demand for high-quality medicines. However, it causes a low demand for low-quality medicines. Regarding Fig. 13a, when the demand sensitivity to quality is increased from 1 to 6 the demand for high-quality brands is increased by 33%, whereas the demand for low-quality brands is decreased by 68.83%.

Fig. 13
figure 13

Each brand’s demand versus a demand sensitivity to the quality; b demand sensitivity to the price (P = 3, r = 3, t = 3)

By keeping the quality sensitivity parameter constant and according to Fig. 13b, the demand for both brands decreases when the price sensitivity increases. This is because of the inherent negative influence of price on the demand. Based on Fig. 13b, a sharp decline in the high-quality medicines’ demand can be observed, while low-quality products’ demand decreases with a gentle slope. For example, when the demand sensitivity to price is increased from 1 to 5, the demands for the high-quality and low-quality brands are decreased by 65.31% and 31.56% respectively.

Figure 14 indicates the transferred flow of medicines to pharmacies under different sets of game scenario’s probability. According to the diagram and given that the customers of pharmacy 7 pay more attention to quality, under the higher probability of a pessimistic scenario occurrence, the high-quality manufacturer will cover more market demand than the low-quality one. Hence, the pharmacy managers must properly replenish the inventory with customer’s favorable items. Because in the absence of required products, customers will refuse to buy from another brand. As the occurrence probability of the optimistic scenario increases, the flow of both brands grows, along with a decline in the difference of two brands flows. According to Fig. 14 the difference between the two brands' flows is decreased by 48.53% from pessimistic to the optimistic scenario. This situation can be desirable for pharmacy managers. Since in the absence of customers’ favorable brand, the possibility of buying medicines from another brand is enforced.

Fig. 14
figure 14

Transferred flow of medicines under different sets of game scenario’s probability (P = 3, r = 7, t = 3)

5.4 Managerial insights

An important obligation of a manager is to set goals, determine the path to achieve them, and make strategic, tactical and operational decisions. To accomplish this commitment, it is vital to have managerial insights. The presented model provides managerial perspectives for PSC managers through the following goals:

  • Despite strategic decisions, tactical decisions related to PSC flows, especially replenishment of inventory, play a pivotal role in reducing costs as well as increasing customers’ satisfaction. Therefore, managers should appropriately plan PSCs to minimize their inventory costs. The proposed model increases knowledge of pharmacy managers to adopt proper replenishment policies, to keep their market shares by enhancing customers’ loyalty.

  • The proposed scenario-based game theory approach provides an efficient way of determining customers’ demands in a competitive market. This way, it can help inventory holders and DC managers in making decisions on transferring optimal flows of each brand’s products to pharmacies.

  • Market conditions are not always monotonous. Factors such as introducing a new product, changes in raw material prices or customers’ tastes can dramatically change the demand rate for a particular brand. Hence, managers should forecast sales under different scenarios to cope with market shocks and avoid from lost-sale or unwanted inventories.

  • As can be seen from Fig. 11, price sensitivity is an inherently negative factor on demand. If the price sensitivity in a customer zone is high, which leads to a low demand rate for a particular brand, it is recommended to apply some incentive policies such as price mark-down to reduce perished products.

  • Green policies, corporate social responsibility and environmentally-friendly performance are important factors in attracting customers. The suggested CLPSC can help managers to achieve these factors by coping with unauthorized dumping of pharmaceutical wastes, properly managing expired products and adopting a green transport system based on the SPD concept.

5.5 Theoretical insights and contributions

In this study, a scenario-based MOMILP model is developed to design a sustainable CLPSC network in a competitive market under the demand uncertainty, which aims at maximizing profit and social aspects, in addition to minimizing environmental harms. However, in spite of the fact that our study highlights PSC, the proposed methodologies are not case-specific and can be applied to any supply chain in various fields and industries. Accordingly, based on the current investigation, some theoretical contributions are extracted to enrich the literature of supply chain management as follows:

  • This study establishes a benchmark model for supply chain administrators to successfully conduct and manage CLSC initiatives along with sustainable targets in any industrial sector. In the literature, many requirements of waste management have been overlooked in multi-product CLSC models. This research contributes to the closed-loop supply chain literature by specifying backward flows based on products’ specifications.

  • Our study directly addresses integrated environmental approaches by designing a CLSC network along with VRPSPD, which is a critical dialogue in research on environmental management. To clarify, unlike other models, in this research for approaching the real-world conditions, almost all diverse environmental hazards arising from industrial and transportation activities are covered. It means that different environmental attributes of the proposed model can be used for a multitude number of studies discussing the environmental impacts of supply chain processes.

  • The present study accomplishes the goal of creating an appropriate balance between overcoming demand uncertainty and increasing overall profitability by connecting game theory outcomes with the supply chain’s multi-objective framework. It should be noted that the proposed scenario-based game theory and its combination with the CLSC model are absolutely new in the area of supply chain management. The applied methods are convenient to perceive and implement. Due to their extreme flexibility, these procedures can yield to develop new models which are able to further extend this field, as well.

  • As a novel idea, the presented scenario-based game theory is suitable and practical for any brand-differentiated supply chain, which enriches the supply chain context and provides a significant improvement in customer orientation activities, especially for urban areas with a broad spectrum of desires. The game scenarios can be evaluated and determined by subject matter experts, and applied to predict the demand, based on the products price and the quality levels under different situations.

  • Finally, although the efficient proposed hybrid algorithm is only evaluated under CLSC cases with SPD, it can be simply utilized to deal with other complex VRPs as well as other optimization issues in the class of NP-Hard problems.

6 Conclusion

The current study focused on the integration of the sustainable CLPSC network design and VRPSPD in a competitive market. The model consisted of some manufacturers, DCs, pharmacies and potential locations for disposal centers and remanufacturers. Products, from different brands, were considered with specific lifetimes. The uncertainty in each brand’s demand was handled by a scenario-based game theory approach. The products were collected after their lifespan and managed in three ways according to their characteristics; disposing of, remanufacturing and recycling. To solve the proposed model, a hybrid solution approach was provided by incorporating the LP-metrics method with a heuristic algorithm. Furthermore, a real case study in Iran was applied to evaluate the applicability of the presented model.

According to the obtained results, classifying pharmaceutical wastes, in addition to a considerable improvement in the negative environmental effects of the PSC (due to proper disposal), could save costs and production times (through the remanufacturing process) and/or generate additional revenue (through selling some expired products to secondary markets). Also, considering the competition in designing the CLPSC could optimally affect major decision variables (e.g., the flow of each brand’s products to pharmacies and/or their inventory levels). If the competitive condition of real markets is not considered in designing a CLPSC, it may cause high lost-sales along with a huge inventory cost in some periods. Because pharmacies’ inventories may include some brands of products with negligible demand that consequently results in a high level of perished and non-perished inventories as well as customer loss.

The future extensions may include (1) improving the developed model by integrating resiliency and sustainability requirements, (2) considering the possibility of transferring excess inventories of an unwanted brand from one pharmacy to another one where there is a demand for that brand and, (3) Examining the application of the developed model for other real cases especially medical SCs involving with managing wastes generated due to COVID-19.