Hostname: page-component-6bb9c88b65-s7dlb Total loading time: 0 Render date: 2025-07-24T05:03:13.828Z Has data issue: false hasContentIssue false

NMAsurv: An R Shiny application for network meta-analysis based on survival data

Published online by Cambridge University Press:  10 July 2025

Taihang Shao
Affiliation:
Center for Pharmacoeconomics and Outcome Research, China Pharmaceutical University, Nanjing, China JC School of Public Health and Primary Care, Faculty of Medicine, https://ror.org/00t33hh48 The Chinese University of Hong Kong , Shatin, China
Mingye Zhao
Affiliation:
Center for Pharmacoeconomics and Outcome Research, China Pharmaceutical University, Nanjing, China
Fenghao Shi
Affiliation:
International Research Center for Medicinal Administration, https://ror.org/02v51f717 Peking University , Beijing, China
Mingjun Rui
Affiliation:
Center for Pharmacoeconomics and Outcome Research, China Pharmaceutical University, Nanjing, China School of Pharmacy, Faculty of Medicine, https://ror.org/00t33hh48 The Chinese University of Hong Kong , Shatin, China
Wenxi Tang*
Affiliation:
Center for Pharmacoeconomics and Outcome Research, China Pharmaceutical University, Nanjing, China
*
Corresponding author: Wenxi Tang; Email: tokammy@cpu.edu.cn
Rights & Permissions [Opens in a new window]

Abstract

Network meta-analysis (NMA) is becoming increasingly important, especially in the field of medicine, as it allows for comparisons across multiple trials with different interventions. For time-to-event data, that is, survival data, traditional NMA based on the proportional hazards (PH) assumption simply synthesizes reported hazard ratios (HRs). Novel methods for NMA based on the non-PH assumption have been proposed and implemented using R software. However, these methods often involve complex methodologies and require advanced programming skills, creating a barrier for many researchers. Therefore, we developed an R Shiny tool, NMAsurv (https://psurvivala.shinyapps.io/NMAsurv/). NMAsurv allows users with little or zero background in R to conduct survival-data-based NMA effortlessly. The tool supports various functions such as drawing network plots, testing the PH assumption, and building NMA models. Users can input either reconstructed pseudo-individual participant data or aggregated data. NMAsurv offers a user-friendly interface for extracting parameter estimations from various NMA models, including fractional polynomial, piecewise exponential models, parametric survival models, Cox PH model, and generalized gamma model. Additionally, it enables users to effortlessly create survival and HR plots. All operations can be performed by an intuitive “point-and-click” interface. In this study, we introduce all the functionalities and features of NMAsurv and demonstrate its application using a real-world NMA example.

Information

Type
Software Focus
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The Society for Research Synthesis Methodology

Highlights

What is already known?

  • Network meta-analysis (NMA) based on survival data is becoming increasingly important, while the traditional Cox proportional hazards (PH) model, which relies on the PH assumption, cannot handle some novel evidence.

  • More flexible, non-PH assumption-based models have been proposed, but they are limitedly used probably due to their complex methodology and programming.

What is new?

  • NMAsurv is presented as the first tool for handling time-to-event data using non-PH assumption-based NMA models.

  • NMAsurv offers several useful functionalities, including network plot generation, PH assumption testing, and batch export of results, to facilitate NMA studies.

  • NMAsurv provides direct parameter estimations and generates survival plots and hazard ratio plots, enabling users to obtain comprehensive evidence synthesis results efficiently.

Potential impact for RSM readers

  • NMAsurv features an intuitive “point-and-click” interface, allowing users with little or no background in R to follow the provided user manual to conduct an NMA based on survival data and generate comprehensive results step-by-step without installing any software.

1 Introduction

Evidence synthesis is an important component of health technology assessment, through which decision-makers can combine data from multiple trials to get a pooled estimate of clinical efficacy.Reference Cope, Chan and Campbell1 Pairwise meta-analysis is suitable when comparing two interventions. However, if more than two interventions are of interest, and the available trials do not individually compare all interventions, then network meta-analysis (NMA) should be considered.Reference Lumley2 NMA can be an extension of pairwise meta-analysis, which can estimate the comparative effectiveness of multiple interventions simultaneously through building a connected network.Reference Schwingshackl, Schwarzer, Rucker and Meerpohl3

The efficacy of novel anti-cancer interventions is frequently measured using time-to-event (TTE) outcomes, such as progression-free survival (PFS) and overall survival (OS), which are often presented through Kaplan–Meier (KM) curves. Typically, the relative treatment effect between two interventions is expressed as a hazard ratio (HR) or restricted mean survival time (RMST).Reference Batson, Greenall and Hudson4 Reference Royston and Parmar6 NMA based on TTE outcomes can be constructed directly through trial-specific reported HRs, which are usually derived from the Cox proportional hazards (PH) model.Reference Freeman, Cooper and Sutton7 However, the PH assumption may sometimes be violated because different trials may exhibit variations in follow-up times or compare interventions with different mechanisms of action.Reference Royston and Parmar6, Reference Li, Han, Hou, Chen and Chen8, Reference Schadendorf, Hodi and Robert9 In this case, continuing to use the Cox PH model may lead to biased estimates.

To address this issue, alternative NMA models that relax the PH assumption have been developed.Reference Cope, Chan and Campbell1, Reference Freeman, Cooper and Sutton7, Reference Heeg, Garcia and Beekhuizen10 Nonetheless, these methods have seen limited adoption.Reference Freeman, Sutton and Cooper11 One possible barrier is that researchers are not familiar with the detailed methodology of these models, which can be essential to their correct application.Reference Freeman, Cooper and Sutton7 Another challenge is the difficulty in implementing these methods, as they often require advanced programming skills. While user-friendly web-based tools or software for traditional NMA based on binary or continuous outcomes exist,Reference Owen, Bradbury, Xin, Cooper and Sutton12 Reference Zhao, van Valkenhoef, de Brock and Hillege15 tools specially designed for NMA based on TTE outcomes are not yet available.

To break the barriers of using flexible NMA methods to process the TTE outcomes, we created an R Shiny framework-based interactive application: NMAsurv. In this technical note, we first give a brief review of currently existing NMA methods and introduce the methodology of the methods included in this app in Section 2. In Section 3, we describe the structure and functionality of NMAsurv. In Section 4, an illustrative example showing how to use NMAsurv is provided. Finally, in Section 5, we discuss the advantages and limitations of NMAsurv, as well as potential future extensions. A user manual for NMAsurv can be found in Supplementary Material 1.

2 Methods

2.1 NMA for TTE outcomes

Typically, when conducting NMA based on survival data, researchers often lack access to IPD (Individual Patient Data) for all included trials. In cases where IPD is available for one or more trials, alternative population-adjusted indirect comparison methods, such as simulated treatment comparison (STC) and matching-adjusted indirect comparison (MAIC), can be employed for two-study scenarios. Multilevel network meta-regression can be used for large treatment networks.Reference Owen, Bradbury, Xin, Cooper and Sutton12 Reference Zhao, van Valkenhoef, de Brock and Hillege15 These methods are well developed, and some user-friendly tools are also available.Reference Phillippo, Ades and Dias16 Reference Jiang, Cappelleri and Gamalo18 However, it is more common for researchers to lack access to IPD. In such cases, techniques can be used to digitize published OS and PFS KM curves and reconstruct pseudo-IPD.Reference Guyot, Ades, Ouwens and Welton19 Reference Wei and Royston21 In NMAsurv, we only considered reconstructed pseudo-IPD as our data input.

2.2 A brief review of the NMA models

We conducted a systematic review in PubMed to identify published articles containing NMA models based on survival data. The detailed search strategy was provided in Supplementary Material 2. Additionally, we reviewed previously published reviewsReference Cope, Chan and Campbell1, Reference Freeman, Cooper and Sutton7, Reference Heeg, Garcia and Beekhuizen10, Reference Wiksten, Hawkins, Piepho and Gsteiger22, Reference de Jong, Moons and Riley29, Reference Saramago, Chuang and Soares30 and guidelines31, 32 on NMA for TTE results. After screening, we finally identified the following eight methods: the Cox PH model, fractional polynomial (FP) model, piecewise exponential (PWE) model, parametric survival model (PSM), Royston–Parmar (RP) model, RMST, generalized gamma model, and cure models. A brief introduction and related citations for these models can be found in Table 1. Furthermore, researchers can add treatment-by-covariates interactions in NMA to evaluate multivariate treatment effects.Reference Jansen and Cope33 We do not include treatment-by-covariates interactions in our discussion as it requires access to the true IPD with covariates.

Table 1 A brief introduction of included models

Abbreviations: AD, aggregated data; HR, hazard ratio; IPD, individual participant data; PH, proportional hazards.

2.3 Methodology of included NMA models

Based on the review in Section 2.2, we finally included five methods in this app: the Cox PH model, FP model, PWE model, PSM, and generalized gamma model. Cure models were excluded from this study since the current practice is not based on R. The RP model and the RMST model were excluded because the existing algorithms do not support “point-and-click” operations. Further discussion on these excluded methods can be found in Section 5. A summary of the five implemented methods along with their respective data and model assumptions is provided in Table 2.

Table 2 A summary of the five implemented methods and their data/model assumptions

Note: Green = implemented in the app; red = supported in the literature but not implemented; gray = not applicable. The symbol “*” indicates that PSM under the frequentist framework can only be realized through the two-step IPD approach. The symbol “#” refers to Freeman et al. (2022).Reference Freeman, Cooper and Sutton7 The symbol “$” refers to Cope et al. (2020).Reference Cope, Chan and Jansen24 The symbol “§” refers to Bowden et al. (2011).Reference Bowden, Tierney, Simmonds, Copas and Higgins34

Abbreviations: AD, aggregated data; FE, fixed-effect; FP, fractional polynomial; IPD, individual participant data; PH, proportional hazard; PSM, parametric survival model; PWE, piecewise exponential; RE: random-effects.

This table contains information updated up to July 31, 2024.

2.3.1 Fractional polynomial model

In current practice, two approaches are commonly used to implement FP NMA: (1) a one-step, IPD-based methodReference Freeman, Cooper and Sutton7 and (2) a two-step, AD-based method.Reference Wiksten, Hawkins, Piepho and Gsteiger22 In this study, we employed the two-step, AD-based method. In the first step, an ANOVA-like parameterization is used to express and fit the model as a generalized linear model (GLM) with time-varying covariates within a frequentist framework. The models are then compared based on the Akaike information criterion (AIC), and the model with the lowest AIC is selected for further analysis in a Bayesian or frequentist setting in the second step. This two-step approach can speed up the model selection process.Reference Freeman, Cooper and Sutton7, Reference Wiksten, Hawkins, Piepho and Gsteiger22 Typically, an FP model can be written as followsReference Jansen23:

(1) $$ \begin{align}y=\left\{\begin{array}{@{}l}{\beta}_0+{\beta}_1{t}^p,\\ {}\\ {}{\beta}_0+{\beta}_1{t}^{p_1}+{\beta}_2{t}^{p_2}.\end{array}\right.\end{align}$$

Power p can be −2, −1, −0.5, 0, 0.5, 1, 2, and 3 with ${t}^0=\mathit{\log}t$ , if ${p}_1={p}_2=p$ and $y={\beta}_0+{\beta}_1{t}^p+{\beta}_2{t}^p\mathit{\log}t$ . For reference treatment a and intervention b, taking the FP1 model with power = −2 as an example, we get d 0ab , d 1ab , β 0a , β 1a , β 0b , and β1b . To be specific,

(2) $$\begin{align}{\beta}_{0b}={\beta}_{0a}+{d}_{0 ab};{\beta}_{1b}={\beta}_{1a}+{d}_{1 ab}.\end{align}$$

Thus, we can get the following formulas:

(3) $$\begin{align}\left\{\begin{array}{c}\log \left({h}_a(t)\right)={\beta}_{0a}+{\beta}_{1a}\ast {t}^{-2},\\ {}\\ {}\log \left({h}_b(t)\right)={\beta}_{0b}+{\beta}_{1b}\ast {t}^{-2},\end{array}\right.\end{align}$$
(4) $$\begin{align}\left(\begin{array}{c}{\beta}_{0k}\\ {}{\beta}_{1k}\end{array}\right)=\left\{\begin{array}{c}\left(\begin{array}{c}{\mu}_0\\ {}{\mu}_1\end{array}\right)\kern0.24em \mathrm{if}\;k=a,\\\\[-6pt] {}\left(\begin{array}{c}{\mu}_0\\ {}{\mu}_1\end{array}\right)+\left(\begin{array}{c}{d}_0\\ {}{d}_1\end{array}\right)\kern0.24em \mathrm{if}\;k=b.\end{array}\right.\end{align}$$

Here, d is the difference in β 0 and β 1 of the log hazard for treatment b relative to a and μ is the parameters β of the reference treatment. The methodology of calculation in both the Bayesian setting and the frequentist setting has been described in detail elsewhere.Reference Wiksten, Hawkins, Piepho and Gsteiger22, Reference Jansen23 For the random-effects model, we consider only a heterogeneity parameter for d 0 as the methodology for including all heterogeneity parameters still needs further development.Reference Jansen23

2.3.2 Piecewise exponential model

The PWE model follows a framework similar to that of the FP model, employing a two-step, AD-based method.Reference Wiksten, Hawkins, Piepho and Gsteiger22 However, currently, only fixed-effect models are available for PWE. For reference treatment a and intervention b, taking a PWE model with time point = 2 as an example, we get d 0ab , d 1ab , β 0a , β 1a , β 0b , and β 1b . We can get formula (5) based on formulas (2) and (4):

(5) $$\begin{align}\left\{\begin{array}{@{}l}\log \left({h}_a(t)\right)={\beta}_{0a}\left(0\le t<2\right);{\beta}_{1a}\left(t\ge 2\right),\\ {}\\ {}\log \left({h}_b(t)\right)={\beta}_{0b}\left(0\le t<2\right);{\beta}_{1b}\left(t\ge 2\right).\end{array}\right.\end{align}$$

2.3.3 Parametric survival model

In current practice, there are also two ways to realize PSM.Reference Cope, Chan and Jansen24, Reference Ouwens, Philips and Jansen25 Here, we applied a one-step, AD-based method. The models including Weibull, Gompertz, log-logistic, and log-normal are considered. Since frequentist analysis based on GLM is not available, we only include the Bayesian analysis. For reference treatment a and intervention b, taking the PSM Weibull model as example, based on formulas (2) and (4), we can get formula (6):

(6) $$\begin{align}\left\{\begin{array}{@{}l}\mathit{\log}\left({h}_a(t)\right)={\beta}_{0a}+{\beta}_{1a}\ast \mathit{\log}(t)\\ {}\\ {}\mathit{\log}\left({h}_b(t)\right)={\beta}_{0b}+{\beta}_{1b}\ast \mathit{\log}(t)\end{array}\right.\end{align}$$

The log hazard function formulas for all four distributions can be found in Supplementary Material 2.

2.3.4 Cox PH model

The NMA model based on the PH assumption can be realized in several ways. In this app, we select the method developed by Freeman et al.,Reference Freeman, Cooper and Sutton7 which follows a two-step, IPD-based approach. Further discussion on the NMA model based on the PH assumption can be found in Section 5. In the first step, a Cox PH model is fitted individually to each trial to obtain an estimate of the log HR for the treatment effect and its corresponding standard error (SE):

(7) $$\begin{align}{h}_{j, ab}(t)={h}_{0j, ab}(t)\exp \left({\alpha}_{j, ab}{x}_{ij}\right).\end{align}$$

In formula (7), hj,ab(t) is the hazard function for treatment b compared to the baseline treatment a in trial j, h0j,ab(t) is the baseline hazard function for trial j, xij is the treatment indicator variable for patient i from trial j, and αj,ab refers to the HR for a patient receiving treatment b compared to the baseline treatment a in trial j. In the second step, the treatment effect estimates are synthesized through a fixed-effect NMA model.Reference Freeman, Cooper and Sutton7

2.3.5 Generalized gamma model

The generalized gamma model is fitted through a two-step, IPD-based method.Reference Freeman, Cooper and Sutton7 Each trial is independently evaluated by employing the generalized gamma model in the first step to get treatment effect estimates (log HR). In the second step, these treatment effect estimates of the baseline treatment compared to treatment i in trial j are synthesized, and their variability is estimated within a standard fixed-effect NMA model. The probability density function for the generalized gamma model can be found in Supplementary Material 2.

2.3.6 Calculation of HR

For FP, PWE, and PSM, the HR is time-varying, meaning that HR can be written as a function of time. The calculation of HR can be written as formula (8) according to formulas (2)–(6):

(8) $$\begin{align}\left\{\begin{array}{@{}l}\log \left(\mathrm{H}{\mathrm{R}}_{ab}(t)\right)={d}_{0 ab}+{d}_{1 ab}\ast {t}^{-2}\kern0.24em \left(\mathrm{FP}1,\mathrm{power}=-2\right),\\ {}\\ {}\log \left(\mathrm{H}{\mathrm{R}}_{ab}(t)\right)={d}_{0 ab}\left(0\le t<2\right);{d}_{1 ab}\left(t\ge 2\right)\kern0.24em \left(\mathrm{PWE},\mathrm{cutpoint}=2\right),\\ {}\\ {}\log \left(\mathrm{H}{\mathrm{R}}_{ab}(t)\right)={d}_{0 ab}+{d}_{1 ab}\ast \log (t)\kern0.24em \left(\mathrm{PSM},\mathrm{Weibull}\right).\end{array}\right.\end{align}$$

For the Cox PH model, the HR is time-invariant and does not require further calculation. For the generalized gamma model, the direct output of the model is the treatment effect. Typically, researchers have to use some functions in R to get the hazard function and the survival function.Reference Jackson35 In order to get the time-varying HR, the relationship between HRs, the cumulative hazard function, and the survival function should be considered.

3 R Shiny app

3.1 Brief introduction

This web-based tool was constructed through the R Shiny package. Currently, most of the NMA models for survival data can be realized through R. This means that interactive data analysis and visualization with R are allowed. Users who do not have too much experience in R can also use this tool since they do not need to know the exact packages and functions to build models. Even installing R is not necessary since NMAsurv was deployed on shinyapps.io, which is a webserver. NMAsurv can be easily accessed through the URL https://psurvivala.shinyapps.io/NMAsurv/. The names of the R packages used to construct this tool can be found in Supplementary Material 2.

3.2 Functionalities and features

Our goal is to build a user-friendly interface so that users can conduct the NMA based on TTE outcomes conveniently. The interface of the homepage is shown in Figure 1. In addition to enabling users to run NMA models, we also offer additional functions to assist with data preparation before running NMA and model selection afterward. The application consists of six modules, which can be easily accessed through the navigation bar: “Home Page,” “Data Page,” “AD-Based NMA,” “IPD-Based NMA,” “Application Output,” and “User Manual.” Some modules contain subsections that provide additional functionalities.

  • “Data Page” module: This module allows users to upload their own data, transform IPD to AD, do the PH assumption tests, and generate network plots. Three kinds of PH assumption tests are available: Schoenfeld residual plot, Log–Log plot, and Grambsch–Therneau test. For the Network plot, users can create customized network plots by modifying design elements such as line color and background.

  • “AD-Based NMA” module: This module consists of three sections, each corresponding to a different AD-based NMA model. Within each section, the application provides multiple subsections to accommodate various parameter settings, using frequentist or Bayesian frameworks, as well as using random-effects or fixed-effect models. This structure makes it easier for users to compare the results.

  • “IPD-Based NMA” module: This module follows a similar structure to “AD-Based NMA.”

  • “Application Output” module: In this module, users can quickly get familiar with the process and tips of model selection according to the notes provided. Additionally, users can export results in batches for further analysis here.

Figure 1 The interface of the homepage of NMAsurv.

Most operations of this app can be simply finished by “point-and-click” on the web browser. Provided example data can help users quickly get in touch with the functions in this app. All generated results will be shown in texts, tables, or figures, which can be exported and downloaded. In Section 4, we provide an illustrative example of how to use this tool step-by-step.

4 An illustrative example

To illustrate the capacities of NMAsurv, we reanalyzed one NMA based on survival data published in BMC Cancer.Reference Zhao, Shao, Shao, Zhou and Tang36 This NMA aimed to compare the efficacy, safety, and effects on quality of life of different anaplastic lymphoma kinase (ALK)-inhibitors for global and Asian patients with advanced ALK-positive non-small-cell lung cancer (NSCLC). We reanalyzed the results of OS of first-line treatments on global patients. Authors included six RCTs (ALEX, CROWN, ALTA-1L, eXalt 3, PROFILE 1014, and ASCEND-4) with seven treatments (crizotinib, alectinib, lorlatinib, brigatinib, ensartinib, chemotherapy, and ceritinib). In this illustrative example, we used the same data as in the referenced NMA study for all analyses, which were the reconstructed pseudo-IPD based on the algorithm of Guyot et al.Reference Guyot, Ades, Ouwens and Welton19 We presented only the FP model for AD-based NMA and the Cox PH model for IPD-based NMA, as the procedures for running other models are quite similar. We will go through this example based on the following steps: (1) uploading IPD; (2) transforming IPD to AD; (3) doing the PH assumption test; (4) creating the network plot; (5) running the FP model; (6) running the Cox PH model; and (7) generating a report. Please note that the steps outlined here apply only to this example; standardized procedures for conducting an NMA study can be found elsewhere.Reference Cope, Chan and Campbell1 A detailed description of the steps for running all models can be found in the user manual. All data used in this illustrative example can be found in Supplementary Material 3.

Uploading IPD is a compulsory step and should be the first step when using this app. Since NMAsurv is constructed based on the reconstructed pseudo-IPD, we can directly use the reconstructed pseudo-IPD here. To begin, navigate to the “Data Page” module and locate the “Load Data” section. A summary of how to import data and details on the required data format can be found in the “Instruction” panel. After loading the IPD, tables displaying summarized information about the imported data are available in the “Show the Input Data” panel. To run AD-based NMA models, AD is needed. Thus, transforming the IPD to AD in the “Data Transform” panel is the second step. Here, we should set “The max timepoint” and the “Step of the timepoint.” Typically, “The max timepoint” refers to the longest start timepoint in the AD, while “Step of the timepoint” refers to the interval between start timepoints. In this NSCLC-ALK network, since the longest follow-up time is approximately 65 months (PROFILE 1014, chemotherapy), we set “The max timepoint” to 60 and “Step of the timepoint” to 6. After clicking the “Transform to AD” button, followed by the “Use Transformed AD to Analysis” button, the AD is successfully generated and ready for analysis. (as shown in Figure 2a,b). In addition to using transformed AD as we did in this example, users can also choose to import AD. Before performing model analysis, the data source used in the analysis can be verified through the “Record” panel in the sidebar.

Figure 2 The interface of the “Data Transform” panel.

The PH assumption test is the third step in our illustrative example, and it is a crucial step in TTE-based NMA. Rich studies on the PH assumption test can be found elsewhere.Reference Grambsch and Therneau37, Reference Schoenfeld38 In the “PH assumption test” panel, we can select the studies and treatments to run the tests. Three PH assumption test methods are available, which can be chosen through the drop-down selection box. An example of Schoenfeld’s residual plot for ALEX is shown in Figure 3. All generated results for the NSCLC-ALK network are provided in Supplementary Material 2. According to the results, the Cox PH model is enough since the PH assumption holds. However, to demonstrate the functions in AD-based NMA, we still ran the FP model.

Figure 3 The interface of the “PH assumption test” panel.

The fourth step involves creating the network plot. Since the data format for this step differs from that of IPD and AD, new data should be imported here. The data can be viewed in the table displayed in the main panel, as shown in Figure 4. A variety of customization options are available in the sidebar panel, including “the title of the plot,” “color of the line,” “multi-arm or not,” “color of the multi-arm area,” “color of the points,” “color of the interior of words,” and “color of the edge of words.” After clicking the button to draw the plot, the final network plot is displayed in the main panel (as shown in Figure 4).

Figure 4 The interface of the “Network Plot” panel.

In the fifth step of this example, we ran the FP model to show a common procedure for running AD-based NMA models. First, we set the “Reference Study,” “Reference Treatment,” and “Extrapolation Time” through the text input box. In the NSCLC-ALK network, we selected “ALEX” as the reference study, “crizotinib” as the reference treatment, and 10 years as the extrapolation time. By clicking the “Run FP NMA (Frequentist analysis)” button, we obtained the step-one results for all FP models (as shown in Figure 5a). Then, we selected the model with the best statistical performance to run in the “FP Step Two (FP1)” panel. In this illustrative example, the FP1 models with powers of two, three, and one ranked top three. We finally selected the FP1 model with a power of one since overfitting was observed in subsequent runs of the other two models (more details can be found in the user manual of this app). A quick overview of the “Input Panel” and the results of Bayesian analysis using the fixed-effect model are shown in Figure 5b,c. More detailed results can be found in Supplementary Material 2. Based on the survival plot and the HR plot, we concluded that lorlatinib had the highest survival rate before the 45th month, while alectinib took the lead after the 45th month. The “Rhat” values in the coefficient table for Bayesian analysis showed that the model converges well. Additionally, parameter estimates from the frequentist and Bayesian analyses were highly consistent, indicating the robustness of the results. To illustrate parameter interpretation, we took alectinib versus crizotinib in the frequentist analysis as an example. The two parameters were 0.088 and −0.021, which indicated that the log HR can be written in the following formula: $\mathrm{Log}\;\mathrm{HR}=0.088-0.021\ast \mathrm{time}$ . For the fixed-effect model, NMAsurv provides downloads of extrapolated HRs.

Figure 5 The interface of the panels in the FP model.

The sixth step of this illustrative example involves running the Cox PH mode. The results can be easily obtained by clicking the button to run the model. HRs compared to the reference treatment are displayed in the main panel (as shown in Figure 6). Detailed information can be found in Supplementary Material 2. According to the results, all ALK inhibitors demonstrated HRs less than 1; however, their 95% confidence intervals (CIs) crossed 1. This suggested that, within this NSCLC-ALK network, the advantages of ALK inhibitors over crizotinib were not statistically significant. These findings might be different from the results obtained from other methods like “gemtc” or “netmeta” in R. A potential explanation is that the method used in this app takes the IPD into consideration, while “gemtc” and “netmeta” used RCT reported HRs with 95% CIs only.Reference Shim, Kim, Lee and Rucker13

Figure 6 The interface of the panels in the Cox PH model.

The final step of this example is to generate a report with selected results. NMAsurv provides an “Output Report” panel that enables users to export the results in batches. By clicking the “Export the results of this page” buttons as illustrated in Figures 5c and 6, the names of the selected model outcomes are displayed as checkboxes as shown in Figure 7. Then, we selected the desired outcomes by checking these boxes and then generated a report by clicking the “Download the Word File” button. The report included coefficient tables, survival plots, hazard plots, and HR plots, providing a detailed summary of the analyses.

Figure 7 The interface of the “Output report” panel.

5 Discussion

In this article, we introduced NMAsurv, a newly developed R Shiny tool designed for NMA based on survival data. With the emergence of innovative therapies, NMA, especially NMA under non-PH assumption, is becoming increasingly important in medical research. Flexible modeling techniques can be good alternatives to the traditional Cox PH model when the PH assumption is violated. However, methodologies of these methods are often complex and require users to have good programming skills. Shiny in R is a web-based interactive tool that enables users to run R functions using their own data without programming. Thus, users who have little knowledge of R programming or even non-R users can conduct NMA for TTE data through our tool. Moreover, NMAsurv offers a comprehensive set of results that cater to the needs of different users. The plotted survival curves provide clear insights for researchers assessing the long-term survival benefit differences between treatments, while the extrapolated HRs, available for download, support researchers conducting subsequent cost-effectiveness analyses.

It is important to highlight that NMA based on TTE outcomes can be directly established using trial-specific HRs,Reference Liu, Bai and Wang39, Reference Minakata, Fujiwara and Yokoyama40 which relies on the PH assumption. Typically, these HRs are derived from Cox PH models developed by statistical analysts working on the trials. Researchers can calculate the logarithm of these HRs along with their SEs, which can then be input into statistical software such as R, Stata, and Gemtc to compute relative log HRs. However, we did not include this method in NMAsurv for two reasons: (1) it is already well supported by numerous tools that offer simple and comprehensive tutorials, and (2) it does not require IPD or AD, making it less aligned with the focus of NMAsurv on TTE outcomes. Instead, in NMAsurv, we adopted an alternative approach to conducting NMA under the PH assumption. Specifically, we applied the two-step, IPD-based approach developed by Freeman et al.Reference Freeman, Cooper and Sutton7 We provide this function to enable users to directly analyze imported data when they determine that the PH assumption holds in their studies, thereby eliminating the need to switch to alternative software or reformat data for analysis.

Five models were included in NMAsurv, namely the Cox PH model, FP model, PWE model, PSM, and generalized gamma model, which are commonly used and well developed.Reference Freeman, Cooper and Sutton7, Reference Heeg, Garcia and Beekhuizen10 Other existing models, such as cure models, the RP model, and the RMST model, were excluded from the current version of NMAsurv due to practical constraints. The current practice of constructing cure models, including the mixture cure model and non-mixture cure model, is to use Stan, which is a probabilistic programming language for statistical inference written in C++.Reference Heeg, Garcia and Beekhuizen10, Reference Heeg, Verhoek and Tremblay41 In addition, cure models are the best choice when the cure is clinically realistic or can be estimated.Reference Heeg, Garcia and Beekhuizen10 The results will be biased if the survival data are immature, or the cure is estimable for the treatment group but not for the reference group. The RP model is a highly flexible approach that accommodates extreme changes in hazard functions over time. It is considered more adaptable and robust than FP models, as it reduces the likelihood of undesirable end effects and requires fewer iterations for convergence.Reference Freeman and Carpenter27 However, its complex and data-specific coding structure and increased computational burden present challenges for the implementation of simple “point-and-click” operations.Reference Freeman, Cooper and Sutton7, Reference Freeman and Carpenter27 The RMST model is an alternative to the HR when there is evidence of non-PH. However, a key limitation of this approach is that it does not support extrapolation, nor does it provide estimates of survival rates.Reference Freeman, Cooper and Sutton7 Additionally, like the RP model, its data-specific coding structure makes it difficult to be included in an application. Given these challenges, our tool focuses on models that are both well-developed and feasible for implementation in a user-friendly interface. Nonetheless, researchers are encouraged to try and find more modeling approaches than those provided by NMAsurv based on the characteristics of their data and the features of different models. Further information on the methodologies and corresponding codes is available elsewhere.Reference Cope, Chan and Campbell1, Reference Freeman, Cooper and Sutton7, Reference Heeg, Garcia and Beekhuizen10, Reference Wiksten, Hawkins, Piepho and Gsteiger22, Reference Jansen23, Reference Ouwens, Philips and Jansen25, Reference de Jong, Moons and Riley29

Despite NMAsurv making the model construction in NMA research easier to realize, it has several limitations. First, this study was conducted based on reconstructed pseudo-IPD without taking individual patient characteristics into consideration, which is a popular way in NMA for TTE data. If researchers have true IPD with individual patient characteristics, we highly recommend them to try additional IPD-based methods. For two-study comparisons, methods such as STC and MAIC can be used. For larger treatment networks, multilevel network meta-regression is a suitable approach.Reference Phillippo, Ades and Dias16, Reference Phillippo, Dias, Elsada, Ades and Welton17, Reference Maciel, Jansen and Klijn42 Second, this tool only supports NMA methods that are well developed. While cure models, RP models, and RMST models were excluded from this version, these models may be included in future updates. Additionally, the random-effects model for all parameters in FP, whose methodology is still controversial, was not included in our tool. Third, the rank plot, which can be an important output in NMA, is not included in our tool. This is because the current treatment ranking code relies on the WinBUGS-specific function “ranked,” which is not available in JAGS. The reason we did not use WinBUGS is that codes based on WinBUGS cannot be deployed on the server (shinyapps.io, Linux). Fourth, although NMAsurv provides extrapolated HRs for download, which can be used in CEA, users should pay attention when using these estimates, especially when their research data are immature.Reference Palmer, Borget and Friede43, Reference Shao, Zhao, Liang, Shi and Tang44 Relying on these potentially imprecise estimates may lead to research bias. While some methods have been applied in published studies to reduce this bias,Reference Wang, Hong and Alexander45 Reference Pei, Shi and Lv47 we strongly recommend that researchers perform comprehensive sensitivity analyses to thoroughly assess the impact of extrapolation. Finally, our tool is designed to generate statistical results and plots rather than explicitly recommending a specific model for users to choose. This serves as an important reminder: model selection is a complex process, and relying solely on statistical outcomes is not sufficient to make an informed decision. It is also crucial to assess the presence of overfitting by visual inspection by users themselves. To ensure transparency and reliability, following a standardized process for model selection is highly recommended.Reference Palmer, Borget and Friede43 Although NMAsurv does not directly provide a conclusion for model selection, it enables users to export results in batches and offers several helpful notes to guide and inform the model selection process.

Future extensions of NMAsurv will include two directions. First, adding a survival data extrapolation function will be beneficial to the output of IPD-based NMA. Currently, only HRs or relative treatment effects are reported in IPD-based methods, while the report of survival plots or HR plots needs the extrapolation of survival data. Second, integrating more IPD-based methods will be the main direction. Taking covariates into consideration and including methods like the RP, RMST models will make this tool better.Reference Freeman, Cooper and Sutton7, Reference Freeman48 Future studies may focus on the simplification of algorithms in these complex models.

Acknowledgements

We would like to thank Leyi Liang and Hanqiao Shao for their advice on the design of the R Shiny app.

Author contributions

Methodology: T.S., M.Z.; Review and editing the manuscript: all authors; Software development: T.S.; Software testing and improvement: M.Z., F.S., M.R.; Study design: T.S., M.Z., W.T.; Writing the draft manuscript: T.S. All authors provided a substantial contribution to the design and implementation of this R Shiny app, as well as writing and editing the manuscript, and approving the final version .

Competing interest statement

The authors declare that no competing interests exist.

Data availability statement

The software in this paper is available at https://psurvivala.shinyapps.io/NMAsurv/. The source codes of the software are freely available on GitHub (https://github.com/TaihangShao/NMAsurv), and the data presented in this paper can be found in the Supplementary Material.

Funding statement

This research was supported by the General Program of the National Natural Science Foundation of China (Grant No. 72174207).

Supplementary material

To view supplementary material for this article, please visit http://doi.org/10.1017/rsm.2025.10020.

Footnotes

T.S. and M.Z. contributed equally to this work.

References

Cope, S, Chan, K, Campbell, H et al. A comparison of alternative network meta-analysis methods in the presence of nonproportional hazards: a case study in first-line advanced or metastatic renal cell carcinoma. Value Health. 2023;26(4):465476.Google ScholarPubMed
Lumley, T. Network meta-analysis for indirect treatment comparisons. Stat Med. 2002;21(16):23132324.Google ScholarPubMed
Schwingshackl, L, Schwarzer, G, Rucker, G, Meerpohl, JJ. Perspective: network meta-analysis reaches nutrition research: current status, scientific concepts, and future directions. Adv Nutr. 2019;10(5):739754.Google ScholarPubMed
Batson, S, Greenall, G, Hudson, P. Review of the reporting of survival analyses within randomised controlled trials and the implications for meta-analysis. PLoS One. 2016;11(5):e0154870.Google ScholarPubMed
Moher, D, Hopewell, S, Schulz, KF et al. CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials. J Clin Epidemiol. 2010;63(8):e1e37.Google ScholarPubMed
Royston, P, Parmar, MK. Restricted mean survival time: an alternative to the hazard ratio for the design and analysis of randomized trials with a time-to-event outcome. BMC Med Res Methodol. 2013;13:152.Google Scholar
Freeman, SC, Cooper, NJ, Sutton, AJ et al. Challenges of modelling approaches for network meta-analysis of time-to-event outcomes in the presence of non-proportional hazards to aid decision making: application to a melanoma network. Stat Methods Med Res. 2022;31(5):839861.Google Scholar
Li, H, Han, D, Hou, Y, Chen, H, Chen, Z. Statistical inference methods for two crossing survival curves: a comparison of methods. PLoS One. 2015;10(1):e0116774.Google ScholarPubMed
Schadendorf, D, Hodi, FS, Robert, C et al. Pooled analysis of long-term survival data from phase II and phase III trials of ipilimumab in unresectable or metastatic melanoma. J Clin Oncol. 2015;33(17):18891894.Google ScholarPubMed
Heeg, B, Garcia, A, Beekhuizen, SV et al. Novel and existing flexible survival methods for network meta-analyses. J Comp Eff Res. 2022;11(15):11211133.Google Scholar
Freeman, SC, Sutton, AJ, Cooper, NJ. Uptake of methodological advances for synthesis of continuous and time-to-event outcomes would maximize use of the evidence base. J Clin Epidemiol. 2020;124:94105.Google ScholarPubMed
Owen, RK, Bradbury, N, Xin, Y, Cooper, N, Sutton, A. MetaInsight: an interactive web-based tool for analyzing, interrogating, and visualizing network meta-analyses using R-Shiny and netmeta. Res Synth Methods. 2019;10(4):569581.Google ScholarPubMed
Shim, SR, Kim, SJ, Lee, J, Rucker, G. Network meta-analysis: application and practice using R software. Epidemiol Health. 2019;41:e2019013.Google ScholarPubMed
van, Valkenhoef G, Lu, G, de Brock, B et al. Automating network meta-analysis. Res Synth Methods. 2012;3(4):285299.Google Scholar
Zhao, J, van Valkenhoef, G, de Brock, EO, Hillege, HL. ADDIS: an automated way to do network meta-analysis. SOM Research Reports, 12007-Other. 2012. University of Groningen, SOM Research School.Google Scholar
Phillippo, DM, Ades, AE, Dias, S et al. Methods for population-adjusted indirect comparisons in health technology appraisal. Med Decis Making. 2018;38(2):200211.Google ScholarPubMed
Phillippo, DM, Dias, S, Elsada, A, Ades, AE, Welton, NJ. Population adjustment methods for indirect comparisons: a review of National Institute for Health and Care Excellence technology appraisals. Int J Technol Assess Health Care. 2019;35(3):221228.Google ScholarPubMed
Jiang, Z, Cappelleri, JC, Gamalo, M et al. A comprehensive review and shiny application on the matching-adjusted indirect comparison. Res Synth Methods. 2024;15(4):671686.Google ScholarPubMed
Guyot, P, Ades, AE, Ouwens, MJ, Welton, NJ. Enhanced secondary analysis of survival data: reconstructing the data from published Kaplan–Meier survival curves. BMC Med Res Methodol. 2012;12:9.Google ScholarPubMed
Liu, N, Zhou, Y, Lee, JJ. IPDfromKM: reconstruct individual patient data from published Kaplan–Meier survival curves. BMC Med Res Methodol. 2021;21(1):111.Google ScholarPubMed
Wei, Y, Royston, P. Reconstructing time-to-event data from published Kaplan–Meier curves. Stata J. 2017;17(4):786802.Google ScholarPubMed
Wiksten, A, Hawkins, N, Piepho, HP, Gsteiger, S. Nonproportional hazards in network meta-analysis: efficient strategies for model building and analysis. Value Health. 2020;23(7):918927.Google ScholarPubMed
Jansen, JP. Network meta-analysis of survival data with fractional polynomials. BMC Med Res Methodol. 2011;11:61.Google ScholarPubMed
Cope, S, Chan, K, Jansen, JP. Multivariate network meta-analysis of survival function parameters. Res Synth Methods. 2020;11(3):443456.Google ScholarPubMed
Ouwens, MJ, Philips, Z, Jansen, JP. Network meta-analysis of parametric survival curves. Res Synth Methods. 2010;1(3-4):258271.Google ScholarPubMed
Wiecek, W, Karcher, H. Nivolumab versus cabozantinib: comparing overall survival in metastatic renal cell carcinoma. PLoS One. 2016;11(6):e0155389.Google ScholarPubMed
Freeman, SC, Carpenter, JR. Bayesian one-step IPD network meta-analysis of time-to-event data using Royston–Parmar models. Res Synth Methods. 2017;8(4):451464.Google ScholarPubMed
Petit, C, Blanchard, P, Pignon, JP, Lueza, B. Individual patient data network meta-analysis using either restricted mean survival time difference or hazard ratios: is there a difference? A case study on locoregionally advanced nasopharyngeal carcinomas. Syst Rev. 2019;8(1):96. Published 2019 Apr 15. doi: 10.1186/s13643-019-0984-x Google ScholarPubMed
de Jong, V, Moons, K, Riley, RD et al. Individual participant data meta-analysis of intervention studies with time-to-event outcomes: a review of the methodology and an applied example. Res Synth Methods. 2020;11(2):148168.Google Scholar
Saramago, P, Chuang, LH, Soares, MO. Network meta-analysis of (individual patient) time to event data alongside (aggregate) count data. BMC Med Res Methodol. 2014;14:105.Google ScholarPubMed
CHTE2020 SOURCES AND SYNTHESIS OF EVIDENCE; UPDATE TO EVIDENCE SYNTHESIS METHODS; 2020. Accessed April 14, 2022.Google Scholar
NICE DSU technical support document 21: flexible methods for survival analysis; 2022. Accessed April 3, 2022.Google Scholar
Jansen, JP, Cope, S. Meta-regression models to address heterogeneity and inconsistency in network meta-analysis of survival outcomes. BMC Med Res Methodol. 2012;12:152.Google ScholarPubMed
Bowden, J, Tierney, JF, Simmonds, M, Copas, AJ, Higgins, JP. Individual patient data meta-analysis of time-to-event outcomes: one-stage versus two-stage approaches for estimating the hazard ratio under a random effects model. Res Synth Methods. 2011;2(3):150162.Google ScholarPubMed
Jackson, CH. flexsurv: a platform for parametric survival modeling in R. J Stat Softw. 2016;70(8):133.Google ScholarPubMed
Zhao, M, Shao, T, Shao, H, Zhou, C, Tang, W. Identifying optimal ALK inhibitors in first- and second-line treatment of patients with advanced ALK-positive non-small-cell lung cancer: a systematic review and network meta-analysis. BMC Cancer. 2024;24(1):186. Published 2024 Feb 8. doi: 10.1186/s12885-024-11916-4 Google ScholarPubMed
Grambsch, PM, Therneau, TM. Proportional hazards tests and diagnostics based on weighted residuals. Biometrika. 1994;81(3):515526.Google Scholar
Schoenfeld, D. Partial residuals for the proportional hazards regression model. Biometrika. 1982;69(1):239241.Google Scholar
Liu, L, Bai, H, Wang, C et al. Efficacy and safety of first-line immunotherapy combinations for advanced NSCLC: a systematic review and network meta-analysis. J Thorac Oncol. 2021;16(7):10991117.Google ScholarPubMed
Minakata, D, Fujiwara, S, Yokoyama, D et al. Relapsed and refractory multiple myeloma: a systematic review and network meta-analysis of the efficacy of novel therapies. Br J Haematol. 2023;200(6):694703.Google ScholarPubMed
Heeg, B, Verhoek, A, Tremblay, G et al. Bayesian hierarchical model-based network meta-analysis to overcome survival extrapolation challenges caused by data immaturity. J Comp Eff Res. 2023;12(3):e220159.Google ScholarPubMed
Maciel, D, Jansen, JP, Klijn, SL et al. Implementing multilevel network meta-regression for time-to-event outcomes: a case study in relapsed refractory multiple myeloma. Value Health. 2024;27(8):10121020.Google ScholarPubMed
Palmer, S, Borget, I, Friede, T et al. A guide to selecting flexible survival models to inform economic evaluations of cancer immunotherapies. Value Health. 2023;26(2):185192.Google ScholarPubMed
Shao, T, Zhao, M, Liang, L, Shi, L, Tang, W. Impact of extrapolation model choices on the structural uncertainty in economic evaluations for cancer immunotherapy: a case study of Checkmate 067. Pharmacoecon Open. 2023;7(3):383392.Google ScholarPubMed
Wang, L, Hong, H, Alexander, GC et al. Cost-effectiveness of systemic treatments for metastatic castration-sensitive prostate cancer: an economic evaluation based on network meta-analysis. Value Health. 2022;25(5):796802.Google ScholarPubMed
Gebski, V, Gares, V, Gibbs, E, Byth, K. Data maturity and follow-up in time-to-event analyses. Int J Epidemiol. 2018;47(3):850859.Google ScholarPubMed
Pei, R, Shi, Y, Lv, S et al. Nivolumab vs pembrolizumab for treatment of US patients with platinum-refractory recurrent or metastatic head and neck squamous cell carcinoma: a network meta-analysis and cost-effectiveness analysis. JAMA Netw Open. 2021;4(5):e218065.Google ScholarPubMed
Freeman, SC. Individual patient data meta-analysis and network meta-analysis. Methods Mol Biol. 2022;2345:279298.Google ScholarPubMed
Figure 0

Table 1 A brief introduction of included models

Figure 1

Table 2 A summary of the five implemented methods and their data/model assumptions

Figure 2

Figure 1 The interface of the homepage of NMAsurv.

Figure 3

Figure 2 The interface of the “Data Transform” panel.

Figure 4

Figure 3 The interface of the “PH assumption test” panel.

Figure 5

Figure 4 The interface of the “Network Plot” panel.

Figure 6

Figure 5 The interface of the panels in the FP model.

Figure 7

Figure 6 The interface of the panels in the Cox PH model.

Figure 8

Figure 7 The interface of the “Output report” panel.

Supplementary material: File

Shao et al. supplementary material

Shao et al. supplementary material
Download Shao et al. supplementary material(File)
File 4.3 MB