Hostname: page-component-54dcc4c588-5q6g5 Total loading time: 0 Render date: 2025-10-07T11:56:03.990Z Has data issue: false hasContentIssue false

Ensuring robustness in scientific research, split-root assays as an example case

Published online by Cambridge University Press:  13 August 2025

Lucila Salvatore*
Affiliation:
Experimental and Computational Plant Development group, Utrecht University , Utrecht, The Netherlands Theoretical Biology group, Utrecht University , Utrecht, The Netherlands
Ronald Pierik
Affiliation:
Experimental and Computational Plant Development group, Utrecht University , Utrecht, The Netherlands Laboratory of Molecular Biology, Wageningen University and Research , Wageningen, The Netherlands
Kaisa Kajala
Affiliation:
Experimental and Computational Plant Development group, Utrecht University , Utrecht, The Netherlands
Kirsten ten Tusscher*
Affiliation:
Experimental and Computational Plant Development group, Utrecht University , Utrecht, The Netherlands Theoretical Biology group, Utrecht University , Utrecht, The Netherlands
*
Corresponding authors: Lucila Salvatore and Kirsten ten Tusscher; Emails: l.salvatore@uu.nl, K.H.W.J.tentusscher@uu.nl
Corresponding authors: Lucila Salvatore and Kirsten ten Tusscher; Emails: l.salvatore@uu.nl, K.H.W.J.tentusscher@uu.nl

Abstract

Scientific progress relies on reproducibility, replicability, and robustness of research outcomes. After briefly discussing these terms and their significance for reliable scientific discovery, we argue for the importance of investigating robustness of outcomes to experimental protocol variations. We highlight challenges in achieving robust, replicable results in multi-step plant science experiments, using split-root assays in Arabidopsis thaliana as a case study. These experiments are important for unraveling the contributions of local, systemic and long-distance signalling in plant responses and play a central role in nutrient foraging research. The complexity of these experiments allows for extensive variation in protocols. We investigate what variations do or do not result in similar outcomes and provide concrete recommendations for enhancing the replicability and robustness of these and other complex experiments by extending the level of detail in research protocols.

Information

Type
Perspective
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press in association with John Innes Centre

1. Introduction

While the general public may imagine scientific progress to arise from isolated sparks of genius of individual scientists, science to a large extent is an incremental and collaborative endeavor. For science to progress in an efficient manner time- and resource-wise, it is critical that results can be repeated and built upon. The apparent lack of reproducibility and repeatability of results both inside and outside of the life sciences has received considerable interest over the last decade (Baker, Reference Baker2016; Hicks, Reference Hicks2023) and has resulted in a strong focus on making articles, protocols, analyses, simulation codes, and results publicly available and reproducible. While FAIR principles are nowadays broadly followed (Jacobsen et al., Reference Jacobsen, de Miranda Azevedo, Juty, Batista, Coles, Cornet, Courtot, Crosas, Dumontier, Evelo, Goble, Guizzardi, Hansen, Hasnain, Hettne, Heringa, Hooft, Imming, Jeffery and Schultes2020; Wilkinson et al., Reference Wilkinson, Dumontier, Aalbersberg, Appleton, Axton, Baak, Blomberg, Boiten, da Silva Santos, Bourne, Bouwman, Brookes, Clark, Crosas, Dillo, Dumon, Edmunds, Evelo, Finkers and Mons2016), recreating results from complex protocols is far from trivial, and determining the causal factor underlying different outcomes can be a substantial burden.

In scientific research, the term reproducibility is typically reserved for the capacity to generate quantitatively identical results when using the same methods and conditions. For data analysis and computational biological research, in theory, full reproducibility can be achieved if data, analysis protocols, and codes are available. In practice, limited documentation on the settings of codes and preprocessing of data may still hamper this reproducibility. In experimental research, generating identical results on biological systems even in the same lab with the same equipment, conditions, and person executing the experiment is highly unlikely due to noise from both biological sources and experimental execution. Therefore, under the Claerbout/Donoho/Peng convention, in experimental biology we speak of replicability when experiments, performed under the same conditions, produce quantitatively and statistically similar results (Barba, Reference Barba2018). Still, even striving for replicability rather than reproducibility can be a daunting task, sometimes even resulting in the somewhat ominously sounding phrase ‘in our hands’ in some papers.

One may think that striving for replicability and reproducibility is sufficient to collect the best quality data and gain new knowledge in the most efficient possible manner. However, robustness of outcomes may also inform us about the significance of the biological phenomenon we are observing. What we define as robustness here, for experimental biology, is the capacity to generate similar outcomes also in slightly different conditions (Kitano, Reference Kitano2004). In computational biology, where we create models to simulate biological phenomena of interest, it is common practice to investigate the robustness of models to changes in parameters or model assumptions. The general idea is that for a reliable model, outcomes should only depend significantly on certain parameters, e.g. whether we simulate growth under normal versus drought conditions or the stability of a key protein yet should remain relatively constant to moderate changes in most parameters.

A robust model is more likely to simulate the right behaviour for the right reasons than a model that is sensitive to all its finely tweaked parameter values. In a similar vein, it would be informative for experimental protocols to have knowledge about which changes in the protocol have substantial effects on outcomes and which changes are buffered against. In plant biology, robust outcomes of experiments under slight variations in protocol are more likely to be outcomes that are relevant in natural conditions. This increased relevance stems from the fact that natural conditions constitute a more variable environment compared to the environment in the experiment. Additionally, experimental protocols with robust outcomes would enhance the potential for similar research to be performed in e.g. labs with less funding or different equipment, by allowing some flexibility in concentrations, time windows, or equipment used (Nosek et al., Reference Nosek, Hardwicke, Moshontz, Allard, Corker, Dreber, Fidler, Hilgard, Struhl, Nuijten, Rohrer, Romero, Scheel, Scherer, Schönbrodt and Vazire2022). However, even if methods in papers or (e-) lab journals are fully complete, most times it is not clear whether a particular aspect of a protocol was optimized for, or if it rather results from habit or random choice. While it is not trivial to vary and investigate for each and every aspect of a protocol whether it is essential, for some aspects this information is available. Despite being available, this type of information is typically not provided in either protocols or methods sections. It is of great importance to be aware that the way we write our materials and methods and what we omit can be decisive for the success of future research projects.

Here, we will use split-root assays to investigate nutrient foraging in Arabidopsis thaliana as an example case. Split-root experiments are used in a wide variety of plant species and for various types of research fields and have the power to discern local from systemic responses. The main goal of these types of assays is to divide the root system architecture (RSA) into halves and expose each half to different environments. In the case of plant nutrient foraging, these studies are of major importance for unraveling the systemic signaling pathways that indicate the demand for nutrients against local supply, enabling plants to preferentially invest in root growth in locations of high nutrient supply (Hackett, Reference Hackett1972; Ruffel et al., Reference Ruffel, Krouk, Ristova, Shasha, Birnbaum and Coruzzi2011; Torres-Martínez et al., Reference Torres-Martínez, Rodríguez-Alonso, Shishkova and Dubrovsky2019; Zhang & Forde, Reference Zhang and Forde1998; Zhang et al., Reference Zhang, Jennings, Barlow and Forde1999). We will illustrate the practical problems we encountered with obtaining robust replicable results based on published methods for Arabidopsis split-root heterogeneous nitrate supply experiments. We will also suggest solutions and recommendations for alleviating these problems that are of wider applicability to complex experimental plant science protocols.

Split-root experiments have been applied to a wide variety of species and for a range of research questions (Saiz-Fernández et al., Reference Saiz-Fernández, Černý, Skalák and Brzobohatý2021). As a consequence, a wide variety of protocols exists, ranging from simply dividing a well-developed root system over two pots (Schortemeyer et al., Reference Schortemeyer, Feil and Stamp1993), to splitting the main root of plants with thick enough roots in two halves (Remans et al., Reference Remans, Nacry, Pervent, Filleur, Diatloff, Mounier, Tillard, Forde and Gojon2006), and from grafting an additional main root (Kassaw & Frugoli, Reference Kassaw and Frugoli2012) to cutting off the main root after two lateral roots (LRs) have developed to use these two laterals in the two different nutrient compartments (Remans et al., Reference Remans, Nacry, Pervent, Filleur, Diatloff, Mounier, Tillard, Forde and Gojon2006). These protocols have been reviewed in detail elsewhere (Saiz-Fernández et al., Reference Saiz-Fernández, Černý, Skalák and Brzobohatý2021).

Even if we constrain our analysis to split-root experiments in Arabidopsis grown on agar plates used to analyze nitrate foraging, where the main root is cut away after two laterals have formed, still a large variety of experimental protocols exist (Table 1 ). Protocols differ in the duration and number of growth steps, concentrations of high and low nitrate used, light levels, sucrose concentration in the media, and other parameters. Additionally, protocols may differ in the analysis of root growth and architecture, with some studies focusing e.g. on differences in overall root system growth between two root halves whereas others appear to focus only on specific parts of the root system.

Table 1 Comparison of split-root assays used to investigate nitrogen foraging in Arabidopsis thaliana

Note: The table shows the wide variation in nitrogen concentrations in high nitrogen (HN) and low nitrogen (LN) treatments, other media components and their concentrations, length of the protocol, duration of the treatment, light intensity and photoperiod, and temperature conditions.

Despite this variation, all the papers included in Table 1 robustly observe preferential foraging, i.e. the preferential investment in root growth at the side of the split-root system where the plant experiences the highest nitrate levels (HNln) (Figure 1 HNln > LNhn ). However, in the seminal paper by Ruffel et al. (Reference Ruffel, Krouk, Ristova, Shasha, Birnbaum and Coruzzi2011) it was additionally reported that in plants grown in heterogeneous nitrate, the high nitrate (HNln) side invests more in root growth compared to plants where both sides experience high nitrate (HNHN) (Figure 1 HNln > HNHN). Additionally, the low nitrate (LNhn) side invests less in root growth compared to roots grown in a homogeneous low nitrate (LNLN) split-root set-up (Figure 1 LNhn < LNLN ). These results have been taken as an important hallmark for demand and supply signalling, and hence, robustness of these phenotypes may be critical to fully sample the repertoire of local, systemic, and long-distance signals involved in root foraging decisions.

Figure 1. Schematic illustration of the Ruffel et al. (Reference Ruffel, Krouk, Ristova, Shasha, Birnbaum and Coruzzi2011) results for split-root experiments with different nitrogen concentrations. The cartoon plates represent real plates after a nitrogen split-root assay, and the box plot represents expected results for lateral root (LR) traits such as summed LR length, number and density. The colors surrounding each plant correspond with the colors of the boxplot.

Our aim was to arrive at a robust reproducible protocol recapitulating the Ruffel et al. (Reference Ruffel, Krouk, Ristova, Shasha, Birnbaum and Coruzzi2011) results which were as simple and efficient as possible. To this end, we started with the protocol described in Figure 2 by the black typed text. As we tested the effect of various parameters, the protocol was updated as explained in the next sections.

Figure 2. Representative experimental protocol. Typically, some steps are adjusted through repetition to achieve the most replicable outcome in the most efficient way possible. The black typed text represents the original protocol, and the adjustments are visualized as ‘post-it notes’ or red ‘handwriting’ comments and the most important of these additions are discussed in detail in the main text.

To investigate the robustness of plant root nitrogen foraging responses to variations in protocol we will focus on three major aspects. The first is the preferential foraging response, where the root system of the high nitrogen side (HNln) should be more developed than that low nitrogen side (LNhn). The second aspect is the comparison between the two homogeneous treatments, where we expect a stronger investment in root growth in LNLN plants than in HNHN plants given previous observations of foraging versus systemic repression responses (Giehl & von Wirén, Reference Giehl and von Wirén2014; Gruber et al., Reference Gruber, Giehl, Friedel and von Wirén2013; Ruffel et al., Reference Ruffel, Krouk, Ristova, Shasha, Birnbaum and Coruzzi2011). Finally, the roots from the HNln treatment are expected to outgrow HNHN roots, consistent with demand signalling, while the roots from the LNhn treatment are expected to be less developed compared to LNLN roots, consistent with supply signalling (Figure 1).

2. Results

2.1. A secret numbers game

Methods sections typically describe what is done to plant materials, at which time point, with which concentrations, and so on (Varapparambath et al., Reference Varapparambath, Mathew, Shanmukhan, Radhakrishnan, Kareem, Verma, Ramalho, Manoj, Vellandath, Aiyaz, Radha, Landge, Mähönen, Heisler, Weijers and Prasad2022). More extensive protocols, published as stand-alone papers additionally provide a shopping list of required materials and equipment as well as describe more extensively the sequence of steps involved in the experiment, the time they take, and potential problems and solutions (Bjornson et al., Reference Bjornson, Song, Dandekar, Franz, Drakakaki and Dehesh2015; Mathew et al., Reference Mathew, Shanmukhan, Varapparambath and Prasad2023). Perhaps surprisingly, information on the number of seeds or plants that the experiments started with, and the percentage kept for subsequent experimental steps are only infrequently and incompletely provided (Greenwood et al., Reference Greenwood, Hall, Locke, Staiger, Davis and Davis2022; Xuan et al., Reference Xuan, Opdenacker, Vanneste, Beeckman, Ristova and Barbez2018). These differences between the initial number of seeds and plants kept throughout the experiment may result from failure to germinate or recover, or substantial differences in development or growth. Particularly in the case of complex multi-step protocols information on these numbers is important.

To illustrate this point, based on the experience in our lab, where previously root growth and architecture phenotyping was done only on relatively young, intact root systems, typically the number of seeds used for a protocol is five times the number of plants eventually used for phenotyping. This 5-fold increase comes from the observation that on average 20% of seeds display efficient and homogeneous germination and seedling establishment, with remaining seeds failing to germinate or slowly germinating. These latter seedlings are discarded to avoid heterogeneity affecting the outcomes of our experiments. Thus, given that we required 30 plants for split-root phenotyping experiments, 8 for HN-HN, 8 for LN-LN and 14 for HN-LN combinations, we initially started with 150 seeds.

However, 150 seeds did not nearly result in 30 plants suitable for split-root experiments. Instead, due to the many steps in a split-root protocol, substantial numbers of additional plants needed to be discarded at various stages. First, for cutting the main root, only plants that have grown two LRs at the time point of cutting can be used. Alternatively, if cutting is not done below the first two LRs, but rather position-based, only plants developing two LRs after cutting can be maintained for further experiments as each half of the split-root plate requires a main root (Figure 1). In both cases, after the recovery phase, plants developing adventitious roots (AR) at the root-shoot junction and at the cutting point need to be discarded as well, as these are different types of roots and alter the root system structure (explained in section 2.4.2). Additionally, only plants with approximately similarly sized LRs can be deployed for the actual split-root treatment as otherwise, the initial size and resource differences may obscure differences in response to nutrient conditions.

After substantial trial and error and consultation with one of the authors of previously published split-root experiments, we arrived at a protocol starting with 300 seeds. We selected this number considering two major losses. First, loss rates of 50% of the plants in the cutting step, due to insufficient development towards a large enough root, where we set the threshold to having a main root longer than 1.5 cm. Second, a loss of around 70% of the cut and recovered plants during the selection for the split-root plates, discarding here those plants that have grown either adventitious roots, extra roots, or whose two LRs are not similarly sized and/or dissimilar from the mean LRs length in the experiment. Overall, it becomes evident that conducting a rigorously controlled experiment with homogenous plants results in a loss of approximately 90% of the seedlings started with, requiring us to start with at least 10 times the number of plants needed for final analysis. The reporting of rough estimates of the loss rates at crucial handling steps would speed up protocol adoption in a different lab.

2.2. Robustness and sensitivity to variations in the methodology

Not all parameters in our protocol are expected to have a significant effect on the phenotype that we are observing. On the other hand, there will also be factors in our protocol that have a major effect on the plant’s response to the treatment. While it may be impossible to determine the impact of all aspects of our protocol, it may still be helpful to report for those aspects that were varied whether they influenced the observed phenotype or whether experimental outcomes were found to be robust against these variations.

To illustrate this, we will describe our experience with the split-root protocol, showing some examples of conditions that were found to have a significant effect on outcomes as well as some of them that appeared to have limited impact on the phenotypes of interest.

2.3. Robustness to variation in protocol parameters

2.3.1. Seedling age during treatment

As shown in Table 1, split-root protocols differ in the number of days at which the main root is cut, the recovery time after the cut before being transferred to a split-root medium, and even the number of transfer steps. Combined this has a substantial effect on the age and thereby the size of the seedlings at the time they are exposed to heterogeneous nitrate conditions. A lower seedling age is advantageous both in terms of protocol duration and in efficiency of root system phenotyping, as for older plants root systems become increasingly large and complex and therefore harder to phenotype. However, plant age may also impact the responses that are being studied.

We compared experimental outcomes for two different seedling ages. In the first case, we let seedlings grow for 10 days after germination (DAG), after which they were cut below the 2nd visible LR and left to recover for 5 days before exposing them to the split-root treatment. In the second case, we cut the main root of 7 DAG seedlings 0.4 cm below the hypocotyl-radicle junction point and let them recover for 3 days before exposing them to split-root treatment. Note that the rationale for now cutting below a certain distance rather than below the first two visible LRs is that, at this earlier age, two laterals are not generally visible yet. The distance of 0.4 cm was derived from observations at later time points in which results for different cutting distances were compared, indicating that in most plants two LRs would form within this 0.4 cm distance. Both younger and older seedlings were exposed to 5 days of split-root treatment before measuring the RSA.

These differences in the duration of protocol steps enabled us to compare the responses of 20 DAG or 15 DAG plants (Figure 3a,b). The average length of the LRs and the summed total length of LRs were all significantly larger in older plants, yet the responses to the treatments were highly similar in both experiments (Figure 3). Both 15 DAG (Figure 3c, Supplementary Table 1) and 20 DAG (Figure 3e, Supplementary Table 1) plants showed the same trend in total LR length as we discussed before for the overall foraging versus systemic repression (LNLN>HNHN), and supply and demand responses (HNln>HNHN, LNhn<LNLN). The same pattern was also obtained for both plant ages when instead of the total LR length, the average LR length was analyzed (Figure 3d,e, Supplementary Table 1). This implies that the bigger total LR length observed under e.g. LNLN versus HNHN or HNln versus LNhn, was not mainly due to increased LR number but at least partly arises from individual longer roots. These results demonstrate that seedling age, at least within the range tested here, does not substantially affect the nitrogen-foraging phenotypes of interest. While trends are the same, not all comparisons show similar significance and there were fewer significant differences for the 20 DAG. There may be a difference in power of the assay depending on the length of the protocol, but establishing such a difference with certainty would require additional repetitions beyond the scope of this manuscript.

Figure 3. Comparison of Arabidopsis split-root phenotypes for different nitrogen availabilities in a long (20 DAG) and a short (15 DAG) protocol. Arabidopsis thaliana root system response after 5 days of treatment in split-root conditions in either 15-day-old seedlings (a-c-d) or in 20-day-old seedlings (b-e-f). Seedlings were grown in d-root systems the entire treatment and the cut was performed with a razor blade. Typical 10 DAG seedling at treatment day 0 in the 7+3 setup (a). Typical 15 DAG seedling at treatment day 0 in the 10+5 setup (b). LRs responses to different nitrate provision conditions: HNHN, HNLN, and LNLN, where we used for HN the 10 mM KNO3 and for LN0.2 mM KNO3 The summed length of the LRs of each half of each individual plant was measured (c, e), and the average length of the LRs of each half of each plant (d-f). The traits were measured for the total section of the system. Boxplot displays the median of each group (n = 26-55 roots) bounded by the first and third quartiles. Asterisks indicate the significance in the comparison of two nitrogen treatments: * P < 0.05; ** P < 0.01; *** P<0.001. The HNln vs LNhn split-root treatment was compared using a Wilcoxon test and the rest of the comparisons were performed using a Mann–Whitney test.

Notably, when instead number of LRs was compared between the two protocols there was only a preferential foraging phenotype for the short but not the longer protocol (Supplementary Figure S1). Considering that the whole root system was analyzed, the lack of difference in LR number may be caused by the fact that for the long protocol, the old part (where the primordia were mostly pre-set before the treatment) is outweighing the effect of the new part (where the primordia are being established in the context of the split-root treatment). For the short protocol, due to the shorter main root at the start of the nutrient experiment, the contribution from the new part of the root is larger.

2.3.2. Measurement section of the root system

Different studies use different sections of the root for analysis (Mounier et al., Reference Mounier, Pervent, Ljung, Gojon and Nacry2014; Remans et al., Reference Remans, Nacry, Pervent, Filleur, Diatloff, Mounier, Tillard, Forde and Gojon2006; Ruffel et al., Reference Ruffel, Krouk, Ristova, Shasha, Birnbaum and Coruzzi2011). For some root traits, this may have biological relevance. For example, counting LR number/primordia in a section of the root that existed before the start of treatment is not very informative in studying a treatment response, as most root primordia in that section likely started to form prior to the treatment. In such a case, focusing on a newly formed primary root section might be more appropriate. However, this may not necessarily be the case for all root traits investigated. For example, LR length could potentially respond to nitrate, irrespective of whether these LRs arose from pre-existing or newly formed roots/primordia.

In light of these considerations about pre-existing versus newly formed tissues, we reanalyzed the nitrogen split-root 7+3 day experiment while considering three different root system compartments: the overall main root that is in the treatment (total root), the new part of the main root that grew during the treatment (new root), and the old part of the main root that grew before the transfer to the treatment (old root) (Supplementary Figure S2).

Focusing first on total LR length, in both the root segment and the whole root analyzed, we observed a preferential LR growth in HNln relative to the LNhn part of the same plate (Figure 4ac, Supplementary Table 2). Additionally, LNhn roots had less LR growth than LNLN roots in all segments, and HNln roots had a longer total LR system than HNHN roots in the new root. Finally, LNLN had a larger root system than HNHN roots in all root segments, showing the well-known foraging response of the plant in poor nitrogen conditions. Thus, for total LR length, this set of phenotypic responses is robust to variations in the part of the root system being used for the analysis. Similarly, LR density ( Supplementary Figure S3) shows a robust phenotype in the three sections as well, enabling the observation of foraging responses and a preferential phenotype in all 3 sections.

Figure 4. Comparison of Arabidopsis split-root root phenotypes for different nitrogen availability in different sections of the root. Arabidopsis thaliana root system response in 15 DAG old seedlings after 5 days of treatment in split-root conditions. Seedlings were grown in d-root systems the entire treatment and the cut was performed with a razor blade. RSA traits were measured in different root sections: New section, the part of the main root that grows in the treatment (a, d, g); Old section, the part of the main root that was already developed before the exposure to treatment (b, e, h). Total section, the whole root (c, f, i). RSA traits measured are the summed length of LRs (a–c), the average LR length (d–f), and the total number of LRs (g–i). The Boxplot displays the median of each group (n = 26–55 roots) bounded by the first and third quartile with individual outliers identified by the Tukey test marked as points. Asterisks indicate the significance in the comparison of two nitrogen treatments: * P < 0.05; ** P < 0.01; *** P < 0.001. The HNln vs LNhn split-root treatment was compared using a Wilcoxon test and the rest of the comparisons were performed using a Mann–Whitney test.

Next, we focused on average LR length. The results demonstrated a consistent phenotype for the old and total root section for both the foraging phenotype (LNLN>HNHN) and the preferential phenotype (HNln> LNhn) (Figure 4 e,f, Supplementary Table 2). However, there was no clear foraging response for the average LR length in the new section (Figure 4d). This lack of response may be attributed to the limited time available for new LRs in LNLN to achieve length differences. We hypothesize that with prolonged growth, the LRs in the new section will eventually also display significant size differences.

As expected, LR numbers (Figure 4gi, Supplementary Table 2) showed a significant preferential root foraging phenotype in all three sections of the roots (HNln >LNhn), yet the foraging phenotype (LNLN>HNHN) could only be observed for the new part of the root. We hypothesize that primordia in the old part at least partly formed prior to the treatment start, causing reduced responsiveness in terms of LR number. As for the lack of difference in LR number in the total section, we conclude that the lack of differences in the older part outweighs the differences observed in the new part of the root.

Summarizing, when focusing on LR length or LR density, results are robust to variations in the root system segment being used for analysis, while for average LR length, we found clearest responses in old root sections and total root, and for LR numbers instead we observed clearest responses in new root sections. Moreover, if only the split-root heterogeneous response is analyzed the preferential growth in the HN side compared to the LN side is prevalent in all the sections and root traits.

2.4. Sensitivity to variation in protocol parameters

2.4.1. Light exposure of roots

It has been demonstrated that some previously reported RSA responses may not represent a plant’s ‘natural’ response but may rather be an artifact of the experimental setup, e.g. the light exposure of roots in many plate assays (Silva-Navas et al., Reference Silva-Navas, Moreno-Risueno, Manzano, Pallero-Baena, Navarro-Neila, Téllez-Robledo, Garcia-Mina, Baigorri, Gallego and del Pozo2015). Using the so-called D-root (dark-grown root) system, Yokawa et al. (Reference Yokawa, Fasano, Kagenishi and Baluška2014) reported an enhanced halotropic response to moderate-strength salt gradients, indicating repression of this response in light-exposed roots. In contrast, Silva-Navas et al. (Reference Silva-Navas, Conesa, Saez, Navarro-Neila, Garcia-Mina, Zamarreño, Baigorri, Swarup and del Pozo2019) reported that RSA responses to phosphate starvation, increase of LR density, and severe shortening of MR are significantly stronger in light versus dark-grown roots. Thus, it is important to describe root light conditions when writing methods or protocols.

To the best of our knowledge, agar plate split-root experiments for Arabidopsis have thus far all been performed on light-exposed roots, with the experiments we described in this article thus far being the exception. To investigate whether preferential investment in HNln versus LNhn or the foraging response in LNLN versus HNHN depend on root light exposure, we compared split-root nitrogen phenotypes between experiments with the roots being covered with a D-root system and experiments where the roots were fully exposed to white light conditions (150 μmol/m2s light intensity in a 16/8 light/dark growth chamber). We studied these responses in the new parts of the root system.

We observed that the preferential foraging response (HNln > LNhn) is independent of root light exposure, and can be observed for LR length, number, and density (Figure 5, Supplementary Figure S4, and Supplementary Table 3). These findings agree with previous results showing qualitatively similar preferential foraging responses to heterogeneous nitrate in light-grown Arabidopsis plants on agar and soil-grown maize plants (Remans et al., Reference Remans, Nacry, Pervent, Filleur, Diatloff, Mounier, Tillard, Forde and Gojon2006; Yu et al., Reference Yu, White, Hochholdinger and Li2014).

Figure 5. Comparison of Arabidopsis split-root phenotypes for different nitrogen availability with the roots covered or exposed to light. Arabidopsis thaliana 15 DAG old root system response after 5 days of treatment in split-root conditions in either seedlings with roots exposed to light (non-covered) (a, c) or seedlings with roots under D-root system (b, d). Seedlings were cut with a razor blade when the protocol was performed. Different RSA traits measured were the summed length of all LRs (a, b), and the number of the LRs in each plant (c, d). The traits were measured for the new section of the root system. Boxplot displays the median of each group (n = 15-30 roots) bounded by the first and third quartile with individual outliers identified by the Tukey test marked as points. Asterisks indicate the significance in the comparison of the two nitrogen treatments: * P < 0.05; ** P < 0.01; *** P < 0.001. The HNln vs LNhn split-root treatment was compared using a Wilcoxon test and the rest of the comparisons were performed using a Mann–Whitney test.

Comparing root growth under HNHN vs LNLN in dark-grown conditions to light-grown conditions, significant differences can be observed. In dark-grown roots, our results suggest a foraging response (LNLN > LNhn and HNHN) for total lateral root length, number, and density (Figure 5b,d, Supplementary Figure S4, and Supplementary Table 3). This response is expected because, in the latter two conditions, systemic nitrate levels are higher, reducing investments in root growth. In contrast, for light-exposed roots, this foraging response was no longer observed. Instead, LNLN LR length, number, and density were reduced rather than increased compared to HNln, LNhn, and HNHN conditions (Figure 5a,c, Supplementary Figure S4, and Supplementary Table 3). Thus, our experimental outcomes are partly sensitive to whether the root system is exposed or covered from light.

Furthermore, when comparing shoots of light-exposed plants for LNLN versus HNLN or HNHN plants, we saw a highly reduced rosette area in the LNLN plants (Supplementary Figure S5). While we have not investigated this in further detail, these observations suggest that the combined stress of low nitrate provision and light-exposed roots resulted in starvation rather than a foraging response, i.e. reduced instead of increased growth. The fact that this did not occur in previous studies with light-grown roots may arise from these experiments having been optimized for light conditions, e.g. through the addition of sucrose, differences in prior growth conditions, or intermediate steps, whereas we optimized our protocol for dark-grown roots.

2.4.2. Cutting instrument

While describing the machines used for PCR or microscopy, simpler tools used in our protocols are often given limited consideration when writing methods sections. One commonly overlooked tool is the instrument employed for cutting the plants in protocols involving e.g. split-root, grafting, or root tip regeneration. Cutting may be performed either with razor blades or a scalpel. Cutting away plant parts inflicts an injury, resulting in the activation of various stress signalling pathways (Acosta et al., Reference Acosta, Gasperini, Chételat, Stolz, Santuari and Farmer2013; Durgaprasad et al., Reference Durgaprasad, Roy, M, A. V., Kareem, Raj, Willemsen, Mähönen, Scheres and Prasad2019). However, different forms of damage elicit different responses. For instance, Durgaprasad et al. (Reference Durgaprasad, Roy, M, A. V., Kareem, Raj, Willemsen, Mähönen, Scheres and Prasad2019) discussed that in the context of root tip regeneration cutting approaches resulting in sharp cuts should be preferred, as blunter cuts resulted in more damage and failure of regeneration.

In our split-root experiments, we frequently observed the formation of AR at the site where the main root was cut. In A. thaliana, ARs whose development involves the WUSCHEL-RELATED HOMEOBOX11 (WOX11) regulatory pathway, have been previously shown to be induced as a response to wounding and callus formation in the roots (Ikeuchi et al., Reference Ikeuchi, Favero, Sakamoto, Iwase, Coleman, Rymen and Sugimoto2019; Sheng et al., Reference Sheng, Hu, Du, Zhang, Huang, Scheres and Xu2017). AR formation interferes with the generation of a symmetric split-root system of shared ontogeny needed for our experiments. Therefore, we decided to investigate the impact of cutting instruments on the frequency of AR formation. As using a scalpel frequently resulted in the remaining main root stumps being pressed into the agar, for a subset of plants cut with a scalpel we decided to use tweezers to put the root stump back on top of the agar.

We found that the cutting instrument and approach significantly influenced the percentage of AR formation at the wound site 3 days after cutting (Figure 6). While using a razor blade may appear to be a ‘cleaner’ method of making a cut due to its sharpness and precision, it promoted a higher AR percentage. We hypothesize that the different tools used for cutting result in varying degrees of damage to the roots, with less damage leading to more frequent AR formation. Indeed, the lowest frequency of AR formation occurred after cutting with a scalpel and manipulating the root with tweezers. Since we aimed to avoid AR formation, we decided to cut our roots with a scalpel and lift the remaining main root stump with a tweezer for our final protocol as well as in the experiments shown in Figures 35 and discussed in the previous sections.

Figure 6. The percentage of plants with at least one adventitious root appearing at the location of the cut 3 days after cutting. Arabidopsis thaliana seedlings were grown for 7 days in d-root system before the cut. Each dot represents the mean of an independent replicate, where each independent replicate consists of n = 10 plates with 6-7 plants per plate. First the mean was computed per plate, after which the mean of these means was computed over the 10 plates making up a single replicate. Lines show the mean of the group. The significance was calculated using chi-square test. *** P<0.001.

3. Discussion

In this article, we have explored the differences between replicability, reproducibility, and robustness. As experimental scientists, we aim to observe quantitatively and statistically similar effects when we replicate the same experiment (i.e. replicability) (Barba, Reference Barba2018), with the possibility of full reproducibility (quantitatively identical results) reserved for computational analyses. However, when reporting experimental results, we place less emphasis on the robustness of our outcomes. Yet, reporting on the robustness or sensitivity of our outcomes to variation in the different relevant parameters is a well-established and expected practice in other domains, such as computational science (Confalonieri et al., Reference Confalonieri, Bregaglio and Acutis2010; Grimm & Berger, Reference Grimm and Berger2016). Communicating also in the experimental sciences which parameters of our protocols required optimization, and which were simply not varied would be very useful to fellow scientists interested in adopting the protocol. In addition, factors like seed quality (including mother plant growth conditions), germination rates and conditions, and further details such as light quality (not just intensity), etc. will all have a potentially massive impact on the protocols but generally are not detailed in our articles. It would not just save time, but it would also provide an understanding of which parameters interact with the outcomes of the experiments or the functioning of a protocol. We acknowledge that in collaborations with commercial parties, the strive for maximal transparency may collide with the request for confidentiality privileges by companies. We urge journals to take a leading role in this matter, by expanding their requirements for reporting experimental methods.

Here we used Arabidopsis split-root nitrate foraging assays on plates as a case study on replicability and robustness of complex multi-step plant science protocols. Different variants of this assay have been documented in numerous articles over the past 15 years, with relatively similar outcomes of roots showing preferential foraging in the high nitrate compartment under heterogeneous conditions. Nonetheless, there is considerable diversity among split-root protocols associated with various protocol parameters, such as number of transfer steps, plant age, light intensity, and sucrose concentration in the medium (Table 1). It is currently unclear whether all these protocols are able to reproduce the complete range of root system responses presented in Ruffel et al., Reference Ruffel, Krouk, Ristova, Shasha, Birnbaum and Coruzzi2011. This highlights the importance of understanding and reporting on the possible extent to which outcomes are robust or sensitive to changes in protocol parameters.

The current study identified two parameters to which experimental outcomes are robust. The first is the root system compartment used in measurements, total, newly grown, or previously formed. We demonstrated that for the total and average of theLRs length, the outcome is independent of the section we examined (Figure 4). As expected, for the LRs number, which logically can mostly be affected in the newly growing root section after transfer to the split medium, outcomes when using the new section or overall are different from those using the old section. We recommend taking these findings into account, adjusting the root section measure to the type of root phenotype (LR length or rather number) one aims to investigate. The second parameter to which our experimental findings were found to be robust is the age of the plant during the treatment. In the existing literature, plant age at the end of the treatment ranges from 14 DAG (Remans et al., Reference Remans, Nacry, Pervent, Filleur, Diatloff, Mounier, Tillard, Forde and Gojon2006), 15-20 DAG (Girin et al., Reference Girin, El-Kafafi, Widiez, Erban, Hubberten, Kopka, Hoefgen, Gojon and Lepetit2010; Mounier et al., Reference Mounier, Pervent, Ljung, Gojon and Nacry2014; Ohkubo et al., Reference Ohkubo, Tanaka, Tabata, Ogawa-Ohnishi and Matsubayashi2017; Tabata et al., Reference Tabata, Sumida, Yoshii, Ohyama, Shinohara and Matsubayashi2014), to a maximum of 23 DAG (Poitout et al., Reference Poitout, Crabos, Petřík, Novák, Krouk, Lacombe and Ruffel2018; Ruffel et al., Reference Ruffel, Krouk, Ristova, Shasha, Birnbaum and Coruzzi2011). Exposing 15 and 20 DAG seedlings to the same treatment, we demonstrated that plant nitrogen foraging response is robust against changes in seedling age in our protocol (Figure 3).

In contrast to these parameters against which our experimental outcomes proved robust, we also identified parameters that did significantly affect outcomes. First, when roots were exposed to light instead of being covered by a D-root system, the foraging phenotype (where low-nitrogen-grown LRs are more and/or longer than high-nitrogen-grown LRs) observed in the new section of the root grown under homogenous low-nitrogen was lost (Figure 5). Even though in most split-root assays the root systems of plants are exposed to light, we developed the protocol using a D-root system, and it is possible that some of the optimizations are sub-optimal under conditions where roots are exposed to light. We currently do not know which of the steps of the protocol would be light-sensitive, or if light effects are dependent on the light source, light composition, photoperiod, and/or intensity. What is clear from our results is that the optimization done using the D-root system resulted in the loss of the foraging phenotype when the roots were exposed to light. As a second example of a parameter change leading to non-robust outcomes, we demonstrated that the cutting instrument, which is rarely mentioned in protocols, can have a significant effect (Figure 6). This highlights the importance of not only mentioning details of the major equipment like microscope type but also of the simple instruments used when writing protocols.

Based on the above discussion, we propose implementing more complete reporting conventions when describing experimental protocols, especially regarding experimental parameters that have been varied, and either required optimization or rather were found to have limited effect on obtaining the reported results. An example of a potential tool for writing experimental methodology would be a GitHub-type portal format. Just like the actual GitHub portal, which serves for storing codes and their versioning, in this portal, protocols can be detailed with thorough descriptions provided of any steps or parameters that were adjusted. Being interactive, this portal enables others to review, comment, and modify protocol steps, while maintaining a trackable record. In a separate section, peers can upload modifications they applied and notes on whether those modifications produced robust outcomes or not. This would pave the way to more collaborative and transparent scientific reporting within the research community and would also enhance the efficiency of studies that follow up on published work, which is the essence of the scientific process.

Open peer review

To view the open peer review materials for this article, please visit http://doi.org/10.1017/qpb.2025.10004.

Supplementary material

The supplementary material for this article can be found at http://doi.org/10.1017/qpb.2025.10004.

Data availability statement

The data that supports the findings of this study are available in the supplementary material of this article https://doi.org/10.5281/zenodo.14010234.

Acknowledgements

We would like to thank Pierre Gautrat for guidance in experimental setup and fruitful discussions, Sara Buti and Muthanna Biddanda Devaiah for the technical advice and support, Benjamin Planterose Jimenez for his advice on statistics, and Otto van de Beek and his workshop at Utrecht University for the construction of the guillotine device. We acknowledge the use of Biorender, and Adobe Photoshop (version 25-12-0) for the creation of the graphic abstract and the use of Grammarly (version 1.87.1) and ChatGPT-4 for checking grammar and sentence structures.

Author contributions

K. ten Tusscher, L. Salvatore and R. Pierik conceived and designed the study. L. Salvatore conducted data gathering and performed statistical analyses. L. Salvatore, K. Kajala, K and ten Tusscher wrote the article. R. Pierik provided feedback on the article.

Funding statement

LS and KtT are supported by a VICI grant (VI.C.202.011 12268) and KK by a VIDI grant (VI.Vidi.193.104), both awarded by the Netherlands Organization for Scientific Research (NWO).

Competing interest

The authors declare none.

Statistical analysis

To avoid dependent data in the homogenous treatments (HNHN and LNLN) we took an average of the measurements of the paired roots. To determine the distribution of our data we used either a Shapiro-Wilk test, for datasets smaller than n = 50, or a D’Agostino-Pearson test for bigger datasets. As we couldn’t prove normal distribution we opted to use non-parametric test to evaluate the significance of our comparisons. Most of the comparisons were evaluated with a Mann-Whitney test, except for the comparison between HNln vs LNhn, where due to the data being paired, we used a Wilcoxon test.

Footnotes

Associate Editor: Dr. Christian Fleck

References

Acosta, I. F., Gasperini, D., Chételat, A., Stolz, S., Santuari, L., & Farmer, E. E. (2013). Role of NINJA in root jasmonate signaling. Proceedings of the National Academy of Sciences, 110(38), 1547315478. https://doi.org/10.1073/pnas.1307910110 CrossRefGoogle ScholarPubMed
Baker, M. (2016). 1500 scientists lift the lid on reproducibility. Nature, 533(7604), 452454. https://doi.org/10.1038/533452a CrossRefGoogle Scholar
Barba, L. A. (2018). Terminologies for reproducible research (arXiv:1802.03311). arXiv. https://doi.org/10.48550/arXiv.1802.03311.CrossRefGoogle Scholar
Bjornson, M., Song, X., Dandekar, A., Franz, A., Drakakaki, G., & Dehesh, K. (2015). A chemical genetic screening procedure for arabidopsis thaliana seedlings. Bio-Protocol, 5(13). https://doi.org/10.21769/BioProtoc.1519 CrossRefGoogle ScholarPubMed
Confalonieri, R., Bregaglio, S., & Acutis, M. (2010). A proposal of an indicator for quantifying model robustness based on the relationship between variability of errors and of explored conditions. Ecological Modelling, 221(6), 960964. https://doi.org/10.1016/j.ecolmodel.2009.12.003 CrossRefGoogle Scholar
Durgaprasad, K., Roy, M. V., M, A. V., , Kareem, A., Raj, K., Willemsen, V., Mähönen, A. P., Scheres, B., & Prasad, K. (2019). Gradient expression of transcription factor imposes a boundary on organ regeneration potential in plants. Cell Reports, 29(2), 453, e3–463. https://doi.org/10.1016/j.celrep.2019.08.099 CrossRefGoogle ScholarPubMed
Giehl, R. F. H., & von Wirén, N. (2014). Root nutrient foraging. Plant Physiology, 166(2), 509517. https://doi.org/10.1104/pp.114.245225 CrossRefGoogle ScholarPubMed
Girin, T., El-Kafafi, E.-S., Widiez, T., Erban, A., Hubberten, H.-M., Kopka, J., Hoefgen, R., Gojon, A., & Lepetit, M. (2010). Identification of Arabidopsis mutants impaired in the systemic regulation of root nitrate uptake by the nitrogen status of the plant. Plant Physiology, 153(3), 12501260. https://doi.org/10.1104/pp.110.157354 CrossRefGoogle ScholarPubMed
Greenwood, M., Hall, A. J. W., & Locke, J. C. W. (2022). High spatial resolution luciferase imaging of the Arabidopsis thaliana circadian clock. In Staiger, D., Davis, S., & Davis, A. M. (Eds.), Plant circadian networks: Methods and protocols (pp. 4755). Springer. https://doi.org/10.1007/978-1-0716-1912-4_4.CrossRefGoogle Scholar
Grimm, V., & Berger, U. (2016). Robustness analysis: Deconstructing computational models for ecological theory and applications. Ecological Modelling, 326, 162167. https://doi.org/10.1016/j.ecolmodel.2015.07.018 CrossRefGoogle Scholar
Gruber, B. D., Giehl, R. F. H., Friedel, S., & von Wirén, N. (2013). Plasticity of the Arabidopsis root system under nutrient deficiencies. Plant Physiology, 163(1), 161179. https://doi.org/10.1104/pp.113.218453 CrossRefGoogle ScholarPubMed
Hackett, C. (1972). A method of applying nutrients locally to roots under controlled conditions, and some morphological effects of locally applied nitrate on the branching of wheat roots. Australian Journal of Biological Sciences, 25(6), 11691180. https://doi.org/10.1071/bi9721169 CrossRefGoogle Scholar
Hicks, D. J. (2023). Open science, the replication crisis, and environmental public health. Accountability in Research, 30(1), 3462. https://doi.org/10.1080/08989621.2021.1962713 CrossRefGoogle ScholarPubMed
Ikeuchi, M., Favero, D. S., Sakamoto, Y., Iwase, A., Coleman, D., Rymen, B., & Sugimoto, K. (2019). Molecular mechanisms of plant regeneration. Annual Review of Plant Biology, 70, 377406. https://doi.org/10.1146/annurev-arplant-050718-100434 CrossRefGoogle ScholarPubMed
Jacobsen, A., de Miranda Azevedo, R., Juty, N., Batista, D., Coles, S., Cornet, R., Courtot, M., Crosas, M., Dumontier, M., Evelo, C. T., Goble, C., Guizzardi, G., Hansen, K. K., Hasnain, A., Hettne, K., Heringa, J., Hooft, R. W. W., Imming, M., Jeffery, K. G., & Schultes, E. (2020). FAIR principles: Interpretations and implementation considerations. Data Intelligence, 2(1–2), 1029. https://doi.org/10.1162/dint_r_00024 CrossRefGoogle Scholar
Kassaw, T. K., & Frugoli, J. A. (2012). Simple and efficient methods to generate split roots and grafted plants useful for long-distance signaling studies in Medicago truncatula and other small plants. Plant Methods, 8(1), 38. https://doi.org/10.1186/1746-4811-8-38 CrossRefGoogle ScholarPubMed
Kitano, H. (2004). Biological robustness. Nature Reviews Genetics, 5(11), 826837. https://doi.org/10.1038/nrg1471 CrossRefGoogle ScholarPubMed
Mathew, M. M., Shanmukhan, A. P., Varapparambath, V., & Prasad, K. (2023). Protocol for real-time imaging, polar protein quantification, and targeted laser ablation of regenerating shoot progenitors in Arabidopsis . STAR Protocols, 4(2), 102184. https://doi.org/10.1016/j.xpro.2023.102184 CrossRefGoogle ScholarPubMed
Mounier, E., Pervent, M., Ljung, K., Gojon, A., & Nacry, P. (2014). Auxin-mediated nitrate signalling by NRT1.1 participates in the adaptive response of Arabidopsis root architecture to the spatial heterogeneity of nitrate availability. Plant, Cell & Environment, 37(1), 162174. https://doi.org/10.1111/pce.12143 CrossRefGoogle Scholar
Nosek, B. A., Hardwicke, T. E., Moshontz, H., Allard, A., Corker, K. S., Dreber, A., Fidler, F., Hilgard, J., Struhl, M. K., Nuijten, M. B., Rohrer, J. M., Romero, F., Scheel, A. M., Scherer, L. D., Schönbrodt, F. D., & Vazire, S. (2022). Replicability, robustness, and reproducibility in psychological science. Annual Review of Psychology, 73, 719748. https://doi.org/10.1146/annurev-psych-020821-114157 CrossRefGoogle ScholarPubMed
Ohkubo, Y., Tanaka, M., Tabata, R., Ogawa-Ohnishi, M., & Matsubayashi, Y. (2017). Shoot-to-root mobile polypeptides involved in systemic regulation of nitrogen acquisition. Nature Plants, 3(4), 4. https://doi.org/10.1038/nplants.2017.29 CrossRefGoogle ScholarPubMed
Poitout, A., Crabos, A., Petřík, I., Novák, O., Krouk, G., Lacombe, B., & Ruffel, S. (2018). Responses to systemic nitrogen signaling in Arabidopsis roots involve trans-Zeatin in shoots. The Plant Cell, 30(6), 12431257. https://doi.org/10.1105/tpc.18.00011 CrossRefGoogle ScholarPubMed
Remans, T., Nacry, P., Pervent, M., Filleur, S., Diatloff, E., Mounier, E., Tillard, P., Forde, B. G., & Gojon, A. (2006). The Arabidopsis NRT1.1 transporter participates in the Signaling pathway triggering root colonization of nitrate-rich patches. Proceedings of the National Academy of Sciences of the United States of America, 103(50), 1920619211 10.1073/pnas.0605275103CrossRefGoogle ScholarPubMed
Ruffel, S., Krouk, G., Ristova, D., Shasha, D., Birnbaum, K. D., & Coruzzi, G. M. (2011). Nitrogen economics of root foraging: Transitive closure of the nitrate–cytokinin relay and distinct systemic signaling for N supply vs. demand. Proceedings of the National Academy of Sciences, 108(45), 1852418529. https://doi.org/10.1073/pnas.1108684108 CrossRefGoogle ScholarPubMed
Saiz-Fernández, I., Černý, M., Skalák, J., & Brzobohatý, B. (2021). Split-root systems: Detailed methodology, alternative applications, and implications at leaf proteome level. Plant Methods, 17(1), 7. https://doi.org/10.1186/s13007-020-00706-1 CrossRefGoogle Scholar
Schortemeyer, M., Feil, B., & Stamp, P. (1993). Root morphology and nitrogen uptake of maize simultaneously supplied with ammonium and nitrate in a Split-root system. Annals of Botany, 72(2), 107115. https://doi.org/10.1006/anbo.1993.1087 CrossRefGoogle Scholar
Sheng, L., Hu, X., Du, Y., Zhang, G., Huang, H., Scheres, B., & Xu, L. (2017). Non-canonical WOX11-mediated root branching contributes to plasticity in Arabidopsis root system architecture. Development, 144(17), 31263133. https://doi.org/10.1242/dev.152132 Google ScholarPubMed
Silva-Navas, J., Conesa, C. M., Saez, A., Navarro-Neila, S., Garcia-Mina, J. M., Zamarreño, A. M., Baigorri, R., Swarup, R., & del Pozo, J. C. (2019). Role of cis-zeatin in root responses to phosphate starvation. New Phytologist, 224(1), 242257. https://doi.org/10.1111/nph.16020 CrossRefGoogle ScholarPubMed
Silva-Navas, J., Moreno-Risueno, M. A., Manzano, C., Pallero-Baena, M., Navarro-Neila, S., Téllez-Robledo, B., Garcia-Mina, J. M., Baigorri, R., Gallego, F. J., & del Pozo, J. C. (2015). D-root: A system for cultivating plants with the roots in darkness or under different light conditions. The Plant Journal, 84(1), 244255. https://doi.org/10.1111/tpj.12998 CrossRefGoogle ScholarPubMed
Tabata, R., Sumida, K., Yoshii, T., Ohyama, K., Shinohara, H., & Matsubayashi, Y. (2014). Perception of root-derived peptides by shoot LRR-RKs mediates systemic N-demand signaling. Science, 346(6207), 343346. https://doi.org/10.1126/science.1257800 CrossRefGoogle ScholarPubMed
Torres-Martínez, H. H., Rodríguez-Alonso, G., Shishkova, S., & Dubrovsky, J. G. (2019). Lateral root primordium morphogenesis in angiosperms. Frontiers in Plant Science, 10. https://www.frontiersin.org/articles/10.3389/fpls.2019.00206 10.3389/fpls.2019.00206CrossRefGoogle ScholarPubMed
Varapparambath, V., Mathew, M. M., Shanmukhan, A. P., Radhakrishnan, D., Kareem, A., Verma, S., Ramalho, J. J., Manoj, B., Vellandath, A. R., Aiyaz, M., Radha, R. K., Landge, A. N., Mähönen, A. P., Heisler, M. G., Weijers, D., & Prasad, K. (2022). Mechanical conflict caused by a cell-wall-loosening enzyme activates de novo shoot regeneration. Developmental Cell, 57(17), 2063, e102080. https://doi.org/10.1016/j.devcel.2022.07.017 CrossRefGoogle ScholarPubMed
Wilkinson, M. D., Dumontier, M., Aalbersberg, I., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J.-W., da Silva Santos, L. B., Bourne, P. E., Bouwman, , Brookes, A. J., Clark, T., Crosas, M., Dillo, , Dumon, O., Edmunds, S., Evelo, C. T., Finkers, R., & Mons, B. (2016). The FAIR guiding principles for scientific data management and stewardship. Scientific Data, 3(1), 160018. https://doi.org/10.1038/sdata.2016.18 CrossRefGoogle ScholarPubMed
Xuan, W., Opdenacker, D., Vanneste, S., & Beeckman, T. (2018). Long-term in vivo imaging of luciferase-based reporter gene expression in Arabidopsis roots. In Ristova, D., & Barbez, E. (Eds.), Root development: Methods and protocols (pp. 177190). Springer. https://doi.org/10.1007/978-1-4939-7747-5_13.CrossRefGoogle Scholar
Yokawa, K., Fasano, R., Kagenishi, T., & Baluška, F. (2014). Light as stress factor to plant roots – Case of root halotropism. Frontiers in Plant Science, 5. https://doi.org/10.3389/fpls.2014.00718 CrossRefGoogle ScholarPubMed
Yu, P., White, P. J., Hochholdinger, F., & Li, C. (2014). Phenotypic plasticity of the maize root system in response to heterogeneous nitrogen availability. Planta, 240(4), 667678. https://doi.org/10.1007/s00425-014-2150-y CrossRefGoogle ScholarPubMed
Zhang, H., & Forde, B. G. (1998). An Arabidopsis MADS box gene that controls nutrient-induced changes in root architecture. Science, 279(5349), 407409. https://doi.org/10.1126/science.279.5349.407 CrossRefGoogle ScholarPubMed
Zhang, H., Jennings, A., Barlow, P. W., & Forde, B. G. (1999). Dual pathways for regulation of root branching by nitrate. Proceedings of the National Academy of Sciences, 96(11), 65296534. https://doi.org/10.1073/pnas.96.11.6529 CrossRefGoogle ScholarPubMed
Figure 0

Table 1 Comparison of split-root assays used to investigate nitrogen foraging in Arabidopsis thaliana

Figure 1

Figure 1. Schematic illustration of the Ruffel et al. (2011) results for split-root experiments with different nitrogen concentrations. The cartoon plates represent real plates after a nitrogen split-root assay, and the box plot represents expected results for lateral root (LR) traits such as summed LR length, number and density. The colors surrounding each plant correspond with the colors of the boxplot.

Figure 2

Figure 2. Representative experimental protocol. Typically, some steps are adjusted through repetition to achieve the most replicable outcome in the most efficient way possible. The black typed text represents the original protocol, and the adjustments are visualized as ‘post-it notes’ or red ‘handwriting’ comments and the most important of these additions are discussed in detail in the main text.

Figure 3

Figure 3. Comparison of Arabidopsis split-root phenotypes for different nitrogen availabilities in a long (20 DAG) and a short (15 DAG) protocol. Arabidopsis thaliana root system response after 5 days of treatment in split-root conditions in either 15-day-old seedlings (a-c-d) or in 20-day-old seedlings (b-e-f). Seedlings were grown in d-root systems the entire treatment and the cut was performed with a razor blade. Typical 10 DAG seedling at treatment day 0 in the 7+3 setup (a). Typical 15 DAG seedling at treatment day 0 in the 10+5 setup (b). LRs responses to different nitrate provision conditions: HNHN, HNLN, and LNLN, where we used for HN the 10 mM KNO3 and for LN0.2 mM KNO3 The summed length of the LRs of each half of each individual plant was measured (c, e), and the average length of the LRs of each half of each plant (d-f). The traits were measured for the total section of the system. Boxplot displays the median of each group (n = 26-55 roots) bounded by the first and third quartiles. Asterisks indicate the significance in the comparison of two nitrogen treatments: *P < 0.05; **P < 0.01; ***P<0.001. The HNln vs LNhn split-root treatment was compared using a Wilcoxon test and the rest of the comparisons were performed using a Mann–Whitney test.

Figure 4

Figure 4. Comparison of Arabidopsis split-root root phenotypes for different nitrogen availability in different sections of the root. Arabidopsis thaliana root system response in 15 DAG old seedlings after 5 days of treatment in split-root conditions. Seedlings were grown in d-root systems the entire treatment and the cut was performed with a razor blade. RSA traits were measured in different root sections: New section, the part of the main root that grows in the treatment (a, d, g); Old section, the part of the main root that was already developed before the exposure to treatment (b, e, h). Total section, the whole root (c, f, i). RSA traits measured are the summed length of LRs (a–c), the average LR length (d–f), and the total number of LRs (g–i). The Boxplot displays the median of each group (n = 26–55 roots) bounded by the first and third quartile with individual outliers identified by the Tukey test marked as points. Asterisks indicate the significance in the comparison of two nitrogen treatments: *P < 0.05; **P < 0.01; ***P < 0.001. The HNln vs LNhn split-root treatment was compared using a Wilcoxon test and the rest of the comparisons were performed using a Mann–Whitney test.

Figure 5

Figure 5. Comparison of Arabidopsis split-root phenotypes for different nitrogen availability with the roots covered or exposed to light. Arabidopsis thaliana 15 DAG old root system response after 5 days of treatment in split-root conditions in either seedlings with roots exposed to light (non-covered) (a, c) or seedlings with roots under D-root system (b, d). Seedlings were cut with a razor blade when the protocol was performed. Different RSA traits measured were the summed length of all LRs (a, b), and the number of the LRs in each plant (c, d). The traits were measured for the new section of the root system. Boxplot displays the median of each group (n = 15-30 roots) bounded by the first and third quartile with individual outliers identified by the Tukey test marked as points. Asterisks indicate the significance in the comparison of the two nitrogen treatments: *P < 0.05; **P < 0.01; ***P < 0.001. The HNln vs LNhn split-root treatment was compared using a Wilcoxon test and the rest of the comparisons were performed using a Mann–Whitney test.

Figure 6

Figure 6. The percentage of plants with at least one adventitious root appearing at the location of the cut 3 days after cutting. Arabidopsis thaliana seedlings were grown for 7 days in d-root system before the cut. Each dot represents the mean of an independent replicate, where each independent replicate consists of n = 10 plates with 6-7 plants per plate. First the mean was computed per plate, after which the mean of these means was computed over the 10 plates making up a single replicate. Lines show the mean of the group. The significance was calculated using chi-square test. ***P<0.001.

Supplementary material: File

Salvatore et al. supplementary material

Salvatore et al. supplementary material
Download Salvatore et al. supplementary material(File)
File 5.6 MB

Author comment: Ensuring robustness in scientific research, split-root assays as an example case — R0/PR1

Comments

Dear editor, dear Olivier,

Hereby we would like to submit our article titled “Ensuring Robustness in Scientific Research, Split-root assays as an example case” to be considered for publication in Quantitative Plant Biology.

Our article deals with the problems with replication of outcomes in complex multi-step plant science experiments. As an example case we use Arabidopsis split root assays, describing our journey towards an efficient replicable experimental setup. We discuss how problems with replication relate to a lack of detail in how we typically write our experimental protocols, and how -inspired by robustness/sensitivity analysis approaches in computational biology- we propose to include available knowledge on protocol parameters to which outcomes are or are not found to be robust as relevant information in methods sections. We provide examples of experimental parameters for which changes lead to both robust and variable outcomes for the split root experiments.

We believe that the Journal of Quantitative Biology, with its strong focus on quantitative aspects, FAIR principles, and its wide variety of article formats and openness to different types of approaches would be the perfect fit for our manuscript.

We have previously communicated about this and received confirmation of interest from QPB.

Kind regards,

Lucila Salvatore,

Ronald Pierik

Kaisa Kajala

Kirsten ten Tusscher

Review: Ensuring robustness in scientific research, split-root assays as an example case — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

This was an interesting and enjoyable paper to read.

I couldn’t agree more with the authors' points on scientific reporting.

I hope this manuscript encourages experimental scientists to be more thorough within their reporting. The idea that experimentalists should report on robustness is beautiful, and obviously a good idea once it has been said.

The experimental system chosen to illustrate the authors points was well chosen, simple enough for scientists from all fields to understand, complex enough to be a useful illustration tool.

There are three minor points regarding the manuscript.

1) The significance information in the boxplots in figures 3, 4 and 5 would be clearer in table form. I think the figures do not need to be removed or edited, they are informative. I think the addition of tables to the manuscript showing significance measures is important to enable the reader to follow the discussions, and to show the significance data. The significance tables may be large, if they do not fit in the main manuscript, please put them in the supplementary information and point to them in the manuscript.

2) The sentence on lines 130 to 133 was hard to understand. It is such an important point for the paper that I think the text needs to be rewritten, more clearly.

3) Figure 2 was very attractive, but it took me a while to work out what it was showing. It is a lovely figure which does show what the authors' intent. I think the figure could be explained a little more clearly, all that is needed is for the figure legend to explain that the black typed text is the original protocol (as is described on line 149). Would be nice to tell the reader in the legend that the protocol modifications in the figure are discussed in the following sections.

Review: Ensuring robustness in scientific research, split-root assays as an example case — R0/PR3

Conflict of interest statement

One of the authors works at same university as I do. I did not think about this until I had completed my review. I don’t think it has affected the review in any way.

Comments

In “Ensuring Robustness in Scientific Research, Split-root assays as an example case”, the authors show how a more detailed description of protocols could increase replicability of research and robustness of conclusions. In particular, they plead for including a description of which parts of the protocols have been optimized and which not; which parts of the protocol are critical; which “simple” materials have been used; how much time and how many initial seeds (etc) are needed.

They use the split root assay as an example of a complex protocol with substantial variation within the field, and demonstrate that some factors matter a lot, whereas others can be varied without affecting conclusions.

I think this is an important topic for quantitative (plant) biology. The article is well written. I have overall few comments, except that for a paper that aims at setting a methodological example, the statistical methods are not so well described and may need improvement (see below). The complementary star methods give a good example of a protocol as advocated.

I would like to challenge the authors to think big on what they propose, and see if they can address the following. The split root assays probably are uniquely used in fundamental science, so supporting maximum replicability should always be in the author’s interest. In some cases, however, there may be a conflict of interests in some wanting to exploit advantage of having an optimized protocol in house (e.g., for a recalcitrant species, in a project with a plant breeder on board, etc). Can this work be generalized to think of both (increased) minimum and optimal reporting standards/methods for protocols, or is this only a call to goodwill?

Lines 271-271: “These results demonstrate that seedling age, at least within the range tested here, does not significantly affect the nitrogen-foraging phenotypes of interest.”

Judging from the markers for statistical significance in figure 3, there seems to be a difference. Although visually, the trends look the same for both 15 DAG and 20 DAG, not all comparisons are significantly different at the alpha = 0.05 level for 20 DAG or 15 DAG. This difference is not even acknowledged in the main text, which I think is a bad thing. The authors could note this and state that there might be a difference in power of the assay depending on the length of the protocol, but establishing such a difference with certainty would require additional repetitions beyond the scope of this manuscript.

Statistical testing:

I am highly surprised that paired ratio t-tests and one-way ANOVA are used for the count data (LR number), whereas the non-parameteric Wilcoxon and Kruskal-Wallis tests are used for the continuous (LR length) data. I could see the rationale of doing it the other way around, i.e., considering that count data are by definition not normally distributed (Poisson perhaps, or a different discrete distribution). Was this choice of methods the result of some testing for normality of the distributions (without reporting)? Whatever the motivation was, it would be good to have it reported.

Multiple legends state “the rest of the comparisons were analyzed using one-way ANOVA”. As this ANOVA test only tells whether there is a difference among the groups, but not the pairwise differences, most likely a post hoc test has been used. If yes: which one? If no: what exactly has been tested? If a different one-way ANOVA has been used to compare each pair of (HNHN vs HNln, LNhn vs LNLN, HNHN vs LNLN), than this would be the equivalent of three t-tests. In total, 4 t-tests are then performed on the sample, so some correction for multiple comparisons would need to be applied. It is not reported whether or not this is done.

Similar applies to where Kruskal-Wallis tests are performed.

It is unclear to me how many roots are in the HNHN and LNLN samples. Does n=52-55 refer to the number of plants, or the number of roots? If plants: are the quantities first averaged per plant (left/right), or are the ~100 roots added individually to the plot, or only the left (or right) roots on the plate?

Degrees of freedom are not reported.

Minor comments:

Figure 1: it looks like the plot shows real data (but without a y-axis or -label). Although not important for the story, it would be good to state which data is shown.

Multiple figures: Please label “total LR length” where applicable, to make this immediately obvious (vs average).

Line 268: average LR length is in figure 3D+F.

Figure legends in general: I think it would be a good signal to the readers to add to each legend some information that varied within the manuscript, such as whether data was measured on the new part / total root; roots were grown under light or dark conditions.

Figure S5: image missing/broken.

Recommendation: Ensuring robustness in scientific research, split-root assays as an example case — R0/PR4

Comments

I apologise for the significant time the reviews took. The comments of the reviewers are positive and they acknowledge the significance of the topic. Following the suggestions of the reviewers I recommend Minor Revision. I would urge the authors to be more precise regarding the statistical methods and provide a better justification for the statistical methods applied.

Decision: Ensuring robustness in scientific research, split-root assays as an example case — R0/PR5

Comments

No accompanying comment.

Author comment: Ensuring robustness in scientific research, split-root assays as an example case — R1/PR6

Comments

Dear editor,

We hereby submit a revised version of our manuscript “Ensuring Robustness in Scientific Research, Split-root assays as an example case”. We have provided a detailed list of answers to the individual reviewers comments as a separate document. We would like to thank them for their helpful advice.

We would like to remark that we experienced difficulties uploading our figures in the requested resolutions and therefore decided to upload them for now in a reduced resolution. If these turn out to be of insufficient quality please let us know as we have higher resolution images available.

Kind regards,

Kirsten

Review: Ensuring robustness in scientific research, split-root assays as an example case — R1/PR7

Conflict of interest statement

Reviewer declares none.

Comments

The authors have addressed my comments. Thank you.

However, I do not like the image which has been added to page 1.

Minor revision suggested - but not essential, as it could be a matter of taste.

The AI generated image of a young scientist frowning/sulking while holding two split root plates is cartoon like and clownish. In my opinion the image undermines the scientific importance of the work in the manuscript.

The frown/sulk also gives a negative impression of the work to follow.

Apart from the split root plates the image does not inform the reader about the content of the paper.

I advise that, if an image is needed, it is an informative and positive (emotionally neutral) image is generated.

I’d remove any people from the image, for inclusivity reasons.

Review: Ensuring robustness in scientific research, split-root assays as an example case — R1/PR8

Conflict of interest statement

Reviewer declares none.

Comments

The comments were adequately addressed.

Recommendation: Ensuring robustness in scientific research, split-root assays as an example case — R1/PR9

Comments

The authors are happy with the revision. However, one reviewer comments on the graphical abstract. I agree with this comment and advise the authors to either remove it or replace it by something more neutral and informative.

Decision: Ensuring robustness in scientific research, split-root assays as an example case — R1/PR10

Comments

No accompanying comment.

Author comment: Ensuring robustness in scientific research, split-root assays as an example case — R2/PR11

Comments

Dear editor,

We replaced our graphical abstract, as requested.

We removed the statement of use of ChatGPT as for the new graphical abstract this was no longer the case.

Kind regards,

Kirsten

Recommendation: Ensuring robustness in scientific research, split-root assays as an example case — R2/PR12

Comments

Dear Kirsten,

I find the graphical abstract now more suitable for a scientific publication and I am happy to finally formally accept the manuscript for publication.

Christian

Decision: Ensuring robustness in scientific research, split-root assays as an example case — R2/PR13

Comments

No accompanying comment.