Abstract
Free full text
TissueMiner: A multiscale analysis toolkit to quantify how cellular processes create tissue dynamics
Abstract
Segmentation and tracking of cells in long-term time-lapse experiments has emerged as a powerful method to understand how tissue shape changes emerge from the complex choreography of constituent cells. However, methods to store and interrogate the large datasets produced by these experiments are not widely available. Furthermore, recently developed methods for relating tissue shape changes to cell dynamics have not yet been widely applied by biologists because of their technical complexity. We therefore developed a database format that stores cellular connectivity and geometry information of deforming epithelial tissues, and computational tools to interrogate it and perform multi-scale analysis of morphogenesis. We provide tutorials for this computational framework, called TissueMiner, and demonstrate its capabilities by comparing cell and tissue dynamics in vein and inter-vein subregions of the Drosophila pupal wing. These analyses reveal an unexpected role for convergent extension in shaping wing veins.
eLife digest
Cells interact, divide, rearrange and change shape to build an organ during development. Modern microscopy and computer technology can follow each individual cell of an entire organ in a living organism. However, to understand how the complex choreography of cells leads to well-shaped organs, researchers need tools to help the store and analyze the large amounts of data generated. Tools are also needed to visualize and quantify the complex cell behaviors in an easy and flexible manner.
During its development, a fruit fly’s wings become divided into distinct regions separated by tubular supports called veins. Early on in development, the vein cells are indistinguishable from their neighbors, but at late stages, vein cells become a different shape. Veins also become narrower, which is assumed to be due to the number of vein cells falling. However, the way in which cells behave to bring about these changes has not been studied in detail.
Etournay, Merkel, Popović, Brandl et al. have now developed a toolkit called TissueMiner that enables users to store large amounts of data about cells and analyze how cells collectively shape an organ. TissueMiner was then used to identify vein cells at late stages of wing development and follow them backward in time to reveal their position at early stages. This showed that veins become narrower and more elongated because the cells that make up the veins shrink more than cells in other regions.
TissueMiner was then used to show that vein cells specifically rearrange and elongate to produce thinner regions, while the number of cells increases slightly because the cells divide. These results suggest that the cell behaviors responsible for making veins elongate and narrow are likely to be different from what had previously been assumed.
TissueMiner can be used in future studies to help understand the molecule signals that influence how cells behave in veins during wing development. The toolkit could also now be used to explore the changes involved in the development of other organs in other organisms.
Introduction
Understanding how cells collectively shape a tissue is a long-standing question in developmental biology. We recently addressed this question by analyzing morphogenesis of the Drosophila pupal wing at cellular resolution (Etournay et al., 2015). To understand the cellular contributions to pupal wing shape changes, we quantified the spatial and temporal distribution of both cell state properties (e.g. cell area, shape and packing geometry), as well as dynamic cellular events like rearrangements, divisions, and extrusions. We quantitatively accounted for wing shape changes on the basis of these cellular events. By combining these analyses with mechanical and genetic perturbations, we were able to develop a multiscale physical model for wing morphogenesis and show how the interplay between epithelial stresses and cell dynamics reshapes the pupal wing.
Researchers interested in epithelial dynamics face similar challenges in processing and analyzing time-lapse movie data. Quantifying epithelial dynamics first requires image-processing steps including cell segmentation and tracking, to digitalize the time-lapse information. Recently, software tools for segmentation and tracking have become generally available (Aigouy et al., 2010; Mosaliganti et al., 2012; Sagner et al., 2012; Barbier et al., 2015; Cilla et al., 2015; Wiesmann et al., 2015; Heller et al., 2016; Aigouy et al., 2016). However, more advanced analysis is required to quantify, interpret and visualize the information derived from segmentation and tracking. Epithelial cells share a set of core behaviors, such as division, rearrangement, shape change and extrusion, which underlie a wide variety of morphogenetic events in different tissues. Methods for analyzing these core behaviors have been developed independently in several labs (Blanchard et al., 2009; Bosveld et al., 2012; Etournay et al., 2015; Guirao et al., 2015). However, these analysis tools have not yet been made available to other users in an easy to use and well-documented form.
Here, we propose a generic data layout and a comprehensive and well-documented computational framework called TissueMiner (see Box 1) for the analysis of epithelial dynamics in 2D. It enables biologists and physicists to quantify cell state properties and cell dynamics, their spatial patterns and their time evolution in a fast, easy and flexible way. It also facilitates the comparison of quantities within and between tissues. To make TissueMiner accessible to a novice, we provide tutorials that guide the user through its capabilities in detail and release a workflow that automatically performs most of the analysis and visualization tasks we reported previously for Drosophila pupal wings (Etournay et al., 2015). These tutorials operate using one small example dataset and 3 large wild-type datasets corresponding to the distal wing blade, which we also provide. The code for TissueMiner, along with tutorials and datasets, are publically available (Box 1). We illustrate the utility and power of these tools by performing a more extensive analysis of pupal wing morphogenesis focused on differences in the behavior of vein and inter-vein cells.
Wing veins are specified during larval stages, but only become morphologically distinct during prepupal and pupal morphogenesis. During pupal morphogenesis, the dorsal and ventral surfaces of the wing epithelium become apposed to each other on their basal sides, except in the regions that will give rise to veins - here the basal surfaces of dorsal and ventral cells form a lumen. Vein and inter-vein cells also differ on their apical surfaces. Vein cells have a narrower apical cross-section and form corrugations that protrude from the dorsal and ventral surfaces of the wing blade. The cell dynamics underlying vein morphogenesis have never been quantitatively examined.
Results
We analyze epithelial morphogenesis within TissueMiner in three steps (Figure 1—figure supplement 1). First, all epithelial cells of the tissue are digitalized (segmented) and automatically tracked over time using the interactive TissueAnalyzer software (Aigouy et al., 2010, 2016; Sagner et al., 2012), which is included in the TissueMiner framework. This software generates segmented images, referred to as segmentation masks that contain information about cell geometry, cell neighbor topology and cell ancestry, which are essential for the study of morphogenesis (Aigouy et al., 2010; Sagner et al., 2012; Etournay et al., 2015). Second, we use a TissueMiner automated workflow to extract this information from the images and store it in a relational database. This workflow also automatically performs most of the visualization steps we describe in this paper (Materials and methods, and Appendix 1). Third, we use TissueMiner’s powerful and convenient library of tools for R and Python to query the database to both visualize the data and quantitatively compare cell properties and behaviors between different movies and subregions of the tissue.
Time-lapse datasets are rich with information, and one important set of tools that TissueMiner provides is the ability to visualize this information on the tissue. Such type of visualization can reveal interesting spatial and temporal patterns of core cell behaviors and can guide subsequent analyses. This is, however, insufficient for quantitatively comparing regions within the same tissue or even comparing how the tissue behaves across replicates or various conditions. Therefore, we developed tools to enable the user to define regions of interest, synchronize movies in time, and align all tissues to a common orientation. We then provide tools to easily plot average quantities in different regions or across movies. For each type of measurement, we refer to the tutorials regarding the specific visualization tools we have built (Box 1).
Preparing the dataset (TM R-User Manual sections 1.1 to 1.5)
Before conducting any analysis, the TissueMiner automated workflow reads three configuration files that contain (1) user-defined regions of interest (ROI’s), (2) time offsets for movie synchronization, and (3) the rotation angle used to align the tissue to a standard orientation (Figure 1—figure supplement 1).
Defining regions of interest (howto Video 1)
As cellular behaviors may be spatially patterned, one should have the ability to quantify and compare cell dynamics within different ROI’s. TissueMiner provides a Fiji macro (draw_n_get_ROIcoord.ijm) to manually define a set of ROI’s directly on one given image of the movie. This program manages several lists of ROI’s, which the user can create, modify and delete. These lists help maintain the consistency of ROI labels, which is essential for subsequent analysis (Video 1).
In addition, defining ROI’s of different shapes and following them backwards and forwards in time (Figure 1A–E’) is a useful method to visualize tissue deformations (Figure 1D–D’, Video 2). These ROI’s can be defined at any frame within the movie. Thus, it is even possible to specify a region based on morphological features that only arise late in the morphogentic process under study, which is true of wing veins for example (see Figure 1E–E’). ROI definition allows the user to define morphologically relevant regions of interest and compare the behavior of cells in the different regions.
Video 1.
Video 2.
By default, TissueMiner generates two regions of interest – raw and whole_tissue – in order to select cell populations by name. The raw ROI corresponds to all segmented and tracked cells. However cells located at the tissue margin may move in and out of the field of view of the microscope lens. TissueMiner identifies the population of cells (whole_tissue) whose entire lineage lies within the field of view throughout the movie. To identify this population, we developed a filtering tool to discard in each movie frame margin cells located at the edge of the segmentation mask and one additional row of cells that contact the margin cells. The choice of discarding two rows of cells is motivated by the fact that segmentation quality drops near the margin. We iterate over all time points to ensure that we discard all cells moving in and out the field of view (see Materials and methods). User-defined ROI’s are also subjected to this filtering.
Aligning movies in time
To temporally align movies, TissueMiner provides a configuration file in which to manually define a time correction for each movie relative to one reference movie whose time correction is set to zero. The time correction can be estimated based on the appearance of morphological landmarks, or by aligning curves of a defined state quantity in time, such as cell area or cell elongation, on the assumption that this quantity has a similar qualitative time evolution.
Aligning movie orientation (howto Video 3)
In order to compare replicates of the same dynamic biological process, all movies should have a common orientation. TissueMiner contains a Fiji macro (orient_tissue.ijm) to assist the user in finding the optimal angle through which each movie should be rotated so that all movies have a comparable orientation (see Video 3 for an example on the pupal wing).
Video 3.
Visualizing cell area, cell shape and cell packing on the entire tissue (TM R-User Manual sections 2.2 and 2.6, Py-tutorial sections 2.1 to 2.3)
An important step in analyzing tissue morphogenesis is to quantify cell state properties over time. These properties include cell area, shape anisotropy and packing geometry. In this section, we demonstrate the analysis and visualization tools of TissueMiner by comparing how these state properties evolve during wing morphogenesis in vein and inter-vein regions.
Cell area and elongation (TM R-User Manual sections 2.2 – 2.5, Py-tutorial sections 2.1 – 2.2)
Morphogenesis is often characterized by changes in cell area and elongation. In the TissueMiner workflow, these properties are calculated from the original segmentation masks and stored in the database (Materials and methods). To visualize the evolution of the cell area pattern at the scale of the whole tissue, we map the area values of each individual cell to a gradient color scale (see Figure 2A–A’, Video 4). Each cell contour is filled with a color that corresponds to its area. Figure 2A’ shows the pattern of cell areas in the wing at the end pupal wing blade elongation. This visualization scheme reveals that cells in the proximal hinge and in wing veins have a smaller cross-sectional area (blue) at this time.
Video 4.
Cell elongation is characterized by a nematic tensor describing the axis and magnitude of the elongation (Aigouy et al., 2010). As with cell area, we map the magnitude of cell elongation to a color scale (Figure 2B–B’, Video 5). This fine-grained quantification of cell elongation highlights striking differences between inter-vein and vein cells. Inter-vein cells are more elongated than vein cells at 22hr after puparium formation (hAPF), but this pattern is reversed by 31hAPF.
Video 5.
The color scale above reveals only the magnitude of the tensor. To visualize both the magnitude and direction of cell elongation, we represent the elongation nematic as a line whose length and angle correspond to the magnitude and angle of cell elongation, respectively. Nematics can also be averaged across multiple cells in a region in order to coarse-grain the patterns and highlight the main features (Figure 2C–C’’, Video 6). For example, the coarse-grained elongation nematics shown in Figure 2C, highlight the global alignment of cell elongation in the proximal-distal direction at 22hAPF.
Video 6.
Packing geometry (TM R-User Manual section 2.6, Py-tutorial section 2.3)
Cells in the wing become progressively more hexagonal during pupal wing morphogenesis (Classen et al., 2005). To visualize packing geometry, we map the neighbor number of each cell to a discrete color code (Figure 2D–D’, Video 7). This makes changes in packing geometry during morphogenesis immediately obvious (22and 31hAPF).
Video 7.
Plotting temporal evolution of average cell properties (TM R-User manual sections 3.3 to 3.6, Py-tutorial section 3)
The visualization tools described above effectively reveal detailed spatial patterns of cell properties. To highlight how average cell properties change over time, and to facilitate comparison between movies and ROI’s, TissueMiner also provides tools to create plots of average quantities as a function of time. In Figure 2E and Figure 2F, we compare the time evolution of the average cell area and the average cell elongation in movies of the 3 WT wings (blue, green, red) used in (Etournay et al., 2015). The plots in Figure 2 compare the time evolution of average cell elongation and area values for vein and inter-vein cells. We previously showed that average cell area in the wing blade decreases during morphogenesis, but that cell area decrease is balanced by cell divisions to maintain wing blade area. Quantifying average area values in vein and inter-vein ROI’s reveals that vein cells contract over a longer period of time than inter-vein cells, and thus have a smaller cross-sectional area at the end of morphogenesis (Figure 2F). As previously described, cells in the wing blade elongate and then relax their shapes during pupal wing morphogenesis (Etournay et al., 2015) (Figure 2E, blade part). Plotting elongation in vein and inter-vein ROI’s reveals that vein cells elongate more slowly and also relax their elongation more slowly than inter-vein cells. These differences suggest that vein and inter-vein cells have different mechanical properties.
Visualizing patterns of cell divisions (TM R-User Manual sections 2.7 – 2.9, Py-tutorial section 2.4)
Oriented tissue morphogenesis may reflect the number, orientation and spatio-temporal pattern of cell divisions. TissueMiner provides several tools to visualize these events. Overlaying color-coded generation number on a pupal wing movie reveals patterns of cell divisions as they occur (Video 8), and examining the last frame of the movie (Figure 3A) reveals the cumulative pattern of cell divisions. This analysis is largely consistent with the cell division timing inferred from classical BrdU pulse-chase experiments (Schubiger and Palka, 1987; Garcia-Bellido et al., 1994; Milan et al., 1996), but also reveals unexpected additional features. The pattern of cell divisions correlates with veins: most cells in the wing blade divide only once during pupal morphogenesis, whereas in some parts of inter-vein regions they divide twice. These include the cells lying adjacent to veins L3, L4 and L5, and the region posterior to L5. We estimate the median cell-cycle length between the first and second rounds of cell divisions to be (5.25 ± 1.50) hr.
Video 8.
To further investigate how cell divisions are patterned in the blade, we quantified the time evolution of cell division rates in each vein and inter-vein region (Figure 3B). This analysis reveals differences in the timing and numbers of cell divisions in these different ROI’s. Cells in veins L2 and L4 divide before those in L3 and L5. These divisions are followed by a second peak of division in the inter-vein regions distInterL3-L4, interL2-L3 and postL5 (see cartoon in Figure 3A).
To more easily visualize the spatio-temporal pattern of divisions in veins only, the powerful tools available in TissueMiner allow us to assign vein cells a color corresponding to the time at which they divide: blue for 16–18hAPF and red for 18–20 hAPF (see Video 9). This analysis reveals more detailed patterning in division timing. Cell divisions in vein regions that protrude ventrally (L2 and proximal L4), peak at the same time and earlier than those that protrude dorsally (L3, distal L4 and L5). Precise correlation of cell divisions with specific vein and inter-vein regions suggests that they are autonomously controlled by signaling associated with veins.
Video 9.
To measure the orientation of cell divisions, we define a unit nematic tensor (see Materials and methods). For each cell division, the orientation of this unit nematic is defined by the line connecting the centers of mass of the two daughter cells when they first appear (see Figure 3C–C', and TM R-User Manual section 2.8). Each nematic is assigned a position on the tissue that corresponds to the center of combined mass of the two daughter cells. To visualize division orientation patterns, unit nematics can be added within different regions and averaged over different time intervals (Figure 3D, Video 10, TMR-User Manual section 2.9).
Video 10.
Visualizing cell junction dynamics (TM R-User Manual sections 2.10 – 2.12, 3.8–3.9)
Epithelial tissues can be reshaped by cell rearrangements, or T1 transitions (for review [Walck-Shannon and Hardin, 2014]). In the simplest case, a T1 transition involves two pairs of cells, that exchange neighbors by disassembling one cell-cell contact and replacing it by another – bringing together two previously separated cells (Figure 4A). In reality, cell contacts may undergo multiple rounds of shrinkage and regrowth before resolving (Figure 4A'). Furthermore some epithelia undergo the related process of rosette formation where multiple cell junctions are disassembled before new neighbors are brought into contact. By separately quantifying the orientation with which cell contacts are gained and lost, one can reveal whether there is a net directionality to cell junction assembly and disassembly. To identify gained and lost cell contacts, we compare cell neighbor relationships between 2 subsequent frames. We exclude changes in neighbor relationships resulting from cell division, extrusion or a cell moving in and out of the field of view. The remaining neighbor relationship changes are used to define cell contacts that have appeared or disappeared.
We characterize the orientation of contact gains and losses by assigning them a unit nematic tensor. For contact loss, the orientation of the nematic is defined by the axis intersecting the two cell centers. For contact gain, the orientation of the nematic is perpendicular to the axis intersecting the two cell centers (Figure 4A–A’). If there is a simple disappearance and reappearance of a single cell contact, corresponding nematics will cancel out. Therefore, the sum of contact gain and contact loss nematics over time and/or space will represent an effective T1 nematic describing net direction of contact assembly/disassembly.
The rate of contact gain and loss can be visualized in different ways. Cell contact dynamics can be viewed directly on movies of tissue morphogenesis by assigning colors to cells as they gain (red) or lose (green) contacts. Those cells that simultaneously gain and lose different cell contacts are colored blue (Figure 4B–B’).
The frequency of contact gain and loss, independent of orientation, can be plotted over time. Figure 4C compares the frequency of contact assembly/disassembly in vein and inter-vein regions. In both regions, this rate begins to decrease in the second half of morphogenesis.
To visualize the pattern of orientation of T1 transitions throughout the wing, we sum contact gain and loss nematics over square grid elements, and average over a chosen time window (about 50 min in Figure 4D, Video 11, see TM R-User Manual section 2.12).
Video 11.
Finally, the average orientation of effective T1 nematics in sub-regions over time can be visualized using circular diagrams, where nematics are color-coded to indicate developmental time. Figure 4—figure supplement 1A reveals that the orientation of effective T1’s is along the anterior-posterior (AP) axis early (blue) and shifts to the proximal-distal (PD) axis in the second half of morphogenesis (red). A similar approach can be used to illustrate average cell elongation nematics over time (Figure 4—figure supplement 1B).
Quantification of tissue deformation and the contribution of different cellular events (TM R-user manual section 3.10)
While it is useful to quantify the number and orientation of cellular events like elongation, rearrangement, extrusion and division, this by itself does not provide quantitative information about the amount of tissue shape change contributed by each type of event. We therefore devised a method to measure deformation caused by these cellular processes such that they sum to the measured tissue deformation.
Tissue deformation can be decomposed into isotropic and anisotropic parts that distinguish changes in area (compression/expansion) from changes in aspect ratio (pure shear, for details see also Materials and methods). The quantities describing area changes are scalar, whereas the quantities describing shear rate in a 2D-network are nematic tensors harboring two distinct components that describe the orientation and magnitude of the shear.
Tissue area changes can be calculated based on cell area change and the number of cells gained and lost by divisions and extrusions – information that is all available in the TissueMiner database (Etournay et al., 2015).
To quantify the cellular contributions to anisotropic tissue deformation, TissueMiner uses the so-called Triangle Method, which is based on a triangular tiling of the junctional network (Etournay et al., 2015; Merkel et al., 2016). Triangle elongation is a proxy for cell elongation, and topological changes in the network result in redrawing of triangles (Figure 5A–C). The resulting change in average triangle elongation can be used to calculate the shear due to the topological changes (Etournay et al., 2015). In addition to contributions from divisions, cell rearrangements, extrusions and cell shape changes, the method also takes into account deformation caused by correlations between elongation and both area change and rotation.
Validation of tissue deformation measurements using computer-generated cells
To test the reliability of TissueMiner in calculating large cell and tissue deformations, we created two computer-generated movies of hexagonal cells sheets (Videos 12, 13). In one movie, we imposed a constant isotropic expansion rate of 3.50 10−2 per frame, without any anisotropic deformation. In the second movie, we imposed a constant pure shear along the x-axis with a rate of 1.75 10−2 per frame, and without any isotropic expansion. The amounts of isotropic expansion and pure shear have been chosen to be at least 10 times higher than what we measure between subsequent frames of pupal wing movies.
Video 12.
Video 13.
We then asked if TissueMiner could quantitatively recapitulate the respectively imposed deformation rates. In each dataset, TissueMiner automatically defines a 'whole_tissue' region of interest that corresponds to a consistent set of cells that are always visible (about 100 cells in the isotropic expansion movie and about 50 cells in the pure shear movie, green labels in Videos 12 and 13). All measurements are done in this ROI to avoid measuring deformation due to inward and outward cell flows. Figure 5—figure supplement 1 shows the time evolution of the measured tissue expansion rate (panel A) and tissue shear rate (panel C) that were averaged over the 'whole_tissue' ROI, and their respective cellular contributions. Panels B and D show the corresponding cumulated curves. As expected, in the isotropic expansion movie we observe a nearly constant isotropic expansion rate, which is accounted for by the cell area change contribution. We measure an average expansion rate of (3.53 ± 0.04) 10–2 per frame, which is consistent with the value imposed when creating the movie. The measured uncertainty is the 95% confidence interval of the standard error of the mean. The pure shear rate and its cellular contributions nearly vanish in this movie (Figure 5—figure supplement 1C,D).
For the pure shear movie, we measure an approximately constant horizontal component of the pure shear rate of (1.74 ± 0.02) 10–2 per frame, which is consistent with the value imposed when creating the movie. This pure shear rate is entirely accounted for by cell elongation change. The isotropic expansion rate and its cellular contributions nearly vanish (Figure 5—figure supplement 1A, B). Other contributions to expansion and shear rates are negligible in both movies.
The pixelated nature of individual cell contours contributes to fluctuations of our measured values. Moreover, we find that these fluctuations cancel out when cumulating the deformation (Figure 5—figure supplement 1B and D). Thus, the current implementation of TissueMiner captures the tissue isotropic expansion and pure shear rates as well as the corresponding cellular contributions with a good precision in these computer-generated movies.
Deformation of the pupal fly wing
Figure 5—figure supplement 2 shows the rate of relative area change and cumulative area change of vein and inter-vein regions over time, as well as the cellular contributions to these area changes. As previously noted, the area of the blade as a whole changes very little. However sub-region analysis reveals that inter-vein expansion compensates for compression in vein regions. Vein cells not only divide less than inter-vein cells, but also decrease their area more.
Next we use the Triangle Method to calculate pure shear rates in the time-lapse movies of developing pupal wings. To visualize the spatial pattern of pure shear rate in the wing, TissueMiner allows us to plot nematics corresponding to the local tissue shear rates (Figure 5D) and to rates of shear produced by different cellular contributions (Figure 5—figure supplement 3, and [Etournay et al., 2015]) averaged within the squares of about 26 x 26 microns.
To compare the time evolution of pure shear rate between different tissue subregions we plot this rate averaged over the corresponding ROI (Figure 5E–F and [Etournay et al., 2015]). A positive sign for shear indicates an extension along the PD axis and a contraction along the AP axis, whereas a negative sign indicates an extension along the AP axis and a contraction along the PD axis.
As reported previously, the wing blade as a whole shears along its PD axis between 16 and 32 hAPF. T1 transitions and cell elongation are major contributors to total PD shear, and they display complementary behavior that evolves over time. In the first phase, cells elongate in the PD axis in response to tissue stresses generated by hinge contraction, and by actively oriented T1 transitions that occur first along the AP axis. In the second phase, cell elongation causes the orientation of T1 transitions to shift 90˚ from the AP to the PD axis (Etournay et al., 2015). These PD oriented T1 transitions both contribute to tissue shear and relax PD cell elongation. We now compare shear and cellular contributions to shear in vein and inter-vein regions. Tissue shear peaks earlier in inter-vein regions than in veins, but veins shear more overall. Examining the cellular contributions to shear suggests that increased shear in veins reflects a different relationship between cell elongation and T1 transitions. PD-oriented T1 transitions do not only produce more shear in veins, they also fail to relax PD cell elongation as much as in inter-vein regions.
Discussion
Quantitative image analysis of developing epithelia is a powerful approach to understanding morphogenesis, but the tools with which to tame and analyze these complex data have not been widely available in a standard and well-documented format. Here we provide an introduction to the capabilities of TissueMiner and tutorials for its use. TissueMiner provides general strategy to store and analyze large data sets of interwoven objects by combining state of the art tools for data mining. It allows quantification and visualization of epithelial morphogenesis at multiple scales – from individual cells to entire tissues. It provides both a generic database format and a multi-platform toolkit to interrogate and visualize data and quantify cellular contributions to large-scale epithelial deformations.
TissueMiner has been designed to be versatile and expandable. The database format we provide standardizes the organization of tracked cell data and collects all data into a single file per movie. Such a standardized data format facilitates data sharing between different sources, thereby enhancing cross-laboratory reproducibility. As the database stores positional information about cells and cell contacts, as well as cell neighbor topology, it could also be useful for parameterizing simulations of epithelial remodeling by vertex models or other physical network models. The scheme of our relational database is expandable: additional properties of cells, bonds and vertices can be appended to the database without affecting the relationships between tables. As a consequence, our current query tools to interrogate the database remain functional, even if the database is extended with new properties of cells, bonds and vertices.
TissueMiner takes advantage of the advanced graphical capabilities of R and Python to enable the visualization of patterns of deformation and cell state properties directly on the movie images or quantitatively summarized in graphs. In particular, R provides a flexible grammar with which to manipulate tables obtained from the database and to easily plot graphs (Wickham, 2009; Francois and Wickham, 2015). TissueMiner also offers multiple options for coarse-graining data in space and time through an expandable collection of scripts, which constitutes the TissueMiner library for R or Python. These two easy-to-learn programming languages give TissueMiner its great flexibility to both address general questions of epithelial morphogenesis and project-specific questions, and enable automation, parallelization and customization of user-specific workflows.
The tools underlying TissueMiner were originally developed to understand the interplay of cell dynamics and epithelial tension on the developing wing of the fruit fly, where we described cellular contributions to pupal wing morphogenesis averaged throughout the entire wing blade (Etournay et al., 2015). Here, to illustrate the utility of the TissueMiner framework, we compared the behavior of vein and inter-vein regions in the developing pupal wing. Comparing cell dynamics in veins and inter-vein regions provided an unexpected explanation for the process of 'vein refinement'. Vein refinement refers to the fact that veins become narrower during pupal morphogenesis. This had been interpreted as a signaling-dependent reduction in the number of cells assuming the vein fate (Blair, 2007). Here we show instead that vein narrowing results from a convergent extension-like process that is stronger in veins than in inter-vein regions. This elongates and narrows the veins without reducing vein cell number. It will be interesting to examine how signaling pathways involved in vein refinement influence cell dynamics in veins during morphogenesis. The standardization of analysis that TissueMiner provides will facilitate these and other comparisons critical for deciphering the molecular mechanisms underlying epithelial morphogenesis.
Materials and methods
Live imaging of the pupal wing
The knock-in Ecad::GFP fly line (Huang et al., 2005) was used for live imaging of the developing pupal wing. Flies were raised and maintained at 25°C during imaging by using a temperature-controlled chamber equipped with a humidifier to prevent desiccation. Long-term time-lapse imaging was performed as previously described (Etournay et al., 2015). After the imaging session, flies were maintained in a humid environment where they eclosed at the term of pupal development.
A relational database to store the history of cells, their lineage and their constituent bonds and vertices
The visualization and quantification of cell dynamics underlying tissue morphogenesis rely on the ability to extract information about cell geometry, cell neighbor topology and cell histories from time-lapse movies (Aigouy et al., 2010; Etournay et al., 2015). We use TissueAnalyzer to segment and track the cell network over time. This results in a series of digital images that contain this information (Figure 6—figure supplement 1). To facilitate its access and use, we developed tools in the TissueMiner framework to extract and convert this information initially stored in images into a specific database format (see details in appendix 1), which we call 'TM-DB' (schematically outlined in Figure 6A).
First, the history of each tracked cell in the movie is stored as a separate row in the cell_histories table of the TM-DB (Figure 6A). This includes the movie frames in which it first appears and disappears and why, along with its lineage relationship to other cells (see appendix 1). The reason for cell appearance and disappearance is inferred by the parser. A primary reason could be a cell division, which results in the disappearance of the mother cell and in the appearance of two daughter cells. It could be a cell extrusion that results in its disappearance. It could also be that cells move in and out of the field of view of the microscope lens, resulting in gain and loss of cells. Furthermore, we use this information to establish the lineage relationship that corresponds to each group of cells related by ancestry (Figure 6B). Each cell within the lineage group is assigned a generation number. The lineage group and generation number for each cell are listed in the cell_histories table.
We store the time points at which the movie images were recorded into a frames table that links each movie frame to its corresponding time point. For each movie frame, we need to store geometrical and topological information about cells within the cellular network. Geometrical information includes position and shape descriptors, whereas topological information indicates the arrangement of neighboring cells around each cell. We use cell histories, geometry and topology to understand how individual cells contribute to the whole tissue deformation during morphogenesis (Etournay et al., 2015).
The geometrical information is stored in three tables of the TM-DB: cells, bonds and vertices. They correspond to the 3 generic entities - cells, cell-cell contacts and intersections between cell-cell contacts, respectively illustrated in Figure 6C. These entities are commonly used in vertex model simulations (for review [Fletcher et al., 2014]). The cells table contains cell geometrical data (center of mass, area, shape anisotropy) and the polarized distribution of proteins along the cell circumference, as represented by a polarity nematic tensor (Aigouy et al., 2010). The bonds table informs about bond length, and the vertices table about vertex position in each movie frame.
The directed_bonds table exclusively stores the cell neighbor topological information at each frame, i.e. how bonds are organized around each cell along with the cell neighbor relationship information. To store the cell neighbor topology in an unambiguous manner, we define for each cell a directed path of consecutive bond vectors oriented counterclockwise, which forms the oriented circumference of the cell (Figure 6D, see also [Kachalo et al., 2015]). We link each directed bond to its counter clockwise follower (left directed bond) in the same cell. To store the cell neighbor relationship, we link each directed bond to its corresponding directed bond (conjugated bond) of the neighboring cell (Figure 6D, and appendix 1).
The TM-DB is relational, which means that it establishes contextual relationships between items stored in one ore more tables (see appendix 1). These relationships are outlined in rounded boxes in the conceptual scheme of the TM-DB (Figure 6A). Technically, each item in a table is stored in a separate row and is given a unique number as identifier. For a relationship between two tables, one of the tables contains an additional column, which refers to items in the other table by holding their identifier number. Such additional columns for the TM-DB format are shown in blue in Figure 6—figure supplement 2. When extracting information from a database using so-called queries, these columns serve as bridges connecting the information stored about related items.
In essence, this structure creates a generic relational model to represent complex cell tracking data in 2D. In practice, the data for each movie is stored in a separate SQLite database file. Since all movie files are stored using the same database structure, automated data mining and visualization are greatly facilitated. For the same reason, usage of the TissueMiner database format encourages exchange of both movie data and analysis tools.
An automated workflow compliant with high performance computing platforms
To help the user to perform complex tissue morphogenesis analysis, we developed an automated pipeline that uses the tracked data from TissueAnalyzer as an input to build the database and perform all downstream analyses described above. To do so, we use the snakemake workflow engine developed by Koster and Rahmann (Koster and Rahmann, 2012). This engine channels the different processing steps into a well-formed workflow graph. Snakemake automatically determines the execution order, provides means for error recovery and job control, and supports High Performance Computing (HPC) environments. By using snakemake we enable the user to easily run and monitor TissueMiner, while maintaining a proper decoupling of tools as independent executables.
Practically, the user defines a workflow definition file in which processing steps are defined as a set of execution rules, namely a list of scripts to be run along with required input(s) and expected output(s). Snakemake automatically builds a directed graph from which the execution order of processing steps is inferred. If only one branch of the graph needs to be run, the engine will ensure that all input data are present and will automatically run upstream steps if necessary. This engine also provides the possibility to visualize a directed acyclic execution dependency and execution state graph (DAG) for a given workflow (see Figure 7).
One major advantage of a workflow engine such as snakemake is that it can run the workflow on various architectures - from single-core workstations to multi-core servers and clusters - without the need to modify the rules, thereby facilitating reproducible research. To simplify the TissueMiner installation procedure, we provide a pre-configured system to be loaded in the docker software available at http://docker.com. The TissueMiner docker image can be run without any setup using provided example data or custom user data as detailed out on the TissueMiner GitHub project page. More advanced users can use TissueMiner directly from the command-line with or without snakemake and can thus perform simultaneous analyses of multiple movies.
A user-friendly data-mining library to easily collect information for comparing multiple datasets
After applying our automated workflow to different movies, the results can be easily compared using a collection of command-line tools written in R and Python. These tools aggregate different experiments for plotting and performing comparative analysis. Here we describe the tools written in R. Python tools are described in the corresponding tutorial. The R tools are designed to be used in an integrated development environment such as RStudio, which provides a user-friendly environment to assist the user in writing and executing command lines. These command line tools are organized in the spirit of a grammar of data manipulation and they can be combined with the existing R tools like dplyr (Francois and Wickham, 2015) or ggplot2 (Wickham, 2009) for manipulating and visualizing data (https://mpicbg-scicomp.github.io/tissue_miner/user_manual/Learning_the_R_basics_for_TissueMiner.html).
We developed generic 'multi-query functions' (mqf) to collect specific information for individual movies. These mqf tools are organized into fine-grained and coarse-grained categories according to the type of analysis to be carried out. The fine-grained tools aggregate data at cellular level, namely individual cell properties inside regions of interest. These tools are prefixed with 'mqf_fg_'. The coarse-grained mqf tools are further separated into 'roi' and 'grid' categories to distinguish between regions moving with the tissue and static square regions tiled into a grid. They allow one to visualize and quantify average cell properties at different tissue locations and various spatial scales, and are prefixed with 'mqf_cg_roi_' and 'mqf_cg_grid_' respectively.
To compare fine-grained and coarse-grained cell properties amongst movies we developed a 'multi-db-query' tool, which streamlines the application of the mqf tools to a set of movies. To use this tool, the user should first align the movies in time, using convenient morphological or cellular landmarks. As for the Drosophila wing, we align movies such that the peaks of cell elongation coincide in the different movies. The user can then apply a chosen mqf tool to multiple movies and multiple ROI’s. All mqf tools, alone or in combination with the 'multi-db-query' tool, generate a table that contains individual or averaged measurements to be visualized on the tissue (Figure 1 A–E’, Figure 2A–D’, Figure 3A,D, Figure 4B,D, Figure 5D) or in graphs (Figure 2E–F, Figure 3B, Figure 4C, Figure 5E–F). This library of tools is described in detail in the TM R-User Manual, which also provides many examples. These tools can be easily extended to address project specific questions.
Detecting gain and loss of cell contacts
To detect cell neighbor changes, we developed a routine in R that queries the DB and establishes the cell-neighbor relationship at each frame. By comparing the list of neighboring cell identifiers for a given cell between two consecutive frames [f, f + 1], can one identify and count the changes in neighbor relationships. These can be subdivided into those caused by cell divisions, cell extrusions or the simple gain or loss of a cell contact (not due to division or extrusion). We call these half-T1’s because they resemble the gain and loss of cell contacts that occurs during a T1 transition – although they may also be generated by other events such as rosette formation. To assign a neighbor change to the half-T1 category, the corresponding cell identifiers must be present in both frames, ruling out extrusions and cells moving in and out of the field of view. To detect half-T1’s that occur simultaneously with divisions, we mask neighbor changes due to divisions by propagating the mother cell identifier (frame f) to the two daughter cells (frame f+1) that we fuse into one fake cell having the mother cell identifier. We iterate over each pair of consecutive frames and store the half-T1 events due to a gain and a loss of cell neighbors.
Cell lineages and lineage browsing to follow ROI’s forward and backward in time
We pool all lineage information (as contained in the cell_id, left_daughter_cell_id and right_daughter_cell_id columns from the cell_histories table) into a directed lineage graph (Nepusz, 2006) from which we infer a lineage group identifier and a generation number. By definition the root of each lineage tree is considered as the F0 generation and is thus given a generation value of 0. We follow ROI’s backward and forward in time by browsing lineage graphs that were selected based on the regions drawn by using the draw_n_get_ROIcoord.ijm Fiji macro. However cells may be lost or not detected when browsing the lineages. One primary reason is that extruding cells are not detected when browsing the lineage backward in time. Cells could also be lost due to possible tracking mistakes. To improve spatial continuity of ROI’s we have implemented a method to reassign lost cells to ROI’s when located within ROI’s. To identify lost cells for a frame within a given ROI, we first distinguish bonds that connect two cells within the ROI, only one cell within the ROI or none. All corresponding cell-pairs define an undirected graph on which a connected component analysis (Nepusz, 2006) allows to identify the ROI and non-ROI regions. All cells of non-ROI regions, except for the largest one, are reassigned to become part of the ROI. By doing so, we make the assumption that the largest non-ROI component is defined by the tissue surrounding the ROI.
Nematic tensors to describe cell elongation and the orientation of cellular processes
When analyzing and visualizing single cell properties, we use the same cell elongation definition as in Aigouy et al. (2010). For a given Cartesian xy coordinate system, the elongation of a given cell is defined by the nematic tensor
with
Here, Ac is the area of the given cell, and the integrals are carried out over all points r within the cell. The angle is the angle between the vector r − rc and the x axis, where rc is the cell center defined as
Here, the integral is again carried out over all points r within the cell. The magnitude of the elongation is given by ϵ = (ϵxx2+ϵxy2)½ and the elongation angle φ is given by the following two equations
Note that this definition of cell elongation is different from the triangle-based definition that is also discussed in this article. However for the fruit fly wing, both cell elongation definitions yield very similar results.
To characterize the axes of cell divisions and T1 transition, we introduce the unit nematic tensors
Here, the angle ϕ is the angle of the line connecting both cell centers with respect to the x axis, measured in counter-clockwise sense. The orientation for a half-T1 transition during which two cell lose neighborship is characterized by:
where ϕT1+ is the angle of the line connecting the centers of the cells losing neighborship. The orientation for a half-T1 transition during which two cell gain neighborship is characterized by:
where ϕT1− is the angle of the line connecting the centers of the cells that gain neighborship. The axes of the nematics
Tissue deformation and cellular contributions to it
Here we discuss the formal definitions used to characterize tissue deformation, area change, and shear. We characterize the local rate of tissue deformation by the gradient of the velocity field v(r). We then define the overall deformation rate V of a given piece of tissue by the integral over the area At of this piece:
This 2x2 tensor can be decomposed into an isotropic part Viso characterizing the relative growth rate of tissue area, a symmetric, traceless part
Here, we have defined
In recent work, we have shown that the overall shear rate
Here, the nematic tensors
The operator d/dt denotes the total derivative, c = tanh(2Q)/(2Q), and the dot denotes the tensor dot product. The quantities Q and Φ denote magnitude and angle of the average cell elongation tensor
These formal definitions for
Acknowledgements
This work was a truly collaborative effort and the authors jointly wrote the manuscript. We are grateful to Christian Dahmann, Marcus Michel and Jacques Boutet de Monvel for critical reading of the manuscript, Benoit Lombardo for his help in Fiji macroing, and Peter Steinbach for useful discussion about workflow engines. We thank Franz Gruber, Vincent Michel and Nathalie Gourreau for testing the quickstart tutorials. RE acknowledges a Marie Curie fellowship from the EU 7th Framework Programme (FP7). This work was supported by the Max Planck Gesellschaft, and by the BMBF. SE acknowledges funding from the ERC.
Appendix 1
Funding Statement
The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
Funding Information
This paper was supported by the following grants:
European Research Council to Raphaël Etournay, Natalie A Dye, Suzanne Eaton.
Seventh Framework Programme to Raphaël Etournay.
Max-Planck-Gesellschaft to Matthias Merkel, Marko Popović, Guillaume Salbreux, Frank Jülicher.
Bundesministerium für Bildung und Forschung to Matthias Merkel, Marko Popović, Suzanne Eaton, Frank Jülicher.
Alfred P. Sloan Foundation to Matthias Merkel.
Gordon and Betty Moore Foundation to Matthias Merkel.
National Science Foundation NSF-DMR-1352184 to Matthias Merkel.
Additional information
Competing interests
FJ: Reviewing editor, eLife.
SE: Reviewing editor, eLife.
The other authors declare that no competing interests exist.
Author contributions
RE, Participated in regular group discussions to develop the ideas presented, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.
MM, Participated in regular group discussions to develop the ideas presented, Conception and design, Analysis and interpretation of data, Drafting or revising the article.
MP, Participated in regular group discussions to develop the ideas presented, Conception and design, Analysis and interpretation of data, Drafting or revising the article.
HB, Participated in regular group discussions to develop the ideas presented, Conception and design, Analysis and interpretation of data, Drafting or revising the article.
NAD, Extensively tested TissueMiner and suggested key improvements for TissueMiner, Participated in regular group discussions to develop the ideas presented, Drafting or revising the article, Contributed unpublished essential data or reagents.
BA, Developed key image processing and image analysis methods in TissueAnalyzer, Drafting or revising the article, Contributed unpublished essential data or reagents.
GS, Participated in regular group discussions to develop the ideas presented, Conception and design, Analysis and interpretation of data.
SE, Participated in regular group discussions to develop the ideas presented, Conception and design, Analysis and interpretation of data, Drafting or revising the article.
FJ, Participated in regular group discussions to develop the ideas presented, Conception and design, Analysis and interpretation of data, Drafting or revising the article.
References
- Aigouy B, Farhadifar R, Staple DB, Sagner A, Röper JC, Jülicher F, Eaton S. Cell flow reorients the axis of planar polarity in the wing epithelium of Drosophila. Cell. 2010;142:773–786. 10.1016/j.cell.2010.07.042. [Abstract] [CrossRef] [Google Scholar]
- Aigouy B, Umetsu D, Eaton S. Drosophila: Methods and Protocols. 2016. Segmentation and quantitative analysis of epithelial tissues. [Abstract] [Google Scholar]
- Barbier de Reuille P, Routier-Kierzkowska AL, Kierzkowski D, Bassel GW, Schüpbach T, Tauriello G, Bajpai N, Strauss S, Weber A, Kiss A, Burian A, Hofhuis H, Sapala A, Lipowczan M, Heimlicher MB, Robinson S, Bayer EM, Basler K, Koumoutsakos P, Roeder AH, Aegerter-Wilmsen T, Nakayama N, Tsiantis M, Hay A, Kwiatkowska D, Xenarios I, Kuhlemeier C, Smith RS. MorphoGraphX: A platform for quantifying morphogenesis in 4D. eLife. 2015;4:e14334. 10.7554/eLife.05864. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Blair SS. Wing vein patterning in Drosophila and the analysis of intercellular signaling. Annual Review of Cell and Developmental Biology. 2007;23:293–319. 10.1146/annurev.cellbio.23.090506.123606. [Abstract] [CrossRef] [Google Scholar]
- Blanchard GB, Kabla AJ, Schultz NL, Butler LC, Sanson B, Gorfinkiel N, Mahadevan L, Adams RJ. Tissue tectonics: morphogenetic strain rates, cell shape change and intercalation. Nature Methods. 2009;6:458–464. 10.1038/nmeth.1327. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Bosveld F, Bonnet I, Guirao B, Tlili S, Wang Z, Petitalot A, Marchand R, Bardet PL, Marcq P, Graner F, Bellaïche Y. Mechanical control of morphogenesis by Fat/Dachsous/Four-jointed planar cell polarity pathway. Science. 2012;336:724–727. 10.1126/science.1221071. [Abstract] [CrossRef] [Google Scholar]
- Cilla R, Mechery V, Hernandez de Madrid B, Del Signore S, Dotu I, Hatini V. Segmentation and tracking of adherens junctions in 3D for the analysis of epithelial tissue morphogenesis. PLOS Computational Biology. 2015;11:e14334. 10.1371/journal.pcbi.1004124. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Classen AK, Anderson KI, Marois E, Eaton S. Hexagonal packing of Drosophila wing epithelial cells by the planar cell polarity pathway. Developmental Cell. 2005;9:805–817. 10.1016/j.devcel.2005.10.016. [Abstract] [CrossRef] [Google Scholar]
- Codd EF. A relational model of data for large shared data banks. Communications of the ACM. 1970;13:377–387. 10.1145/362384.362685. [CrossRef] [Google Scholar]
- Codd EF. International Business Machines Corporation; 1971. Further Normalization of the Data Base Relational Model. [Google Scholar]
- Codd EF. International Business Machines Corporation; 1972. Relational Completeness of Data Base Sublanguages. [Google Scholar]
- Codd EF. International Business Machines Corporation; 1974. Recent Investigations in Relational Data Base Systems. [Google Scholar]
- Etournay R, Popović M, Merkel M, Nandi A, Blasse C, Aigouy B, Brandl H, Myers G, Salbreux G, Jülicher F, Eaton S. Interplay of cell dynamics and epithelial tension during morphogenesis of the Drosophila pupal wing. eLife. 2015;4:e14334. 10.7554/eLife.07090. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Fletcher AG, Osterfield M, Baker RE, Shvartsman SY. Vertex models of epithelial morphogenesis. Biophysical Journal. 2014;106:2291–2304. 10.1016/j.bpj.2013.11.4498. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Francois R, Wickham H. Dplyr : A Grammar of Data Manipulation. 2015. [Google Scholar]
- García-Bellido A, Cortés F, Milán M. Cell interactions in the control of size in Drosophila wings. Proceedings of the National Academy of Sciences of the United States of America. 1994;91:10222–10226. 10.1073/pnas.91.21.10222. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Guirao B, Rigaud SU, Bosveld F, Bailles A, López-Gay J, Ishihara S, Sugimura K, Graner F, Bellaïche Y. Unified quantitative characterization of epithelial tissue development. eLife. 2015;4:e14334. 10.7554/eLife.08519. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Heller D, Hoppe A, Restrepo S, Gatti L, Tournier AL, Tapon N, Basler K, Mao Y. EpiTools: An open-source image analysis toolkit for quantifying epithelial growth dynamics. Developmental Cell. 2016;36:103–119. 10.1016/j.devcel.2015.12.012. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Huang ZP, Zhou H, He HL, Chen CL, Liang D, Qu LH. Genome-wide analyses of two families of snoRNA genes from Drosophila melanogaster, demonstrating the extensive utilization of introns for coding of snoRNAs. RNA. 2005;11:1303–1316. 10.1261/rna.2380905. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Jay AK. Using SQLite. O'Reilly Media; 2010. p. 528. [Google Scholar]
- Kachalo S, Naveed H, Cao Y, Zhao J, Liang J. Mechanical model of geometric cell and topological algorithm for cell dynamics from single-cell to formation of monolayered tissues with pattern. PLOS ONE. 2015;10:e14334. 10.1371/journal.pone.0126484. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Köster J, Rahmann S. Snakemake--a scalable bioinformatics workflow engine. Bioinformatics. 2012;28:2520–2522. 10.1093/bioinformatics/bts480. [Abstract] [CrossRef] [Google Scholar]
- Merkel M, Etournay M, Popović M, Salbreux G, Eaton S, Jülicher F. Triangles Bridge the Scales: Quantifying Cellular Contributions to Tissue Deformation. 2016. [Abstract] [Google Scholar]
- Merkel M. From cells to tissues: Remodeling and polarity reorientation in epithelial tissues .Vol. Ph.D. Thesis. Technische Universität Dresden; 2014. [Google Scholar]
- Milán M, Campuzano S, García-Bellido A. Cell cycling and patterned cell proliferation in the Drosophila wing during metamorphosis. Proceedings of the National Academy of Sciences of the United States of America. 1996;93:11687–11692. 10.1073/pnas.93.21.11687. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Mosaliganti KR, Noche RR, Xiong F, Swinburne IA, Megason SG. ACME: automated cell morphology extractor for comprehensive reconstruction of cell membranes. PLoS Computational Biology. 2012;8:e14334. 10.1371/journal.pcbi.1002780. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Nepusz T, Csardi G. The igraph software package for complex network research. InterJournal Complex Systems. 2006;1695 [Google Scholar]
- Nickoloff J. Docker in Action. Shelter Island, NY: Manning Publications; 2015. [Google Scholar]
- Peter Pin-Shan C. The entity-relationship model—toward a unified view of data. ACM Transactions on Database Systems. 1976;15:9–36. 10.1145/320434.320440. [CrossRef] [Google Scholar]
- Sagner A, Merkel M, Aigouy B, Gaebel J, Brankatschk M, Jülicher F, Eaton S. Establishment of global patterns of planar polarity during growth of the Drosophila wing epithelium. Current Biology. 2012;22:1296–1301. 10.1016/j.cub.2012.04.066. [Abstract] [CrossRef] [Google Scholar]
- Schubiger M, Palka J. Changing spatial patterns of DNA replication in the developing wing of Drosophila. Developmental Biology. 1987;123:145–153. 10.1016/0012-1606(87)90436-2. [Abstract] [CrossRef] [Google Scholar]
- Tardieu H, Rochfeld A, Colletti R, d'Organisation Editions . La Méthode MERISE: Principes Et Outils. 2000. [Google Scholar]
- Walck-Shannon E, Hardin J. Cell intercalation from top to bottom. Nature Reviews Molecular Cell Biology. 2014;15:34–48. 10.1038/nrm3723. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
- Wickham H. Ggplot2: Elegant Graphics for Data Analysis. New York: Springer; 2009. [Google Scholar]
- Wiesmann V, Franz D, Held C, Münzenmayer C, Palmisano R, Wittenberg T. Review of free software tools for image analysis of fluorescence cell micrographs. Journal of Microscopy. 2015;257:39–53. 10.1111/jmi.12184. [Abstract] [CrossRef] [Google Scholar]
Decision letter
Thank you for submitting your work entitled "TissueMiner: a multiscale analysis toolkit to quantify how cellular processes create tissue dynamics." for consideration by eLife. Your article has been reviewed by two peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Naama Barkai as the Senior Editor.
The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.
Summary:
Large scale tissue imaging is becoming an everyday tool in developmental biology, but high throughput analysis methods are restricted to few laboratories. Etournay and co-workers present a software platform for segmentation and tracking of 2D epithelial monolayers. The software provides a number of geometric outputs including deformation tensors and cell connectivity. As the tools within TissueMiner appear to be identical to those used by the authors previously (2015 eLife publication), the value of TissueMiner lies not its novelty or proof-of-concept, but squarely in the likelihood of it being used by an average researcher. Both reviewers felt that the software and current tutorials, while expansive, were far from user-friendly in their current implementation, and the tutorials, while extensive, assumed far more technical background than the average user would have. Nevertheless, both reviewers felt that TissueMiner, if improved as described below, would be an important advance and could be very useful to the community.
Some of the key problems encountered were:
1) Installation of the software is not user-friendly. Between the large size of the sample datasets (2+ GB each), and tedious GitHub interactions presented as a series of code fragments rather than a single script, it took one reviewer over an hour to get the sample data downloaded, and to deal with various docker virtual machine bugs. Even pulling the demo data down caused bugs due to permission errors that were not addressed in the tutorial.
2) Once docker was running, execution of the initial data processing step resulted in a dizzying array of analysis steps with no explanation of what was happening or how much time it would take.
3) Users must already have R installed, but this is not mentioned.
4) Preparing TissueMiner to run through RStudio should not require the user to copy-paste code fragments each time. A single script that does this should be included.
5) Tissue Analyzer – the preprocessing tool made by the same group and essentially required for use with Tissue Miner – works beautifully and has a series of video tutorials. This would be a good template for TissueMiner.
6) RStudio initialization script kept crashing. Apparently additional toolboxes are required, such as devtools.
Essential revisions:
In the context of the software, the reviewers specifically recommended that you:
1) Make the sample data much smaller in size (<2GB) so that the initial docker processing takes <5min, or at least has a remaining time clock;
2) Make the software completely scripted, including all the initial importing in RStudio (no copy-pasting, just loading an RStudio file);
3) Prepare a new "First Use" R tutorial, skipping most of the R-101 training and just getting a user up and running with something simple, like plotting cell-area.
Then:
1) Conduct user testing with testers who truly have "no programming experience". Allow them to test installation unsupervised.
2) Create more streamlined, focused tutorials rather than an enormous master guide. TissueMiner tries to do everything at one time (e.g. the initial docker processing), and it is difficult to follow. Give the user the simplest data-set you can provide, and have them perform the simplest analysis (e.g. only focus on cell area, or tracking, or divisions).
3) The authors illustrate the capabilities of the software using the case of wing vein morphogenesis, but this is not a validation. To validate the software, the authors could use computer generated images of cell monolayers with known geometry, and demonstrate that the software is able to recover the different geometric parameters of each cell in the tissue.
Author response
Some of the key problems encountered were:
1) Installation of the software is not user-friendly. Between the large size of the sample datasets (2+ GB each), and tedious GitHub interactions presented as a series of code fragments rather than a single script, it took one reviewer over an hour to get the sample data downloaded, and to deal with various docker virtual machine bugs. Even pulling the demo data down caused bugs due to permission errors that were not addressed in the tutorial.
To address these problems, we have significantly simplified the installation procedure. For Ubuntu Linux, we now provide a single script install_tm.sh that fully installs TissueMiner. Instructions are found directly on the landing page of the TissueMiner GitHub repository: https://github.com/mpicbg-scicomp/tissue_miner#installation. For other systems, we provide a 2-step protocol for installing TissueMiner: first install the Docker Engine and second download the TissueMiner docker image. The installation procedure of this dockerized TissueMiner is explained in detail on the TissueMiner GitHub site: https://github.com/mpicbg-scicomp/tissue_miner#installation
The new installation procedure was successfully tested by two users with no programming experience on Ubuntu 14.04, Ubuntu 15.10, and on different versions of MacOSX (Mavericks, Yosemite and El Capitan).
We enhanced the former “R tutorial” and renamed it “TM R-User Manual”: https://mpicbg-scicomp.github.io/tissue_miner/user_manual/TM_R-UserManual.html
In addition, we now provide simplified “Quickstart Tutorials” that are designed to help new users quickly learn the procedures and capabilities of TissueMiner:
https://github.com/mpicbg-scicomp/tissue_miner#documentation
TM R-User Manual and Quickstart Tutorials are now based on a demo dataset that is smaller than that which was originally provided (~35Mb of compressed data). Nonetheless, we also provide the large datasets for more sophisticated analysis or for those interested in pupal wing morphogenesis. This larger dataset is now compressed (<800Mb each) to facilitate download and is used in the last part of the documentation (see TM R-User Manual, section 3).
2) Once docker was running, execution of the initial data processing step resulted in a dizzying array of analysis steps with no explanation of what was happening or how much time it would take.
To improve clarity and speed, we streamlined the initial processing (removing many of the automatic video creations) and now only display messages that indicate the progression of the workflow. With these changes, the initial data processing steps in the automated workflow require less than 3min on the provided demo sample.
The new Quickstart Tutorials are designed to guide the user through the database creation as well as through specific type of analysis (cell area, elongation, etc…) by running appropriate R scripts (a few seconds of run time for each script using the demo dataset). Their source code can be found on the GitHub repository:
https://github.com/mpicbg-scicomp/tissue_miner/tree/master/docs/quickstart/scripts
3) Users must already have R installed, but this is not mentioned.
On Ubuntu, our new installer script install_tm.sh solves this issue. On other systems, we provide a docker image of TissueMiner that doesn’t require additional R installation. Details about how to configure and use this dockerized TissueMiner for developing custom R scripts can be found in the TM R-User Manual (section 1.3.1):
https://mpicbg-scicomp.github.io/tissue_miner/user_manual/TM_R-UserManual.html
Optionally, MacOSX users may also find easier to install R, Rstudio and the TissueMiner API independently from the docker image (just a matter of taste). We therefore provide an installer install_tm_api.sh. Instructions are present in the TM R-User Manual(section 1.3.1).
4) Preparing TissueMiner to run through RStudio should not require the user to copy-paste code fragments each time. A single script that does this should be included.
We now provide a single script entitled analyze_movie.R that combines all individual quickstart tutorial scripts. This script is present on the GitHub repository: https://github.com/mpicbg-scicomp/tissue_miner/tree/master/docs/quickstart/scripts
Instructions about how to run this script can be found in the Quickstart R-Tutorials.
Additional explanations and code fragments for advanced users of TM have been moved to the TM R-User Manual.
5) Tissue Analyzer – the preprocessing tool made by the same group and essentially required for use with Tissue Miner – works beautifully and has a series of video tutorials. This would be a good template for TissueMiner.
Because TissueMiner is based on command line input, rather than graphical input as in TissueAnalyzer, we believe video tutorials are not as useful. Instead we designed the web-based tutorials: Quickstart TM-R Tutorials for Ubuntu and Quickstart docker-TM-R Tutorials for other systems. These tutorials guide even complete novices through the use of TissueMiner by providing not just instructions but commands that can be directly copy-pasted into the terminal to run the analyses:
https://github.com/mpicbg-scicomp/tissue_miner#documentation
We provide video tutorials for the ImageJ macros in TissueMiner (for drawing regions of interest and orienting the tissue).
6) RStudio initialization script kept crashing. Apparently additional toolboxes are required, such as devtools.
As mentioned in point 3), the new installation procedure resolves this problem.
Essential revisions:
In the context of the software, the reviewers specifically recommended that you:
1) Make the sample data much smaller in size (<2GB) so that the initial docker processing takes <5min, or at least has a remaining time clock;
As explained above, we now provide a smaller sample dataset (a cropped region of the pupal wing, ~35Mb in size) to guide new users through the Quickstart Tutorials. Furthermore, the user can now select specific steps of the analysis to run. For example, it may not be necessary or desired at the beginning of the tutorial to create videos of patterned cell behaviors. For this example dataset, a simplified initial analysis workflow, including database creation, cell neighbor change detection and shear calculation, requires less than 3min on a single core(2.7 GHz Intel Core i5; 2Gb of RAM). Subsequent analysis steps, such as video creation, can be run on demand by selecting the corresponding snakemake rules, see Figure 7 and TM R-User Manual.
The R programming language doesn’t easily allow one to systematically provide a progress bar. Nonetheless, we estimated the processing time required for three datasets of increasing size (excluding extra video creation):
dataset | Compressed Size (Mb) | True size (Gb) | Number of cell-cell contacts | Number of cell contours | Number of cell lineages | Number of ROIs | Run time |
demo | 31 | 0.1 | ~200 000 | ~68 000 | ~1200 | 3 | 2min48sec |
WT_3 | 563 | 2.1 | ~1 450 000 | ~487 000 | ~8600 | 6 | 17min00sec |
WT_1 | 786 | 2.5 | ~1 610 000 | ~540 000 | ~9400 | 6 | 18min03sec |
The size of WT_2 dataset is similar to the size of WT_1.
Running these analyses on multicore computers would significantly reduce these times.
2) Make the software completely scripted, including all the initial importing in RStudio (no copy-pasting, just loading an RStudio file);
The full analysis of single movies is now completely encoded in a single script, analyze_movie.R. This script is explained in the Quickstart Tutorials and can either be used as is or custom edited in Rstudio.
Comparisons between multiple movies require adaption to the specific tissue being analyzed. Therefore, we provide a single template script, compare_multiple_movies.R, which requires the user to enter the location of the movies and/or ROIs to compare.
3) Prepare a new "First Use" R tutorial, skipping most of the R-101 training and just getting a user up and running with something simple, like plotting cell-area.
As requested, we developed “First Use” Quickstart Tutorials that quickly introduces novices to the capabilities and procedures of TissueMiner.
To help users that are interested in further developing TissueMiner in R for more sophisticated and customized analyses, we still provide some information about R, including the grammar of data manipulation and graphics, in a dedicated tutorial referenced in the TM R-User Manual: https://mpicbg-scicomp.github.io/tissue_miner/user_manual/Learning_the_R_basics_for_TissueMiner.html
Then:
1) Conduct user testing with testers who truly have "no programming experience". Allow them to test installation unsupervised.
Two testers with "no programming experience" managed to install and run the Quickstart Tutorials on both Ubuntu and MacOSX using the updated installation procedures.
We also provide two additional tutorials to help users getting started with the analysis of their own data. For Ubuntu:
For other systems:
2) Create more streamlined, focused tutorials rather than an enormous master guide. TissueMiner tries to do everything at one time (e.g. the initial docker processing), and it is difficult to follow. Give the user the simplest data-set you can provide, and have them perform the simplest analysis (e.g. only focus on cell area, or tracking, or divisions).
The new Quickstart Tutorials were designed to address this point. We also created the TM R-User Manual in html format, including a table of contents and links to each type of analysis, for those with more experience with R.
3) The authors illustrate the capabilities of the software using the case of wing vein morphogenesis, but this is not a validation. To validate the software, the authors could use computer generated images of cell monolayers with known geometry, and demonstrate that the software is able to recover the different geometric parameters of each cell in the tissue.
We produced two computer-generated movies of tissues with known geometry and deformation. One dataset corresponds to hexagonal cells undergoing homogeneous isotropic expansion with an imposed rate, and the other corresponds to hexagonal cells undergoing homogeneous pure shear with an imposed rate. We segmented and tracked both movies and performed the entire TissueMiner deformation analysis. We found that TissueMiner correctly recapitulates the imposed deformations and their cellular contributions. Moreover, we find for these movies that expected deviations due to pixelation are two orders of magnitude smaller than the deformations that we aim to measure.
We added a subsection entitled “Validation of tissue deformation measurements using computer-generated cells” – where we discuss all of these points.
Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd
Full text links
Read article at publisher's site: https://doi.org/10.7554/elife.14334
HAL Open Archive
https://hal-pasteur.archives-ouvertes.fr/pasteur-01545827
Citations & impact
Impact metrics
Citations of article over time
Alternative metrics
Article citations
Assessing mechanical agency during apical apoptotic cell extrusion.
iScience, 27(11):111017, 23 Sep 2024
Cited by: 0 articles | PMID: 39507245 | PMCID: PMC11539584
Deep learning reveals a damage signalling hierarchy that coordinates different cell behaviours driving wound re-epithelialisation.
Development, 151(18):dev202943, 24 Sep 2024
Cited by: 0 articles | PMID: 39177163 | PMCID: PMC11449448
Deep learning for rapid analysis of cell divisions in vivo during epithelial morphogenesis and repair.
Elife, 12:RP87949, 23 Sep 2024
Cited by: 2 articles | PMID: 39312468 | PMCID: PMC11419669
Automated Layer Analysis (ALAn): An Image Analysis Tool for the Unbiased Characterization of Mammalian Epithelial Architecture in Culture.
Bio Protoc, 14(8):e4971, 20 Apr 2024
Cited by: 0 articles | PMID: 38686346 | PMCID: PMC11056004
α-Spectrin regulates cell shape changes during disassembly of microtubule-driven protrusions in Drosophila wings.
MicroPubl Biol, 2024, 15 Apr 2024
Cited by: 0 articles | PMID: 38690064 | PMCID: PMC11058509
Go to all (56) article citations
Data
Similar Articles
To arrive at the top five similar articles we use a word-weighted algorithm to compare words from the Title and Abstract of each citation.
Interplay of cell dynamics and epithelial tension during morphogenesis of the Drosophila pupal wing.
Elife, 4:e07090, 23 Jun 2015
Cited by: 136 articles | PMID: 26102528 | PMCID: PMC4574473
EpiTools: An Open-Source Image Analysis Toolkit for Quantifying Epithelial Growth Dynamics.
Dev Cell, 36(1):103-116, 01 Jan 2016
Cited by: 49 articles | PMID: 26766446 | PMCID: PMC4712040
Dynamic clonal analysis based on chronic in vivo imaging allows multiscale quantification of growth in the Drosophila wing disc.
Development, 141(11):2339-2348, 01 Jun 2014
Cited by: 23 articles | PMID: 24866118
Bioimage Informatics in the context of Drosophila research.
Methods, 68(1):60-73, 13 Apr 2014
Cited by: 13 articles | PMID: 24732429
Review
Funding
Funders who supported this work.
Alfred P. Sloan Foundation
Bundesministerium für Bildung und Forschung
European Research Council
Gordon and Betty Moore Foundation
Max-Planck-Gesellschaft
National Science Foundation (1)
Grant ID: NSF-DMR-1352184
Seventh Framework Programme
The Francis Crick Institute (1)
Grant ID: 10317