1. Introduction
Somatic embryogenesis (SE) plays a crucial role in the genetic transformation of plants and in their in vitro rapid propagation. SE refers to the process of transformation from the somatic to the embryogenic state and is a unique phenomenon similar to what occurs during the development of zygotic embryos. The SE process can be artificially controlled in vitro conditions. It is also a classic example of cell totipotency. The underlying mechanisms of SE are significant for revealing important scientific theoretical problems that involve cell development, differentiation, and morphogenesis [
1,
2,
3,
4].
Nic-Can et al. [
5] reported that SE is a complex process of molecular regulation in which somatic cells acquire totipotency to transform into embryonic cells. How does a somatic cell become a whole plant? This question has been reported to be one of the 25 questions most important within the scientific community [
6]. However, the regulatory networks that are involved in the transition from the nonembryogenic staged callus (NEC) to the somatic embryo during SE remain poorly understood [
7]. In addition to environmental factors, external stimuli and hormones, many specifically expressed genes also participate in the SE process and play a decisive role [
8,
9,
10,
11]. Therefore, it is particularly important to identify and isolate key genes that regulate SE. Candidate genes that may regulate SE were identified using bioinformatics tools [
12]. The results showed that
auxin response factor (
ARF) [
11,
13,
14],
leafy cotyledon (
LEC) [
10,
15],
Wuschel (
WUS) [
16,
17,
18],
somatic embryogenesis receptor kinase (
SERK) [
8],
shoot meristemless (
STM), and
baby boom (
BBM) [
10,
19,
20] were involved in SE. In addition, the salicylic acid (SA) and jasmonic acid (JA) signaling pathways were also predicted to regulate SE [
12,
21,
22,
23,
24,
25,
26,
27].
During the process of cotton SE, Hu et al. [
28] found that interference of the high-mobility group box 3 (
GhHmgb3) gene enhanced the proliferation and differentiation of embryogenic callus. Poon et al. [
29] found that embryogenic callus could secrete a specific AGP protein (GhPLA1), which was extracted and added to the medium to enhance SE. Min et al. [
7] showed that the process of SE was negatively regulated by a casein kinase gene (
GhCKI) via a complex regulatory network. Several differentially expressed small RNAs and their target genes were identified via small RNA sequencing and degradation sequencing, which indicated that the SE process in cotton was regulated by complex gene expression networks [
30]. Genes related to stress, such as
SERK1,
abscisic aldehyde synthesis enzyme 2 (
ABA2),
abscisic acid insensitive 3 (
ABI3),
jasmonate ZIM-domain 1 (
JAZ1),
late embriogenesis abundant protein 1 (
LEA1), and transcription factors (NACs, WRKYs, MYBs, ERFs, Zinc finger family proteins) were also involved in callus induction [
31,
32,
33,
34,
35,
36,
37]. Transcriptome sequencing was carried out during callus dedifferentiation and redifferentiation. Auxin and stress response molecules were found to be involved in SE in cotton [
38]. Xu et al. [
39] also found that plant growth regulators played an important regulatory role in SE.
In recent years, metabolomics, which involves the collection of all low molecular weight metabolites that are present in a certain organism, cell, or tissue during a specific physiological period, has attracted a great deal of attention. It mainly involves the examination of endogenous small molecules with molecular weights less than 1000 Da. Plants can produce 200,000 to 1 million kinds of metabolites, which can be divided into primary metabolites and secondary metabolites [
40].
Metabolites are the ultimate result of gene transcription and protein expression in organisms that are under the influence of internal and external factors. They form a bridge between genes and phenotypes and can directly reflect the physiological phenomena within plants. At the same time, metabolites can regulate gene transcription and protein expression. As the final products of cellular regulatory processes, metabolites represent the building blocks of macromolecules and are also an essential component of cellular energy pathways [
41]. Metabolomic technologies enable the examination and identification of endogenous biochemical reaction products and thereby reveal information about the metabolic pathways and processes occurring within a living cell [
42].
Metabolomics data can provide a wealth of information about the biochemical status of tissues, and the interpretation of such data offers an effective approach that can be used for the functional characterization of genes [
43,
44,
45]. Transcriptomic and proteomic studies are only able to predict changes at the gene expression and protein levels, respectively, while metabolomics studies investigate the changes in functioning exhibited by these genes or proteins [
46]. Compared with the genome, transcriptome, or proteome, the metabolome more accurately reflects the phenotype of the organism, and the minor changes in the genome and proteome can be reflected and amplified by the metabolome [
47]. The applicability of metabolomics to the SE process has been demonstrated in many plant genera [
48,
49,
50,
51,
52].
The development of ‘omics’ technology has allowed for comprehensive analysis of the SE process at the transcript, protein, and metabolite levels [
50,
52]. The combination of modern omics technologies such as transcriptomics and metabolomics provides a great opportunity to acquire a deeper understanding of the mechanisms of cell totipotency at the molecular and biochemical levels [
52]. To date, there have been few studies that have used transcriptomics and metabolomics techniques to identify potential key factors involved in the SE process [
48,
49,
51,
52]. Additionally, the relationship between callus tissues and the media used to culture them is not well understood at the biochemical level in terms of nutrient uptake by callus from the medium or release of chemicals into the medium [
52]. Integrated metabolomic and transcriptomic network analyses can elucidate the functioning of a series of secondary metabolites along with changes in content, as well as the corresponding differentially expressed genes, which can broaden the global view of SE regulation [
52].
In this study, we comparatively investigated the dynamic metabolomic and transcriptomic profiles of two transdifferentiation processes, embryogenic differentiation and somatic embryo development, during the SE process in cotton. By interactively comparing metabolomic and transcriptomic data, we were able to identify potential metabolites and the corresponding differentially expressed genes at the molecular and biochemical levels, as well as draw a comprehensive picture of the events underpinning SE development to provide a valuable foundation for uncovering the regulatory mechanisms underlying plant cell totipotency.
3. Discussion
Metabolomics is an emerging omics technology that, like genomics and proteomics, can be used to qualify and quantify all metabolites of small molecular weight within the cells of an organism. Plant metabolomics has been widely applied to the investigation of patterns of metabolite accumulation and their underlying genetic basis via the identification of genes involved in metabolism, which is currently a topic of interest in modern plant biology. As the final products of genome expression, metabolites directly define the biochemical characteristics of a cell or tissue, which allows metabolomic data to be used to explain the biochemical mechanisms underlying SE. Meanwhile, integrated transcriptomic and metabolomic analysis allows for the more precise representation of gene-to-metabolite networks and thus will be an effective method that can be used to decipher the mechanisms involved in SE regulation in cotton. In the current study, we combined transcriptome and metabolome analyses to generate dynamic profiles of two transdifferentiation processes, embryogenic differentiation, and somatic embryo development, during SE in cotton, with the aim to provide a better understanding of the processes of SE transdifferentiation that resulted in cell totipotency at the molecular and biochemical levels.
3.1. Accumulated DAMs Specifically Involved in Two SE Transdifferentiation Processes in Cotton
In this study, a hierarchical cluster analysis (HCA) was performed to assess the patterns of accumulation of the metabolites among the different samples. Results showed that PEC and GE can be clustered together while NEC forms a separate cluster, which suggests that there is significant differential accumulation of metabolites between the two transdifferentiation processes, embryogenic differentiation (PEC-NEC) and somatic embryo developmental initiation (GE-PEC).
To investigate the differential metabolites specifically accumulated in (PEC-NEC) vs. (GE-PEC) vs. (GE-NEC), a Venn diagram was generated (
Figure 5a;
Table S3). The results showed that 26 metabolites were accumulated in common among (PEC-NEC) vs. (GE-PEC) vs. (GE-NEC). Meanwhile, 45 DAMs were accumulated in both (PEC-NEC) vs. (GE-PEC), 118 DAMs in both (PEC-NEC) vs. (GE-NEC), and 63 DAMs in both (GE-PEC) vs. (GE-NEC).
We examined the metabolites that were specifically accumulated between the two transdifferentiation processes, embryogenic differentiation (PEC-NEC), and somatic embryo development (GE-PEC). The results showed that there were 111 DAMs that were specifically accumulated in PEC vs. NEC, of which amino acids, organic acids, nucleotides, flavones, and lipids were the most represented (
Figure 5b). Meanwhile, 94 DAMs were accumulated specifically in GE vs. PEC, of which flavones hydroxycinnamoyl derivatives, other metabolites, amino acids and organic acids accounted for a large proportion (
Figure 5c). Specifically, nucleotides and lipids were more greatly accumulated during embryogenic differentiation, whereas greater amounts of flavones and hydroxycinnamoyl derivatives were accumulated during the somatic embryo development process. These data indicated that nucleotides and lipids may play important and special roles during embryogenic differentiation, whereas flavonoids are more important during the embryo maturation process.
3.2. Enrichment of Purine Metabolism in Embryogenic Differentiation
Purine metabolism refers to the metabolic pathways that synthesize and break down the purines that are present in many organisms. In our study, to investigate the functioning of SE-related metabolites, we analyzed the differences and dynamic metabolite changes among the three groups, PEC vs. NEC, GE vs. PEC, and GE vs. NEC. KEGG enrichment analysis showed that the ‘purine metabolism’ pathway was significantly enriched during embryogenic differentiation (
Figure 6b). All of the differentially accumulated metabolites related to purine metabolism were upregulated. Meanwhile, enrichment analysis of DAMs and DEGs showed that ‘purine metabolism’ was co-mapped based on results from the KEGG database (
Table S5).
In our study, several major DEGs of regulatory factors were identified and significantly associated with purine metabolism in embryogenic differentiation, including receptor-like protein kinases, signal recognition, transcription, stress regulation, lipid binding, hormone responses, and histone modification were significantly upregulated during the embryogenic differentiation process (
Table 4).
The role of purine metabolism in the SE process has been demonstrated in many plants. A comparative omic analysis of tree fern
Cyathea delgadii explants that were undergoing direct SE was performed. The results revealed that the differentially regulated proteins adenine phosphoribosyl transferase 3 and adenosine kinase 2 were assigned to the purine metabolism category and were associated with direct SE in
C. delgadii [
60]. To understand the molecular mechanisms that regulate early SE in
Eleutherococcus senticosus Maxim, a high-throughput RNA-seq technology was used to investigate its transcriptome. The initiation of SE affected gene expression in many KEGG pathways but predominantly affected expression in metabolic pathways and those related to the biosynthesis of secondary metabolites. Other unigenes were classified into the purine metabolism pathway [
61]. Similar results were obtained in cotton (
Gossypium hirsutum L.). RNA-seq was performed to analyze the genes expressed during SE and their expression dynamics using RNAs isolated from nonembryogenic callus (NEC), embryogenic callus (EC), and somatic embryos (SEs). The differentially expressed genes in NEC, EC, and SEs were identified, annotated, and classified. Partial DEGs were identified that were related to purine metabolism [
62]. A possible critical role for purines during embryogenesis in geranium hypocotyl tissues (
Pelargonium x
hortorum) has also been reported [
63]. The above results revealed the significant role of purine metabolism in EC proliferation.
3.3. Enrichment of Flavonoid Biosynthesis in Somatic Embryo Developmental Initiation
Flavonoids are synthesized by the phenylpropanoid metabolic pathway in which the amino acid phenylalanine is used to produce 4-coumaroyl-CoA. The metabolic pathway continues through a series of enzymatic modifications to yield flavanones, dihydroflavonols, and then anthocyanins. Along this pathway, many products can be formed, including flavonols, flavan-3-ols, proanthocyanidins (tannins) and a host of other various polyphenolics. In the current study, 139 differentially accumulated metabolites were identified in GE vs. PEC, and flavone was largely detected (
Figure 4h). 94 DAMs accumulated specifically in somatic embryo development, of which flavone accounted for a large proportion (
Figure 5c). KEGG enrichment analysis of DAMs suggested that ‘phenylpropanoid biosynthesis’, ‘flavone and flavonol biosynthesis’, ‘flavonoid biosynthesis’, and ‘biosynthesis of phenylpropanoids’ were enriched in GE vs. PEC (
Figure 6b). Meanwhile, in co-mapped pathways between genes and metabolites, ‘phenylpropanoid biosynthesis’ (ko00940), ‘flavonoid biosynthesis’ (ko00941), and ‘flavone and flavonol biosynthesis’ (ko00944) showed significant accumulation in GE vs. PEC (
Table S5).
The significant role of flavonoid biosynthesis in the somatic embryo development process has been investigated. In citrus cell cultures, there was no detectable accumulation of flavonoid in the undifferentiated calli, but flavonoid accumulated after the morphological changes to embryos. Two chalcone synthase (
CHS) genes differentially expressed during citrus SE and
CHS gene may regulate the accumulation of flavonoid [
64].
With the goal to better understand SE development and to improve the efficiency of SE conversion in
Theobroma cacao L., gene expression differences between zygotic and somatic embryos were examined using a whole genome microarray. Expression levels of genes involved in fatty acid metabolism and flavonoid biosynthesis were differentially expressed in the two types of embryos. The relatively higher expression of flavonoid related genes during SE suggested that the developing tissues may be experiencing high levels of stress during SE maturation caused by the in vitro environment [
65].
Moreover, Wang et al. reported that overexpression of
GhSPL10, a target of
GhmiR157a, activated the flavonoid biosynthesis pathway, and promoted initial cellular dedifferentiation and callus proliferation [
66]. The synthesis of flavonoids like anthocyanin in several plant tissues has been associated with increased phenylalanine ammonia lyase (PAL) activity. The increased PAL level was detected in samples collected from different growth phases during SE in
Silybum marianum. As intermediary products of phenylpropanoid metabolism, flavonoids may stimulate differentiation and create a situation that is more favorable for embryogenesis [
67]. These conclusions demonstrated that flavonoids biosynthesis might be frequently associated with somatic embryo development during cotton SE transdifferentiation.
5. Conclusions
Somatic embryogenesis is the developmental reprogramming of somatic cells toward the embryogenesis pathway. In this study, the dynamic metabolomic and transcriptomic profiling of cotton SE transdifferentiation processes, embryogenic differentiation, and somatic embryo development, were comparatively investigated.
During embryogenic differentiation (PEC vs. NEC), nucleotides and lipids were specifically accumulated. And the metabolome wide DAMs significantly enriched in purine metabolism (
Table 2). In addition, purine metabolism-related genes associated with signal recognition, transcription, stress, and lipid binding were remarkably activated (
Table 4). Moreover, several classic SE genes that specifically activated during embryogenic differentiation, including
BBM,
SERK1,
LEC1,
ARF2,
AGL15,
AGP1,
GLP2,
AGO1,
LTP2, and
AMY1, were highly correlated with the corresponding metabolites that were involved in purine metabolism (
Table 5).
During somatic embryo development (GE vs. PEC), flavones and hydroxycinnamoyl derivatives were largely accumulated. DAMs were most significantly associated with flavonoid biosynthesis (
Table 3). Classic SE genes that control somatic embryo development, including
WUS,
CLV1,
CUC2,
SHR, and
SCW, were highly correlated with the corresponding metabolites that were involved in flavonoid biosynthesis (
Table 6).
By interactively comparing metabolomic and transcriptomic data, we identified a series of potential metabolites and the corresponding differentially expressed genes candidate for a relevant role in SE transdifferentiation. The findings in our work provide new insights into the underlying molecular and biochemical basis associated with embryogenic competence acquisition underpinning cotton SE development.