AprGPD: the apricot genomic and phenotypic database

Chen, Chen; Liu, Huimin; Gou, Ningning; Huang, Mengzhen; Xu, Wanyu; Zhu, Xuchun; Yin, Mingyu; Bai, Haikun; Wang, Lin; Wuyun, Ta-na

doi:10.1186/s13007-021-00797-4

Database
Open access
Published: 23 September 2021

AprGPD: the apricot genomic and phenotypic database

Chen Chen^1,2,3,
Huimin Liu^1,2,3,
Ningning Gou^1,2,3,
Mengzhen Huang^1,2,3,
Wanyu Xu^1,2,3,
Xuchun Zhu^1,2,3,
Mingyu Yin^1,2,3,
Haikun Bai^1,2,3,
Lin Wang^1,2,3 &
…
Ta-na Wuyun ORCID: orcid.org/0000-0003-1227-0158^1,2,3

Plant Methods volume 17, Article number: 98 (2021) Cite this article

2250 Accesses
2 Citations
3 Altmetric
Metrics details

Abstract

Background

Apricot is cultivated worldwide because of its high nutritive content and strong adaptability. Its flesh is delicious and has a unique and pleasant aroma. Apricot kernel is also consumed as nuts. The genome of apricot has been sequenced, and the transcriptome, resequencing, and phenotype data have been increasely generated. However, with the emergence of new information, the data are expected to integrate, and disseminate.

Results

To better manage the continuous addition of new data and increase convenience, we constructed the apricot genomic and phenotypic database (AprGPD, http://apricotgpd.com). At present, AprGPD contains three reference genomes, 1692 germplasms, 306 genome resequencing data, 90 RNA sequencing data. A set of user-friendly query, analysis, and visualization tools have been implemented in AprGPD. We have also performed a detailed analysis of 59 transcription factor families for the three genomes of apricot.

Conclusion

Six modules are displayed in AprGPD, including species, germplasm, genome, variation, product, tools. The data integrated by AprGPD will be helpful for the molecular breeding of apricot.

Background

Apricots, belonging to the Rosaceae family and section Armeniaca (Lam.) Koch [1], are cultivated worldwide and considered one of the most delicious temperate tree fruits [2]. Apricot has a long history of cultivation in China[3]. A total of ten species belongs to the Armeniaca section, among these species, nine species, including Prunus armeniaca, P. sibirica, P. mandshurica, P. holosericeae, Prunus × dasycarpa, P. zhidanensis, P. zhengheeniss, P. limeixing, and P. mume, are native to China [3,4,5], whereas, P. brigantina is endemic in French Alps [6].

The modern sequencing technology has boosted the genome and transcriptome data of apricot and the phenotypic data of rich germplasm resources in apricot have been accumulated. Therefore, a comprehensive database is needed to construct for integrating these data, which is convenience for researchers and breeders of apricot and other species in Rosaceae. Here, we have summarized three high-quality and chromosome scale assemblies of genomes in AprGPD, including P. sibirica (F106, 219 Mb with a contig N50 length of 6.70 Mb, eight chromosomes), P. armeniaca (Sungold, 217 Mb with a contig N50 length of 7.13 Mb, eight chromosomes), and kernel consumption apricot (Longwangmao, 225 Mb with a contig N50 length of 6.91 Mb, eight chromosomes), the reference genome was version 0.9 for all species. AprGPD includes 90 RNA sequencing (RNA-seq) data, 306 genome resquencing data, and 1692 germplasms (58 phenotypic types). All data included in the AprGPD were collected by the authors. Based on these data, we have conducted gene annotation, transcriptome analysis, detailed analysis of transcription factor (TF) family, annotation of variation sites, construction of the metabolic pathways, classification of quantitative characteristics, and prepared figures for the visual representation of the phenotypic data. AprGPD also offers various query, analysis, and visualization tools. A total of six modules are displayed in AprGPD, including species, germplasm, genome, variation, product, and tools. This database is a useful resource for the apricot research.

Construction and content

The data of AprGPD are stored in MySQL database on a Ubuntu server. User-friendly interfaces are developed using JavaScript, HTML5, and CSS3. Query searches are achieved suing JavaScript and PHP. AprGPD is divided into six modules base on different data and applications. The data and functions of each module are described as follow:

Species module

The species module exhibits the distribution and phenotypic traits, including flower, leaf, fruit, seed, and tree type of the nine species from Armeniaca section in China (Additional file 1: Table S1) and provides a visual representation of each species (Fig. 1).

Germplasm module

The germplasm module contains 58 phenotypic data types of 1692 germplasms, and clicking on the corresponding name, then, displaying pictures of their flowers, fruits, seeds, and distribution information (Fig. 2b). The intersting traits can also be quickly observed (Fig. 2a). The pie chart provides statistical information on qualitative traits, and the frequency chart provides the overall summary of quantitative traits (Fig. 2c). We divided the quantitative traits into three levels by using μ± 0.5246δ for data showing normal distribution (μ and δ indicated mean and standard deviation, respectively) and \(y = G \pm \left( {\text{1/2 + n}} \right)x\) (n = 0, 1, 2, 3, 4; G is the median of the data, and x is the grade difference) for data showing non-normal distribution as previous descriptions [7, 8].

Genome module

The genome module is divided into five submodules, including gene annotation, metabolic pathway, gene family, gene network, and transcription factors. For each submodule, the data and basic search functions have been described.

Gene annotation

A total of 98,615 protein-coding genes were predicted from these three genome assemblies, which included 32,959 genes from P. sibirica (F106), 32,669 genes from P. armeniaca (Sungold), and 32,987 genes from P. armeniaca × P. sibirica (Longwangmao). The information of gene structure, gene annotations from Gene Ontology (GO) and KEGG annotation (Fig. 3a), gene expression profiles (Fig. 3b), and variations are displayed interactively (Fig. 3c) in this submodule. Other submodules, including metabolic pathways, transcription factors, gene network, and gene family, are integrated with gene annotation.

Metabolic pathways

KEGG pathway maps are the graphical representations of the reaction networks, and each map is summarized by experimental evidence from the literature [9]. We have obtained the KEGG orthologs for the Sungold, F106, and Longwangmao genomes and generated their metabolic pathways. Moreover, we have constructed five additional pathways (Additional file 1: Figure S1) related to the flowering [10]. When the cursor of the mouse is moved to the green nodes, all the gene IDs corresponding to the enzymes are displayed, and the expression of the genes for fruit development, kernel development, and different tissues can be observed conveniently. Each pathway is displayed on a separate web page, and detailed information of all the genes in this pathway is displayed in a tabular form at the bottom of the page (Fig. 4).

Gene family

Based on the HMM Pfam, we have summarized the protein domain families (gene families) and established tables for the ease of querying and viewing (Fig. 5). In total, 63,121 genes, including Longwangmao (21,012), Sungold (21,444), and F106 (20,665) are assigned to 3842 gene families.

Gene network

We have collected 90 RNA-seq samples for F106 (30), Sungold (22), and Longwangmao (38), including fruit, kernel, leaves, flower, and flower bud at different stages. The expression pattern of each gene is calculated and normalized to fragments per kilobase of transcript per million mapped fragments (FPKM). Based on the FPKM values, the co-expression network is constructed by the WGCNA[11] (version 1.0) package in R (weight value > 0.15) (Fig. 6).

Transcription factors

Transcription factors are important regulators of plant growth, development, and external stress. A total of 59 transcription factor (TF) families have been analyzed in detail at the whole-genome level of three apricot genomes. TFs are determined by HMMER [12] (version 3.0) and iTak [13]. The presence of the TF domain is further confirmed using SMART [14] and CDD [15]. Multiple sequences alignment are performed using ClustalX [16] (version 2.1), and phylogenetic trees are constructed using MEGA (version X) [17]. MEME Suite [18] is used to determine motifs in transcription factor protein sequences, where the width of the motif is 6–200, and the maximum number of motifs is 20. Syntenic blocks are inferred using MCScanX [19]. Gene structure, chromosome location, and collinearity are visualized using TBtools (version 1.089) [20]. Phylogenetic trees are aesthetically improved using iTol [21]. Additionally, the links are established with PlantTFDB [22] to provide users with valuable information.

Users can view the respective TF page or find more detailed information using the query function (Fig. 7a). The valuable information is collected through transcription factor analysis, including homology alignment (Fig. 7b), comparison among the three genomes (Fig. 7b, c), subfamily classification (Fig. 7c), domains (Fig. 7d), gene expression (Fig. 7e), chromosome location (Fig. 7f), and collinearity results (Fig. 7g).

Variation module

The single-nucleotide polymorphisms (SNPs) and insertions-deletions (Indels) of 306 accessions have been collected from our previous study, and minor allele frequencies over 0.05 of variants have been filtered out using plink [23] (version 1.9); all filtered nucleotide variants are annotated using ANNOVAR [24]. Annotation information of 8,838,420 (SNPs) and 1,650,013 (indels) is obtained. The information on the variation of each sample, including variation in positions and allele types, is sorted, and a comparative search tool is established (Fig. 8a), which can be searched by gene ID or locations; the information on variations are displayed in tabular form (Fig. 8b). Variations can also be browsed interactively through JBrowse [25] by clicking on the variation ID (Fig. 8c). Statistical information on variation is provided through pie charts and tables, which can be accessed by clicking the pie chart icon (Fig. 7c). In addition, The information about structural variations (SVs, ≥50 bp) was obtained through a comparison among the three genomes. In total, 2306 insertions (843, 721, and 742 for F106 vs. Sungold, F106 vs. Longwangmao, and Sungold vs. Longwangmao, respectively) and 1296 deletions (427, 409, and 460 for F106 vs. Sungold, F106 vs. Longwangmao, and Sungold vs. Longwangmao) were obtained, which contain 2888 genes.

Product module

Apricot has high nutritional and commercial value [2]. Apricot kernel is rich in protein, crude fat, calcium, phosphorus, and iron, and has good nutritional and health effects. The apricot fruit has a pleasant aroma and is rich in nutrients such as sugar, vitamin C, and carotenes. We have developed some products of apricot, including foods and cosmetics. The edible products of apricot include apricot wine, apricot kernel oil [26], apricot kernel tofu [27], apricot kernel noodles, and apricot kernel biscuits. The use of apricot in cosmetics includes facial masks, essential oils, shampoos, and shower gels. The detailed information of all products, such as product image, patent number, etc., are displayed in this module by clicking on the product name (Fig. 9).

Tools module

The BLAST program, developed by ViroBLAST [28], provides more BLAST options to obtain specific information from the gene, CDS, and protein databases of all reference genomes. Information on genomes (Sungold, Longwangmao, and F106) and the variations of 306 accessions (reference F106) were visualized by JBrowse [25]. Bud dormancy plays an important role in fruit yield. The chilling requirement (CR) is the temperature required for deciduous fruit trees to release their natural dormancy and is an important quantitative trait for measuring dormant release. We developed a tool for calculating chilling requirements (CHR) for the convenience of the researchers; the tool contains 0–7.2 °C, Utah, and dynamic models. The CHR tool is developed by using PyQt5 (https://github.com/Chilling-requirements/Chilling-software/releases), which also had more functions, such as, the custom model and the model of growing degree hours for heart requirements. Expression visualization [29] could help users quickly observe the expression patterns of the related genes. The results of the synteny analysis with three pairs of reference genomes (Sungold vs. F106, Sungold vs. Longwangmao, and F106 vs. Longwangmao) are visualized by SynVisio (https://github.com/kiranbandi/synvisio).

Utility and discussion

To provide more convenience for users, the query tools in each module are constructed. Users could be obtain information, including phenotype, gene annotation, transcription factors, metabolic pathways, co-expression network, variation, by query tools. The help module provides the detailed of user tutorial information. To make the AprGPD even better to understand, several demonstrations have been provided below.

Species and phenotype query

If users are interested in species information, they can click on the species number. For example, a user can click on No. 4 to retrieve information and images of P. holosericeae. The germplasm module addresses any query on phenotype data. Boxplots were used to show three levels of quantitative traits. The Pie-charts and frequency charts were used to illustrate the distribution of quantitative and qualitative traits, respectively. For example, users could click on the name of Gongfuoxing to show its image and location information, and also could obtain the overall information on kernel dry weight using frequency charts. All quantitative traits are divided into three grades, if users want to get samples with larger kernel dry weight, they could select the maximum level (≥ 0.39 g) to filter the samples.

Gene query

Abundant query functions could help users quickly find the information of related genes. For example, the information on PaF106G0500018564.01 needs to be obtained. First, the user inserts the gene ID in the annotation module and clicks on the search button. The expression level of the gene is showen by histogram, and the highest expression at the end of fruit development, and gradually decreased and then increased during kernel development were displayed. Second, the user inserts the gene ID in Pathway module and finds that it is not annotated to the metabolic pathway. Third, the user searches for this gene in the transcription factor module and finds that belongs to the subfamily of the WOX family. Fourth, the user searches for this gene in the Gene network module and sets the weight at 0.35, and two co-expression genes (WRKY, PaF106G0500019874.01 and PaF106G0600024291.01.T01) are showed. Fifth, the user searches and observes the information of variation in this gene using the comparative search tool of the variation module and finds that PaF106G0500018564.01 has 20 SNPs and 9 INDELs.

Transcription factor family query

The detailed analysis of transcription factors are of great use to researchers. An example of MIKC_MADS is illustrated. First, the user could be find the number of members in the MIKC_MADS family page. Second, comparative analysis of all MIKC_MADS family genes in three species were shown by using a phylogenetic tree (Additional file 1: Figure S2) and table, a total of 14 subfamilies are divided. Third, the distributions of conserved domains of MIKC_MADS family genes were obtained (Additional file 1: Figures S3–5). For example, SOC1 contains motif17, SVP contains motif5 and motif6. Fourth, AP3, SVP, AGL6, SOC1, AGL9, and AP1/FUL members have fragment duplication by chromosomal collinearity (Additional file 1: Figures S6–8).

The heatmap of related genes were displayed by expression visualization in tool (Additional file 1: Figures S9–11). For example, SVPs (PaF106G0100006115.01.T01, PaF106G0100006117.01.T02, PaF106G0100006114.01.T06, PaLWMG0100006313.01.T01, PaLWMG0100006315.01.T02, PaLWMG0100006312.01.T02) shows an increasing trend during kernel development, whereas, AGL9s (PaF106G0100002725.01.T01, PaF106G0300011661.01.T03, PaLWMG0100002950.01.T04) show a gradually decrease during kernel development.

Value and future directions

At present, AprGPD contains the phenotypic, genome, transcriptome data, and variation information, and a series of analysis base on these data, which provides more valuable information to aptivot research. We completed a detailed analysis of 58 transcription factor families from three genomes in AprGPD, which allows comparisons in various apricots and in plant species of Rosaceae. We developed a calculation tool of chilling requirements, which provids convenience for researchers. In the future, we will be further the new genome, transcriptome, and phenotype data of apricot, establish, and improve the multi-omics analysis platform.

Conclusions

In this study, the database of apricot genome and phenotype was established, which provides a database of comprehensive phenotype, genome, and transcriptome resources of apricots. AprGPD is composed of six modules, including genome, predicted genes and proteins, functional annotations and gene expression profiles. In addition, this database also providess various query, analysis, and visualization tools. Our AprGPD will become a active platform for researchers and breeders of apricot.

Availability of data and materials

AprGPD is freely available at the following address: http://apricotgpd.com. All functionalities of AprGPD have been tested in Google Chrome, 360, QQ, and Safari Browser.

References

Rehder A. Manual of cultivated trees and shrubs hardy in North America. Taxon. 1927;27(4):424.
Google Scholar
Kafkaletou M, Kalantzis I, Karantzi A, Christopoulos MV, Tsantili E. Phytochemical characterization in traditional and modern apricot (Prunus armeniaca L.) cultivars—nutritional value and its relation to origin. Sci Horticult. 2019;253:195–202.
Article CAS Google Scholar
Zhebentyayeva TN, Ledbetter C, Burgos L, Llácer G. Fruit breeding. In: Handbook of plant breeding. New York: Springer; 2012. p. 415–58.
Google Scholar
Zhang JT, Zhang J. Chinese fruit tree: apricot. Beijing: China Forestry Press; 2003. p. 18–26.
Google Scholar
Zhang QP, Liu WS, Ning L, Zhang YP, Ming X. Allelic variation of simple sequence repeats markers linked to PPV resistance in Chinese apricot. Hortic Sci. 2017;44(1):6–13.
Article CAS Google Scholar
Liu S, Decroocq S, Harte E, Tricon D, Decroocq V. Genetic diversity and population structure analyses in the Alpine plum (Prunus brigantina Vill.) confirm its affiliation to the Armeniaca section. Tree Genet Genomes. 2020;17(1):1–12.
Google Scholar
Wang YZ, Sun H, Li Y, Zhang J. Classification criteria of some quantitative characteristics of apricot germplasm resources. Chin Agric Sci Bull. 2008;24:147–51.
CAS Google Scholar
Zhou HT, Wang FH, Jiang ZL, Hao CH, Wang W, Liu TF. Studies on the classify standard for quantitative characters of sorghum DUS testing in Jilin province I. Measurement of single characters. Jilin Agric Sci. 2015;40(5):21–5.
Google Scholar
Minoru K, Susumu G, Yoko S, Miho F, Mao T. KEGG for integration and interpretation of large-scale molecular datasets. Nucleic Acids Res. 2012;40(D1):D109–14.
Article Google Scholar
Srikanth A, Schmid M. Regulation of flowering time:all roads lead to Rome. Cell Mol Life Sci. 2011;68(12):2013–37.
Article CAS Google Scholar
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinform. 2008;9(1):559.
Article Google Scholar
Eddy SR. A new generation of homology search tools based on probabilistic inference. Genome Inf. 2009;23:205–11.
Google Scholar
Yi Z, Chen J, Sun H, Rosli HG, Pombo MA, Zhang P, et al. iTAK: a program for genome-wide prediction and classification of plant transcription factors, transcriptional regulators, and protein kinases. Mol Plant. 2016;9(012):1667–70.
Article Google Scholar
Schultz J, Milpetz F, Bork P, Ponting CP. SMART: a simple modular architecture research tool: Identification of signaling domains. Proc Natl Acad Sci. 1998;95(11):5857–64.
Article CAS Google Scholar
Aron MB, Lu S, Anderson JB, Farideh C, Derbyshire MK, Carol DWS, et al. CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res. 2011;39(Database):D225–9.
Article Google Scholar
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, Mcwilliam H, et al. ClustalW and ClustalX version 2. Bioinformatics. 2007;23(21):2947–8.
Article CAS Google Scholar
Sudhir K, Glen S, Koichiro T. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33(7):1870–4.
Article Google Scholar
Bailey TL, Mikael B, Buske FA, Martin F, Grant CE, Luca C, et al. MEME Suite: tools for motif discovery and searching. Nucleic Acids Res. 2009;37(Web Server issue):W202–8.
Article CAS Google Scholar
Wang Y, Tang H, Debarry JD, Tan X, Li J, Wang X, Tae-Ho L, Jin H, Barry M, Guo H. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40(7):e49–e49.
Article CAS Google Scholar
Chen C, Chen H, Zhang Y, Thomas HR, Xia R. TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol Plant. 2020;13(8):1194–202.
Article CAS Google Scholar
Letunic I, Bork P. Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019;47(W1):W256–9.
Article CAS Google Scholar
Jin J, Zhang H, Kong L, Gao G, Luo J. PlantTFDB 3.0: a portal for the functional and evolutionary study of plant transcription factors. Nucleic Acids Res. 2014;42(D1):1182–7.
Article Google Scholar
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira M, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Human Genet. 2007;81(3):559–75.
Article CAS Google Scholar
Wang K, Mingyao L, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;16:e164.
Article Google Scholar
Skinner ME, Uzilov AV, Stein LD, Mungall CJ, Holmes IH. JBrowse: a next-generation genome browser. Genome Res. 2009;19(9):1630–8.
Article CAS Google Scholar
Wuyun T, Zhu XC, Zhu GP, Zhao H. Extraction and refining technology of Siberian apricot kernel oil (CN 105950277 B), China, Patent. June, 11, 2019.
Wuyun T, Jiang ZM, Zhu XC, Zhu XC, Zhao H. Production method of Siberian apricot kernel tofu ice cream (CN 105913934 B), China, Patent. June, 11, 2019.
Deng WJ, Nickle DN, Learn GH, Maust B, Mullins J. ViroBLAST: a stand-alone BLAST web server for flexible queries of multiple databases and user's datasets. Bioinformatics. 2007;23(17):2334–2336.
Article Google Scholar
Tal G, Alan O, Jonathan S, Carson S. heatmaply: an R package for creating interactive cluster heatmaps for online publishing. Bioinformatics. 2017;34(9): 1600–2.
Google Scholar

Download references

Acknowledgements

We thank Wang Chu, Ying Han, and Yujing Zhang, at the Non-timber Forest Research and Development Center, Chinese Academy of Forestry, Zhengzhou, China for help in data analysis. We thank Nextomics Biosciences, Wuhan, China for help in maintain the server.

Funding

This study was supported by the Fundamental Research Funds for the Central Non-profit Research Institution of Chinese Academy of Forestry (CAFYBB2020ZY003) and the National Key R&D Program of China (2019YFD1000601-4).

Author information

Authors and Affiliations

State Key Laboratory of Tree Genetics and Breeding, Non-Timber Forest Research and Development Center, Chinese Academy of Forestry, Zhengzhou, China
Chen Chen, Huimin Liu, Ningning Gou, Mengzhen Huang, Wanyu Xu, Xuchun Zhu, Mingyu Yin, Haikun Bai, Lin Wang & Ta-na Wuyun
Kernel-Apricot Engineering and Technology Research Center of State Forestry and Grassland Administration, Zhengzhou, China
Chen Chen, Huimin Liu, Ningning Gou, Mengzhen Huang, Wanyu Xu, Xuchun Zhu, Mingyu Yin, Haikun Bai, Lin Wang & Ta-na Wuyun
Key Laboratory of Non-Timber Forest Germplasm Enhancement and Utilization of National Forestry and Grassland Administration, Zhengzhou, China
Chen Chen, Huimin Liu, Ningning Gou, Mengzhen Huang, Wanyu Xu, Xuchun Zhu, Mingyu Yin, Haikun Bai, Lin Wang & Ta-na Wuyun

Authors

Chen Chen
View author publications
You can also search for this author in PubMed Google Scholar
Huimin Liu
View author publications
You can also search for this author in PubMed Google Scholar
Ningning Gou
View author publications
You can also search for this author in PubMed Google Scholar
Mengzhen Huang
View author publications
You can also search for this author in PubMed Google Scholar
Wanyu Xu
View author publications
You can also search for this author in PubMed Google Scholar
Xuchun Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Mingyu Yin
View author publications
You can also search for this author in PubMed Google Scholar
Haikun Bai
View author publications
You can also search for this author in PubMed Google Scholar
Lin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ta-na Wuyun
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

TW, LW, and CC designed the study. CC constructed the database, maintain the server, and wrote this article. HL, NG, MH, MY, WX, XZ, HB, and LW helped with collected the data and performed the analyses. TW and LW took the lead on revising the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Lin Wang or Ta-na Wuyun.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Distribution of nine species in China. Figure S1. Pathways involved in flowering time. Figure S2. Phylogenetic tree of MIKC_MADS family. Figure S3. Gene structure and conserved motifs of MIKC_MADS family in P. sibirica (F106). Figure S4. Gene structure and conserved motifs of MIKC_MADS family in P. armeniaca (Sungold). Figure S5. Gene structure and conserved motifs of MIKC_MADS family in P. armeniaca × P. sibirica (Longwangmao). Figure S6. Chromosomal collinear of MIKC_MADS family in P. sibirica (F106). Figure S7. Chromosomal collinear of MIKC_MADS family in P. armeniaca (Sungold). Figure S8. Chromosomal collinear of MIKC_MADS family in P. armeniaca × P. sibirica (Longwangmao). Figure S9. Expression of MIKC_MADS family in P. sibirica (F106). (a) Expression of fruit development. (b) Expression of kernel development. Figure S10. Expression of fruit development of MIKC_MADS family in P. armeniaca (Sungold). Figure S11. Expression of MIKC_MADS family in P. armeniaca × P. sibirica (Longwangmao). (a) Expression of fruit development. (b) Expression of kernel development.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Chen, C., Liu, H., Gou, N. et al. AprGPD: the apricot genomic and phenotypic database. Plant Methods 17, 98 (2021). https://doi.org/10.1186/s13007-021-00797-4

Download citation

Received: 28 May 2021
Accepted: 08 September 2021
Published: 23 September 2021
DOI: https://doi.org/10.1186/s13007-021-00797-4

AprGPD: the apricot genomic and phenotypic database

Abstract

Background

Results

Conclusion

Background

Construction and content

Species module

Germplasm module

Genome module

Gene annotation

Metabolic pathways

Gene family

Gene network

Transcription factors

Variation module

Product module

Tools module

Utility and discussion

Species and phenotype query

Gene query

Transcription factor family query

Value and future directions

Conclusions

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1: Table S1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Plant Methods

Contact us