J Plant Biotechnol 2018; 45(1): 45-54
Published online March 31, 2018
https://doi.org/10.5010/JPB.2018.45.1.045
© The Korean Society of Plant Biotechnology
Correspondence to : e-mail: whkang@kangwon.ac.kr
This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
This research was conducted to study the gene expression of coffee (
Keywords
The study from the Royal Botanic Garden implies that coffee is the second most widely traded commodity in world market exchanges (Davis et al. 2012). Among the coffee species, arabica coffee accommodates 70% of total worldwide production, with an estimated production of 8.5 million tons in 2015 (ICO 2015). Recently, climatic change has played a crucial role in the reduction and seasonal variation in coffee bean yields worldwide (Camargo 2010). The relationship between agricultural production and climatic parameters is complicated due to environmental factors influencing the growth and development of the coffee plant in various ways during the phenological stage (Camargo 2010).
Salinity is one of the important abiotic stress factors that limit plant growth and crop production (Shrivastava and Kumar 2015). The excessive accumulation of minerals (Na and Cl) in the plant shoot can lead to salt stress effects in various ways, such as ionic toxicity and imbalance nutrient uptake by plants and osmotic stress, thus, adversely affecting the growth and development of plants (Munns 2006). The development of stress-tolerant crops is vital to combat the stress effects primarily in areas where the agricultural lands are exposed to such stress conditions (Nakashima et al. 2012).
Recently, the advancement of plant genomic studies and techniques of molecular biology have played an important role in understanding the contributions of gene responses as the plants are exposed to a particular environmental stress factor ( Joseph et al. 2011). Presently, RNA sequence analysis has been widely used to study the gene expression and profiling transcripts at the whole genome level in different organisms, which are used as a model and non-model organisms (Annadurai et al. 2012). By using the
Generally, transcriptomic analysis technology facilitates the identification of transcripts that are involved in stress response in a given organism and is used to analyze gene expression based on the absolute abundance of transcripts (Mortazavi et al. 2008). Transcriptome sequencing technology can produce a lot of data set to generate transcriptome map, quantifying gene expression, determining metabolic pathways, and discovering unidentified genes. Transcriptome sequence analysis has been used to examine molecular mechanisms of plants in response to different environmental stress factors for various crops such as maize (Lu et al. 2013), sunflower (Livaja et al. 2013), grape (Liu et al. 2012), and sorghum (Mizuno et al. 2012). There is limited information related to the gene responses of coffee plants to abiotic stress factor, particularly salt stress. Therefore, this experiment was conducted to estimate transcripts and study the gene expression of coffee seedlings under normal water and saline water irrigation conditions.
In this experiment, 6 months old coffee (
Total RNA was extracted for three biological replicates from the leaves of the coffee seedlings, which were irrigated with tap water (control), and 5% diluted deep sea water (DSW) following the Concert™ (Invitrogen) method. The tissue was ground in liquid N2, and 0.5 mL of Concert™ was added to 100 mg of the ground tissue. The samples were homogenized and centrifuged for 2 minutes at 12,000 rpm at room temperature. Afterward, 100 µL of 5 M NaCl and 300 µL of chloroform were added to the supernatant, and the solution was exhaustively homogenized by inversion. The samples were centrifuged again for 10 minutes at 12,000 rpm at +4 °C, and the supernatant was transferred to a new tube. An equal volume of isopropanol was added, and the tubes were centrifuged as described before; the supernatant was then discarded. 1 mL of 75% ethanol was added to wash the pellet, and the tubes were then centrifuged for 1 minute at 12,000 rpm in room temperature, and the supernatant was again discarded. The tubes containing the precipitated RNA were left at room temperature to dry completely. The pellet was diluted with RNase free water. There were three biological and two technical replicates of each treatment, from which equal amounts of RNA were pooled for cDNA synthesis. The cDNA preparation was done according to the Illumina TruSeq Stranded protocol. The library was sequenced using the Illumina HiSeq− 2000 platform. To obtain high-quality clean reads for de novo assembly, raw reads from mRNA-seq were filtered by discarding reads with adapter contamination and regions of low quality reads. The processed reads from both treatments were used for further analysis.
The sequences from the libraries were compared to
RNA sequence analysis was used to study the gene expressions of coffee seedlings under tap water (control) and DSW (deep sea water) irrigation conditions. cDNA libraries were prepared from the coffee seedling leaves and subjected to RNA sequence analysis using the Illumina HiSeq 2000 platform. A total of 127.3 million and 143.6 million raw reads were obtained from the control and DSW (5%) treatment conditions, respectively (Table 1). The trimmed (clean) reads were 125.0 million in the control treatment, and 140.5 million reads in DSW irrigated coffee seedling leaves sample. Guanine-cytosine (GC) content was estimated, and it accounted for 46% of the total read bases in control and 47% in diluted DSW (5%) treatment (Table 1). The Phred quality score (%) were checked (Q20 and Q30) to assess the sequence quality. Phred quality scores developed and used to identify repeated sequences and remove low- quality sequences, and estimate the sequence quality and quantification of an accurate consensus sequence.
Table 1 Raw and trimmed data from both treatments
Reads | Treatments | Total read bases | Total reads | GC (%) | Q20 (%) | Q30 (%) |
---|---|---|---|---|---|---|
Raw | Control | 12,863,176,853 | 127,358,187 | 46 | 99 | 97 |
DSW (5%) | 14,505,329,591 | 143,617,125 | 47 | 99 | 97 | |
Clean | Control | 12,550,884,035 | 125,080,102 | 46 | 99 | 98 |
DSW (5%) | 14,096,701,745 | 140,506,411 | 47 | 99 | 98 |
Note; Total read bases: total reads X read length
GC (%): GC (Guanine-cytosine) content
Q20 (%): phred quality score20, 99% certainty (1/100 chance of an incorrect base call)
Q30 (%): phred quality score30, 99.9% certainty (1/1,000 chance of an incorrect base call)
According to
Table 2 Summary of Illumina transcriptome reads mapped to the reference genes
Reads mapping | Reads number (%) | |
---|---|---|
Control | DSW (5%) | |
Processed reads | 62,540,051 | 70,253,206 |
Total mapped reads | 47,537,918 (76.02) | 50,999,074 (72.68) |
Unique match | 46,136,502 (73.7) | 48,830,704 (69.51) |
Multiple position match | 1,401,416 (2.95) | 2,168,370 (4.23) |
Total unmapped reads | 15,002,133 (23.8) | 19,254,132 (27.32) |
Overall mapping ratio | 76% | 72.10% |
Note: DSW (Deep Sea Water) was used as salt treatment
Table 3 List of identified differentially expressed transcription factor genes in coffee seedlings under salt stress condition
Locus_Tag | Gene Description | Fold change | E-Value | |
---|---|---|---|---|
Up-regulated | Cc00_g13890 | Double WRKY type transfactor | 2.1 | 2.00E-124 |
Cc10_g04710 | Ethylene-responsive transcription factor ERF011 | 2.0 | 9.00E-49 | |
Cc02_g14240 | Pathogenesis-related genes transcriptional activator PTI5 | 2.4 | 1.00E-25 | |
Cc04_g05080 | Probable WRKY transcription factor 40 | 2.2 | 7.00E-90 | |
Cc08_g11060 | Putative Probable WRKY transcription factor 50 | 2.4 | 1.00E-43 | |
Cc06_g01240 | Trihelix transcription factor GT-3a | 2.1 | 2.00E-61 | |
Down-regulated | Cc05_g16570 | Myb family transcription factor APL | -2.5 | 9E-99 |
Cc02_g10740 | Putative transcription elongation factor SPT5 homolog 1 | -2.3 | 0 | |
Cc06_g21410 | Putative Transcription elongation factor SPT6 | -2.0 | 0 | |
Cc02_g17440 | Putative Transcription factor bHLH63 | -2.4 | 3E-52 | |
Cc07_g03240 | Transcription factor bHLH135 | -2.4 | 2E-23 |
The distribution of genes coverage in the leaves of the coffee seedling. The identified gene coverage is the percentage of a gene that is covered by reads and defined as the ratio of the number of bases in a gene covered by uniquely mapped reads to the number of total bases in the gene. The pie chart shows the percentage of the different gene coverage listing on the left of the pie chart (A. Control treatment. B. DSW treatment)
A total of 19,581 genes were aligned to the reference sequences avaliable in coffee genome hub (
Gene Ontology (GO) analysis result
The biological process category was represented by a large number of genes. In the biological process category, metabolic process, response to stimulus and biological regulation were the most abundant processes and accounts, 25.1%, 14.8% and 12.4% respectively (Fig. 3). The other processes were single organism process (8.5%), cellular process (8.2%), localization (5.9%), developmental process (3.4%), cellular component organization (3%), unclassified (15.3%) and others (3.36%), (Fig. 3). Within the cellular component category, a large number of genes were involved in cell parts (44.6%), organelles (24.7) and membrane (7.3%) components. Binding and catalytic activities were the most abundant groups within the molecular function category and estimated 38.6% and 33.6%, respectively (Fig. 3).
The percentage of genes involved in different GO sub- categories
The number of sequences in each read counts were used to determine the differentially expressed genes between the libraries of comparison samples using the DEseq packages (Anders and Huber 2010). A total of 611 differentially expressed genes between the salt treated and control treatments were identified. Among them 336 genes were showed up-regulation and 275 genes were showed down-regulation. We identified 60 significantly (p < 0.05) differentially expressed genes. Of the significantly differentially expressed genes, 16 were down-regulated and 44 genes were up-regulated (Tables 4 and 5, respectively). We also found 15 significantly differentially expressed hypothetical genes, 3 that were down-regulated and the rest 12 genes were up-regulated (Tables 4 and 5).
Table 4 List of significantly (p < 0.05) differentially expressed down regulated genes under salt stress condition
Accession ID | Locus_Tag | Description | Fold change |
---|---|---|---|
ID77370 | Cc01_g02340 | Hypothetical protein | -9.8 |
ID216235 | Cc04_g06680 | Putative Probable S-adenosylmethionine | -6.4 |
ID238605 | Cc05_g07560 | Hypothetical protein | -5.4 |
ID198486 | Cc03_g10470 | Putative disease resistance protein RGA4 | -4.3 |
ID133785 | Cc02_g20110 | Amino acid permease 6 | -2.9 |
ID221155 | Cc04_g10190 | Hypothetical protein | -2.7 |
ID251461 | Cc05_g16570 | Myb family transcription factor APL | -2.5 |
ID208311 | Cc04_g01570 | Putative NADH dehydrogenase | -2.5 |
ID412256 | Cc00_g11240 | Putative Protein of unknown function | -2.4 |
ID289967 | Cc07_g03240 | Transcription factor bHLH135 | -2.4 |
ID296423 | Cc07_g07990 | Putative unknown protein | -2.4 |
ID228137 | Cc04_g15750 | Probable peptide/nitrate transporter | -2.2 |
ID220611 | Cc04_g09840 | Putative NAC domain-containing protein 68 | -2.2 |
ID321865 | Cc08_g05640 | ABC transporter G family member 14 | -2.1 |
ID378315 | Cc11_g02080 | Acetylornithine aminotransferase, chloroplastic/mitochondrial | -2.1 |
ID349824 | Cc09_g08740 | Auxin response factor 6 | -2.1 |
Table 5 List of significantly (p < 0.05) differentially expressed upregulated genes under salt stress condition
Accession ID | Locus_Tag | Description | Fold change |
---|---|---|---|
ID196801 | Cc03_g08920 | Hypothetical protein | 160.3 |
ID197373 | Cc03_g09460 | Hypothetical protein | 27.5 |
ID262669 | Cc06_g07480 | Hypothetical protein | 12.8 |
ID197396 | Cc03_g09490 | Hypothetical protein | 8.7 |
ID128885 | Cc02_g16600 | Snakin-2 | 5 |
ID312625 | Cc07_g19850 | Bifunctional monodehydroascorbate reductase | 4.4 |
ID217289 | Cc04_g07360 | Putative Protein aspartic protease in guard cell 1 | 4.3 |
ID412790 | Cc00_g11630 | Probable pre-mRNA-splicing factor | 4.1 |
ID94194 | Cc01_g14620 | Putative Probable LRR receptor | 3.9 |
ID140324 | Cc02_g24340 | Hypothetical protein | 3.8 |
ID238942 | Cc05_g07810 | Glutaredoxin-C9 | 3.6 |
ID130154 | Cc02_g17510 | Hypothetical protein | 3.3 |
ID241357 | Cc05_g09770 | Putative uncharacterized protein | 3.2 |
ID432820 | Cc00_g30460 | Putative RING/U-box superfamily protein (PUB) | 3.2 |
ID143867 | Cc02_g26780 | Putative uncharacterized protein | 3.1 |
ID425123 | Cc00_g22460 | COBRA-like protein 1 | 3.1 |
ID161846 | Cc02_g39350 | Cytochrome b561/ferric reductase transmembrane | 3 |
ID349096 | Cc09_g08190 | Putative UDP-glycosyltransferase 85A2 | 3 |
ID217284 | Cc04_g07350 | Putative Protein aspartic protease in guard cell 1 | 2.9 |
ID220977 | Cc04_g10090 | Putative unknown protein | 2.9 |
ID213561 | Cc04_g05040 | Putative Bifunctional dihydroflavonol 4-reductase | 2.9 |
ID216639 | Cc04_g06970 | Calmodulin binding protein 60 | 2.7 |
ID111904 | Cc02_g04570 | Hexose carrier protein HEX6 | 2.7 |
ID421434 | Cc00_g19080 | Hypothetical protein | 2.6 |
ID329413 | Cc08_g11350 | Hypothetical protein | 2.6 |
ID241348 | Cc05_g09760 | Putative uncharacterized protein | 2.6 |
ID271405 | Cc06_g13370 | 4-coumarate--CoA ligase 1 | 2.4 |
ID328966 | Cc08_g11060 | Putative Probable WRKY transcription factor 50 | 2.4 |
ID324636 | Cc08_g07970 | Hypothetical protein | 2.2 |
ID371323 | Cc10_g13630 | Putative Probable calcium-binding protein CML44 | 2.2 |
ID271443 | Cc06_g13410 | Xyloglucan endotransglucosylase | 2.2 |
ID133388 | Cc02_g19820 | Putative Uncharacterized protein | 2.2 |
ID421361 | Cc00_g19040 | Hypothetical protein | 2.2 |
ID431183 | Cc00_g28570 | Putative Urease accessory protein | 2.2 |
ID198445 | Cc03_g10440 | Cytokinin riboside 5’-monophosphate phosphoribohydrolase | 2.1 |
ID213871 | Cc04_g05230 | Hydroxycinnamoyl-Coenzyme A | 2.1 |
ID415458 | Cc00_g13890 | Double WRKY type transfactor | 2.1 |
ID404570 | Cc00_g05030 | Putative Probably inactive leucine-rich repeat receptor- | 2.1 |
ID127357 | Cc02_g15570 | Putative uncharacterized protein | 2.1 |
ID338558 | Cc09_g00710 | Auxin-responsive family protein | 2.1 |
ID214818 | Cc04_g05850 | Protein Transporter, Pam16 | 2.1 |
ID244479 | Cc05_g11900 | Putative Early nodulin-like protein 2 | 2.1 |
ID313389 | Cc07_g20430 | Hypothetical protein | 2 |
ID156312 | Cc02_g35670 | Hypothetical protein | 2 |
The gene ontology enrichment analysis was performed for significantly expressed genes. Sixty differentially expressed genes were classified into four categories. These groups are biological processes (40.83%), cellular components (20.76%), molecular functions (25.08%), and no-hit (13.33%) (Fig. 4). Among the significantly differentially expressed genes, a large number of genes were involved in biological processes. The up and down-regulated genes were further classified into several functional categories. Within the biological category, metabolic process, cellular process, single-organism process, response to a stimulus, biological regulation and localization comprised relatively a large number of up-regulated genes (20, 18, 17, 15, and 12 genes, respectively) (Fig. 5A). According to figure 5A, the down-regulated genes were relatively more involved in the metabolic process, cellular process, single-organism process, response to a stimulus, developmental process and biological regulations (7, 7, 7, 6, 5 and 5). Within the cellular component group, the number of up-regulated and down-regulated genes were classified as follows respectively; cell part (30 and 8), organelle (20 and 6), membrane part (10 and 1), membrane (9 and 2), extracellular region (8 and 0), cell (6 and 1), organelle part (3 and 1), and cell rejection (1 and 1) (Fig. 5B). Within the molecular function group, the number of up and down regulated genes were further classified into different activities accordingly such as binding (23 and 6), catalytic activity (11 and 2), transcription factor activity (3 and 3), transporter activity (3 and 2), electron carrier activity (2 and 0), molecular function regulator (1 and 0), molecular transducer activity (1 and 0), enzyme regulator activity (1 and 0), and unclassified (5 and 4) (Fig. 5C).
Gene ontology (GO) enrichment analysis performed for significantly differentially expressed genes in salt-treated coffee seedlings
Significantly (p < 0.05) differentially expressed genes in salt-treated coffee seedlings were grouped in different sub-categories (A, Biological process; B, Cellular component; C, Molecular function)
Abiotic stress is one of the serious constraints that limits agricultural productions and caused severe yield reduction, such as salinity and drought (Bray et al. 2000). However several plants have developed various mechanisms to tolerate these effects (Munns 2002). A previously conducted research has indicated that the transcription factor genes are expected to have a crucial role in regulating gene and a group of two or more genes (Nakashima et al. 2009). In the present study a total of 11 differential expressed transcription factor genes were identified (Table 3). Among differentially expressed transcription factor genes, 5 genes were showed down-regulation, and the other 6 genes were showed upregulation in salt treated coffee seedlings compared to control treatment (Table 3).
The products of several differentially expressed genes have been known for protecting plant cells from injury by producing various enzymes for the synthesis of osmolytes and enzymes to avoid reactive oxygen species and dehydrins (Bartels et al. 2005). Transcription factor genes have been widely involved in regulating the productions of functional proteins, which have a key role in plant defense mechanism (Rahaie et al. 2010 and Singh et al. 2002). Within the up regulated transcription factor genes, a significantly differential expressed WRKY genes were identified in salt-stressed coffee seedlings (Table. 3). These genes are Cc08_g11060 (Putative Probable WRKY transcription factor 50) and Cc00_g13890 (Double WRKY type transfactor). The WRKY genes are frequently reported to be involved in various stress responses. In salt- stressed roots of cotton plants, several WRKY genes showed a significant expression, such as WRKY6, WRKY33, WRKY40, and WRKY53 (Yao et al. 2011). The Cc06_g01240, (Trihelix transcription factor GT-3a) gene showed overexpression in salt-stressed coffee seedlings (Table 3). Trihelix transcription factors gene play an essential role in controlling the developmental process and response to abiotic and biotic stress factors (Wang et al. 2016). The GT-1 clade, GT-3a, and GT-3b have been shown to respond to salt stress in Arabidopsis (Park et al. 2004). Ethylene is an important stress hormone because its synthesis is induced under different oxidative environments. In the present study, the ethylene-responsive transcription factor ERF011 gene (Cc10_g04710) was significantly expressed in salt stressed coffee seedlings and it was up regulated (Table 3). Down regulated MYB family transcription factor APL gene (Cc05_g16570) was found in salt stressed coffee seedlings. Previously published studies indicated that MYB proteins are involved in many significant physiological and biochemical processes, including the regulation of primary and secondary metabolism, the control of cell development and the cell cycle, the participation in defense and response to various biotic and abiotic stresses, and hormone synthesis and signal transduction (Dubos et al. 2010 and Zhang et al. 2011). The bHLH superfamily is the second largest TF family in plants (Feller et al., 2011). Results from previous studies showed, bHLH-coding genes have suggested that they are involved in regulating a diverse array of biological and biochemical processes, such as light signaling (Roig-Villanova et al. 2007 and Leivar et al. 2008), and abiotic stress responses (Chinnusamy et al. 2003 and Kiribuchi et al. 2004). Under salt stressed coffee seedlings, down regulated bHLH transcription factors genes existed (Table 3). These genes are putative transcription factor bHLH63 (Cc05_g16570) and transcription factor bHLH135 (Cc07_g03240). According to Mao et al. (2017), under salt stress condition some special bHLH TFs are activated and bind to the promoter of the key genes involved in various signaling pathways and regulate the stress tolerance of plants by regulating the transcription level of these target genes. In this experiment, two down regulated STP transcription factor genes were significantly expressed under salt stress condition. These genes are putative transcription elongation factor SPT5 homolog 1 (Cc02_g10740) and putative transcription elongation factor SPT6 (Cc06_g21410) (Table 3).
The genes that were previously studied and known to be involved in response to salt stress and existed in our study are described below. The Cc04_g06970 (Calmodulin binding protein 60) gene expression was higher in diluted deep sea water (salt water) irrigated treatments (Table 5). Recent research studies have indicated that calmodulin binding protein 60 (CBP60) family associated in response to both biotic and abiotic stresses (Wan et al. 2012). Some members of calmodulin binding protein (for example, Q8H6T7) are identified to be involved in plant defense mechanisms against stress conditions (Ali et al. 2003). The researcher also found a significantly expressed up-regulated auxin-responsive protein coding gene (Cc09_ g00710) in salt stress condition (Table 5). Subsequently, significantly expressed down regulated auxin response factor gene (Cc09_g08740) was identified. The response of plants to various environmental stress factors at the molecular level associated with the expression of many genes involved in different pathways. Plant hormones have been associated with several abiotic and biotic stress factors (abscisic acid, ethylene, salicylic acid and jasmonic acid). Some recent research suggests that auxin is also linked to abiotic and biotic stress signaling pathways (Wang et al. 2003). The result of our experiment is in line with the finding of Jain and Khurana (2009) who reported that during various abiotic stress conditions, several auxin- responsive genes showed differential expression, which indicated a crosstalk between auxin and abiotic stress signaling. The Cc00_g22460 (COBRA-like protein 1), gene was significantly expressed in salt stressed coffee seedlings. The cell wall plays important functions in establishing the morphology of the plant cell, defense response to biotic and abiotic stresses, and mechanical properties of organs. The COBRA gene encodes a putative glycosylphosphatidylinositol (GPI)-anchored protein that controls the ability to change cellulose deposition and determine cell development in the plant cell (Gao et al. 2013).
Plants use different mechanisms to tolerate salinity stress. Among them, accumulation of lignin or modification of the monomeric composition of lignin in the cell wall is one of the major mechanisms (Neves et al. 2010). The expression of Cc04_g05230 (Hydroxycinnamoyl-Coenzyme A) gene was significant in salt treated coffee seedlings (Table 5). The accumulation of hydroxycinnamoyl-CoA, shikimate hydroxycinnamoyl transferase, acid peroxidase, and cysteine protein, is associated with lignification and was induced by salt stress in xylem sap of
A significantly differentially expressed two aspartic protease coding genes (putative aspartic protease in guard cell 1) were identified. The expression of putative aspartic protease coding genes (Cc04_g07350 and Cc04_g07360) were upregulated in salt stressed coffee seedlings (Table 5). Some research studies indicated that genes encoding plant aspartic proteases have been identified from different plant species (Mutlu and Gal 1999 and Murakami et al. 2000). Several studies have reported the functions of aspartic proteases in different physiological processes during plant development such as seed germination (Belozersky et al. 1989 and Dunaevsky et al. 1989), leaf senescence (Kato et al. 2004), the immunity response (Xia, et al. 2004), cell death (Ge, et al., 2005) and reproduction (Chen et al. 2008), little is known that aspartic proteases involving in abiotic stress responses (Yao et al. 2012). The result from this experiment also can be an additional supporting information regarding the involvement of aspartic protease gene in response to salt stress.
Up regulated putative RING/U-box superfamily protein (PUB) coding gene was identified in salt treated coffee seedlings and its expression was significant (p < 0.05) (Table 5). This result is similar to the finding of Banzai et al. (2002), who reported that a study in mangrove (
Coffee is one of the most important commercial crops worldwide. Currently, there are a lot of constraints that decline its production including biotic and abiotic factors. From this research, we can suggest there are some genes that involve in different abiotic stress factor including salt stress. Detail investigations of genomic study are crucial to figuring out salt-responsive genes in coffee. The data generated in this study will help in understanding the response of coffee seedlings at the genomic level associated with abiotic stresses in general, salt stress in particular. This study will also provide resources for functional genomic studies.
The study was supported by 2015 research grant from Kangwon National University (No. 520150118).
J Plant Biotechnol 2018; 45(1): 45-54
Published online March 31, 2018 https://doi.org/10.5010/JPB.2018.45.1.045
Copyright © The Korean Society of Plant Biotechnology.
Mesfin Haile, and Won Hee Kang
Department of Horticulture and Bio-system Engineering, Kangwon National University, Chuncheon 24341, Korea
Correspondence to:e-mail: whkang@kangwon.ac.kr
This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
This research was conducted to study the gene expression of coffee (
Keywords:
The study from the Royal Botanic Garden implies that coffee is the second most widely traded commodity in world market exchanges (Davis et al. 2012). Among the coffee species, arabica coffee accommodates 70% of total worldwide production, with an estimated production of 8.5 million tons in 2015 (ICO 2015). Recently, climatic change has played a crucial role in the reduction and seasonal variation in coffee bean yields worldwide (Camargo 2010). The relationship between agricultural production and climatic parameters is complicated due to environmental factors influencing the growth and development of the coffee plant in various ways during the phenological stage (Camargo 2010).
Salinity is one of the important abiotic stress factors that limit plant growth and crop production (Shrivastava and Kumar 2015). The excessive accumulation of minerals (Na and Cl) in the plant shoot can lead to salt stress effects in various ways, such as ionic toxicity and imbalance nutrient uptake by plants and osmotic stress, thus, adversely affecting the growth and development of plants (Munns 2006). The development of stress-tolerant crops is vital to combat the stress effects primarily in areas where the agricultural lands are exposed to such stress conditions (Nakashima et al. 2012).
Recently, the advancement of plant genomic studies and techniques of molecular biology have played an important role in understanding the contributions of gene responses as the plants are exposed to a particular environmental stress factor ( Joseph et al. 2011). Presently, RNA sequence analysis has been widely used to study the gene expression and profiling transcripts at the whole genome level in different organisms, which are used as a model and non-model organisms (Annadurai et al. 2012). By using the
Generally, transcriptomic analysis technology facilitates the identification of transcripts that are involved in stress response in a given organism and is used to analyze gene expression based on the absolute abundance of transcripts (Mortazavi et al. 2008). Transcriptome sequencing technology can produce a lot of data set to generate transcriptome map, quantifying gene expression, determining metabolic pathways, and discovering unidentified genes. Transcriptome sequence analysis has been used to examine molecular mechanisms of plants in response to different environmental stress factors for various crops such as maize (Lu et al. 2013), sunflower (Livaja et al. 2013), grape (Liu et al. 2012), and sorghum (Mizuno et al. 2012). There is limited information related to the gene responses of coffee plants to abiotic stress factor, particularly salt stress. Therefore, this experiment was conducted to estimate transcripts and study the gene expression of coffee seedlings under normal water and saline water irrigation conditions.
In this experiment, 6 months old coffee (
Total RNA was extracted for three biological replicates from the leaves of the coffee seedlings, which were irrigated with tap water (control), and 5% diluted deep sea water (DSW) following the Concert™ (Invitrogen) method. The tissue was ground in liquid N2, and 0.5 mL of Concert™ was added to 100 mg of the ground tissue. The samples were homogenized and centrifuged for 2 minutes at 12,000 rpm at room temperature. Afterward, 100 µL of 5 M NaCl and 300 µL of chloroform were added to the supernatant, and the solution was exhaustively homogenized by inversion. The samples were centrifuged again for 10 minutes at 12,000 rpm at +4 °C, and the supernatant was transferred to a new tube. An equal volume of isopropanol was added, and the tubes were centrifuged as described before; the supernatant was then discarded. 1 mL of 75% ethanol was added to wash the pellet, and the tubes were then centrifuged for 1 minute at 12,000 rpm in room temperature, and the supernatant was again discarded. The tubes containing the precipitated RNA were left at room temperature to dry completely. The pellet was diluted with RNase free water. There were three biological and two technical replicates of each treatment, from which equal amounts of RNA were pooled for cDNA synthesis. The cDNA preparation was done according to the Illumina TruSeq Stranded protocol. The library was sequenced using the Illumina HiSeq− 2000 platform. To obtain high-quality clean reads for de novo assembly, raw reads from mRNA-seq were filtered by discarding reads with adapter contamination and regions of low quality reads. The processed reads from both treatments were used for further analysis.
The sequences from the libraries were compared to
RNA sequence analysis was used to study the gene expressions of coffee seedlings under tap water (control) and DSW (deep sea water) irrigation conditions. cDNA libraries were prepared from the coffee seedling leaves and subjected to RNA sequence analysis using the Illumina HiSeq 2000 platform. A total of 127.3 million and 143.6 million raw reads were obtained from the control and DSW (5%) treatment conditions, respectively (Table 1). The trimmed (clean) reads were 125.0 million in the control treatment, and 140.5 million reads in DSW irrigated coffee seedling leaves sample. Guanine-cytosine (GC) content was estimated, and it accounted for 46% of the total read bases in control and 47% in diluted DSW (5%) treatment (Table 1). The Phred quality score (%) were checked (Q20 and Q30) to assess the sequence quality. Phred quality scores developed and used to identify repeated sequences and remove low- quality sequences, and estimate the sequence quality and quantification of an accurate consensus sequence.
Table 1 . Raw and trimmed data from both treatments.
Reads | Treatments | Total read bases | Total reads | GC (%) | Q20 (%) | Q30 (%) |
---|---|---|---|---|---|---|
Raw | Control | 12,863,176,853 | 127,358,187 | 46 | 99 | 97 |
DSW (5%) | 14,505,329,591 | 143,617,125 | 47 | 99 | 97 | |
Clean | Control | 12,550,884,035 | 125,080,102 | 46 | 99 | 98 |
DSW (5%) | 14,096,701,745 | 140,506,411 | 47 | 99 | 98 |
Note; Total read bases: total reads X read length.
GC (%): GC (Guanine-cytosine) content.
Q20 (%): phred quality score20, 99% certainty (1/100 chance of an incorrect base call).
Q30 (%): phred quality score30, 99.9% certainty (1/1,000 chance of an incorrect base call).
According to
Table 2 . Summary of Illumina transcriptome reads mapped to the reference genes.
Reads mapping | Reads number (%) | |
---|---|---|
Control | DSW (5%) | |
Processed reads | 62,540,051 | 70,253,206 |
Total mapped reads | 47,537,918 (76.02) | 50,999,074 (72.68) |
Unique match | 46,136,502 (73.7) | 48,830,704 (69.51) |
Multiple position match | 1,401,416 (2.95) | 2,168,370 (4.23) |
Total unmapped reads | 15,002,133 (23.8) | 19,254,132 (27.32) |
Overall mapping ratio | 76% | 72.10% |
Note: DSW (Deep Sea Water) was used as salt treatment.
Table 3 . List of identified differentially expressed transcription factor genes in coffee seedlings under salt stress condition.
Locus_Tag | Gene Description | Fold change | E-Value | |
---|---|---|---|---|
Up-regulated | Cc00_g13890 | Double WRKY type transfactor | 2.1 | 2.00E-124 |
Cc10_g04710 | Ethylene-responsive transcription factor ERF011 | 2.0 | 9.00E-49 | |
Cc02_g14240 | Pathogenesis-related genes transcriptional activator PTI5 | 2.4 | 1.00E-25 | |
Cc04_g05080 | Probable WRKY transcription factor 40 | 2.2 | 7.00E-90 | |
Cc08_g11060 | Putative Probable WRKY transcription factor 50 | 2.4 | 1.00E-43 | |
Cc06_g01240 | Trihelix transcription factor GT-3a | 2.1 | 2.00E-61 | |
Down-regulated | Cc05_g16570 | Myb family transcription factor APL | -2.5 | 9E-99 |
Cc02_g10740 | Putative transcription elongation factor SPT5 homolog 1 | -2.3 | 0 | |
Cc06_g21410 | Putative Transcription elongation factor SPT6 | -2.0 | 0 | |
Cc02_g17440 | Putative Transcription factor bHLH63 | -2.4 | 3E-52 | |
Cc07_g03240 | Transcription factor bHLH135 | -2.4 | 2E-23 |
The distribution of genes coverage in the leaves of the coffee seedling. The identified gene coverage is the percentage of a gene that is covered by reads and defined as the ratio of the number of bases in a gene covered by uniquely mapped reads to the number of total bases in the gene. The pie chart shows the percentage of the different gene coverage listing on the left of the pie chart (A. Control treatment. B. DSW treatment)
A total of 19,581 genes were aligned to the reference sequences avaliable in coffee genome hub (
Gene Ontology (GO) analysis result
The biological process category was represented by a large number of genes. In the biological process category, metabolic process, response to stimulus and biological regulation were the most abundant processes and accounts, 25.1%, 14.8% and 12.4% respectively (Fig. 3). The other processes were single organism process (8.5%), cellular process (8.2%), localization (5.9%), developmental process (3.4%), cellular component organization (3%), unclassified (15.3%) and others (3.36%), (Fig. 3). Within the cellular component category, a large number of genes were involved in cell parts (44.6%), organelles (24.7) and membrane (7.3%) components. Binding and catalytic activities were the most abundant groups within the molecular function category and estimated 38.6% and 33.6%, respectively (Fig. 3).
The percentage of genes involved in different GO sub- categories
The number of sequences in each read counts were used to determine the differentially expressed genes between the libraries of comparison samples using the DEseq packages (Anders and Huber 2010). A total of 611 differentially expressed genes between the salt treated and control treatments were identified. Among them 336 genes were showed up-regulation and 275 genes were showed down-regulation. We identified 60 significantly (p < 0.05) differentially expressed genes. Of the significantly differentially expressed genes, 16 were down-regulated and 44 genes were up-regulated (Tables 4 and 5, respectively). We also found 15 significantly differentially expressed hypothetical genes, 3 that were down-regulated and the rest 12 genes were up-regulated (Tables 4 and 5).
Table 4 . List of significantly (p < 0.05) differentially expressed down regulated genes under salt stress condition.
Accession ID | Locus_Tag | Description | Fold change |
---|---|---|---|
ID77370 | Cc01_g02340 | Hypothetical protein | -9.8 |
ID216235 | Cc04_g06680 | Putative Probable S-adenosylmethionine | -6.4 |
ID238605 | Cc05_g07560 | Hypothetical protein | -5.4 |
ID198486 | Cc03_g10470 | Putative disease resistance protein RGA4 | -4.3 |
ID133785 | Cc02_g20110 | Amino acid permease 6 | -2.9 |
ID221155 | Cc04_g10190 | Hypothetical protein | -2.7 |
ID251461 | Cc05_g16570 | Myb family transcription factor APL | -2.5 |
ID208311 | Cc04_g01570 | Putative NADH dehydrogenase | -2.5 |
ID412256 | Cc00_g11240 | Putative Protein of unknown function | -2.4 |
ID289967 | Cc07_g03240 | Transcription factor bHLH135 | -2.4 |
ID296423 | Cc07_g07990 | Putative unknown protein | -2.4 |
ID228137 | Cc04_g15750 | Probable peptide/nitrate transporter | -2.2 |
ID220611 | Cc04_g09840 | Putative NAC domain-containing protein 68 | -2.2 |
ID321865 | Cc08_g05640 | ABC transporter G family member 14 | -2.1 |
ID378315 | Cc11_g02080 | Acetylornithine aminotransferase, chloroplastic/mitochondrial | -2.1 |
ID349824 | Cc09_g08740 | Auxin response factor 6 | -2.1 |
Table 5 . List of significantly (p < 0.05) differentially expressed upregulated genes under salt stress condition.
Accession ID | Locus_Tag | Description | Fold change |
---|---|---|---|
ID196801 | Cc03_g08920 | Hypothetical protein | 160.3 |
ID197373 | Cc03_g09460 | Hypothetical protein | 27.5 |
ID262669 | Cc06_g07480 | Hypothetical protein | 12.8 |
ID197396 | Cc03_g09490 | Hypothetical protein | 8.7 |
ID128885 | Cc02_g16600 | Snakin-2 | 5 |
ID312625 | Cc07_g19850 | Bifunctional monodehydroascorbate reductase | 4.4 |
ID217289 | Cc04_g07360 | Putative Protein aspartic protease in guard cell 1 | 4.3 |
ID412790 | Cc00_g11630 | Probable pre-mRNA-splicing factor | 4.1 |
ID94194 | Cc01_g14620 | Putative Probable LRR receptor | 3.9 |
ID140324 | Cc02_g24340 | Hypothetical protein | 3.8 |
ID238942 | Cc05_g07810 | Glutaredoxin-C9 | 3.6 |
ID130154 | Cc02_g17510 | Hypothetical protein | 3.3 |
ID241357 | Cc05_g09770 | Putative uncharacterized protein | 3.2 |
ID432820 | Cc00_g30460 | Putative RING/U-box superfamily protein (PUB) | 3.2 |
ID143867 | Cc02_g26780 | Putative uncharacterized protein | 3.1 |
ID425123 | Cc00_g22460 | COBRA-like protein 1 | 3.1 |
ID161846 | Cc02_g39350 | Cytochrome b561/ferric reductase transmembrane | 3 |
ID349096 | Cc09_g08190 | Putative UDP-glycosyltransferase 85A2 | 3 |
ID217284 | Cc04_g07350 | Putative Protein aspartic protease in guard cell 1 | 2.9 |
ID220977 | Cc04_g10090 | Putative unknown protein | 2.9 |
ID213561 | Cc04_g05040 | Putative Bifunctional dihydroflavonol 4-reductase | 2.9 |
ID216639 | Cc04_g06970 | Calmodulin binding protein 60 | 2.7 |
ID111904 | Cc02_g04570 | Hexose carrier protein HEX6 | 2.7 |
ID421434 | Cc00_g19080 | Hypothetical protein | 2.6 |
ID329413 | Cc08_g11350 | Hypothetical protein | 2.6 |
ID241348 | Cc05_g09760 | Putative uncharacterized protein | 2.6 |
ID271405 | Cc06_g13370 | 4-coumarate--CoA ligase 1 | 2.4 |
ID328966 | Cc08_g11060 | Putative Probable WRKY transcription factor 50 | 2.4 |
ID324636 | Cc08_g07970 | Hypothetical protein | 2.2 |
ID371323 | Cc10_g13630 | Putative Probable calcium-binding protein CML44 | 2.2 |
ID271443 | Cc06_g13410 | Xyloglucan endotransglucosylase | 2.2 |
ID133388 | Cc02_g19820 | Putative Uncharacterized protein | 2.2 |
ID421361 | Cc00_g19040 | Hypothetical protein | 2.2 |
ID431183 | Cc00_g28570 | Putative Urease accessory protein | 2.2 |
ID198445 | Cc03_g10440 | Cytokinin riboside 5’-monophosphate phosphoribohydrolase | 2.1 |
ID213871 | Cc04_g05230 | Hydroxycinnamoyl-Coenzyme A | 2.1 |
ID415458 | Cc00_g13890 | Double WRKY type transfactor | 2.1 |
ID404570 | Cc00_g05030 | Putative Probably inactive leucine-rich repeat receptor- | 2.1 |
ID127357 | Cc02_g15570 | Putative uncharacterized protein | 2.1 |
ID338558 | Cc09_g00710 | Auxin-responsive family protein | 2.1 |
ID214818 | Cc04_g05850 | Protein Transporter, Pam16 | 2.1 |
ID244479 | Cc05_g11900 | Putative Early nodulin-like protein 2 | 2.1 |
ID313389 | Cc07_g20430 | Hypothetical protein | 2 |
ID156312 | Cc02_g35670 | Hypothetical protein | 2 |
The gene ontology enrichment analysis was performed for significantly expressed genes. Sixty differentially expressed genes were classified into four categories. These groups are biological processes (40.83%), cellular components (20.76%), molecular functions (25.08%), and no-hit (13.33%) (Fig. 4). Among the significantly differentially expressed genes, a large number of genes were involved in biological processes. The up and down-regulated genes were further classified into several functional categories. Within the biological category, metabolic process, cellular process, single-organism process, response to a stimulus, biological regulation and localization comprised relatively a large number of up-regulated genes (20, 18, 17, 15, and 12 genes, respectively) (Fig. 5A). According to figure 5A, the down-regulated genes were relatively more involved in the metabolic process, cellular process, single-organism process, response to a stimulus, developmental process and biological regulations (7, 7, 7, 6, 5 and 5). Within the cellular component group, the number of up-regulated and down-regulated genes were classified as follows respectively; cell part (30 and 8), organelle (20 and 6), membrane part (10 and 1), membrane (9 and 2), extracellular region (8 and 0), cell (6 and 1), organelle part (3 and 1), and cell rejection (1 and 1) (Fig. 5B). Within the molecular function group, the number of up and down regulated genes were further classified into different activities accordingly such as binding (23 and 6), catalytic activity (11 and 2), transcription factor activity (3 and 3), transporter activity (3 and 2), electron carrier activity (2 and 0), molecular function regulator (1 and 0), molecular transducer activity (1 and 0), enzyme regulator activity (1 and 0), and unclassified (5 and 4) (Fig. 5C).
Gene ontology (GO) enrichment analysis performed for significantly differentially expressed genes in salt-treated coffee seedlings
Significantly (p < 0.05) differentially expressed genes in salt-treated coffee seedlings were grouped in different sub-categories (A, Biological process; B, Cellular component; C, Molecular function)
Abiotic stress is one of the serious constraints that limits agricultural productions and caused severe yield reduction, such as salinity and drought (Bray et al. 2000). However several plants have developed various mechanisms to tolerate these effects (Munns 2002). A previously conducted research has indicated that the transcription factor genes are expected to have a crucial role in regulating gene and a group of two or more genes (Nakashima et al. 2009). In the present study a total of 11 differential expressed transcription factor genes were identified (Table 3). Among differentially expressed transcription factor genes, 5 genes were showed down-regulation, and the other 6 genes were showed upregulation in salt treated coffee seedlings compared to control treatment (Table 3).
The products of several differentially expressed genes have been known for protecting plant cells from injury by producing various enzymes for the synthesis of osmolytes and enzymes to avoid reactive oxygen species and dehydrins (Bartels et al. 2005). Transcription factor genes have been widely involved in regulating the productions of functional proteins, which have a key role in plant defense mechanism (Rahaie et al. 2010 and Singh et al. 2002). Within the up regulated transcription factor genes, a significantly differential expressed WRKY genes were identified in salt-stressed coffee seedlings (Table. 3). These genes are Cc08_g11060 (Putative Probable WRKY transcription factor 50) and Cc00_g13890 (Double WRKY type transfactor). The WRKY genes are frequently reported to be involved in various stress responses. In salt- stressed roots of cotton plants, several WRKY genes showed a significant expression, such as WRKY6, WRKY33, WRKY40, and WRKY53 (Yao et al. 2011). The Cc06_g01240, (Trihelix transcription factor GT-3a) gene showed overexpression in salt-stressed coffee seedlings (Table 3). Trihelix transcription factors gene play an essential role in controlling the developmental process and response to abiotic and biotic stress factors (Wang et al. 2016). The GT-1 clade, GT-3a, and GT-3b have been shown to respond to salt stress in Arabidopsis (Park et al. 2004). Ethylene is an important stress hormone because its synthesis is induced under different oxidative environments. In the present study, the ethylene-responsive transcription factor ERF011 gene (Cc10_g04710) was significantly expressed in salt stressed coffee seedlings and it was up regulated (Table 3). Down regulated MYB family transcription factor APL gene (Cc05_g16570) was found in salt stressed coffee seedlings. Previously published studies indicated that MYB proteins are involved in many significant physiological and biochemical processes, including the regulation of primary and secondary metabolism, the control of cell development and the cell cycle, the participation in defense and response to various biotic and abiotic stresses, and hormone synthesis and signal transduction (Dubos et al. 2010 and Zhang et al. 2011). The bHLH superfamily is the second largest TF family in plants (Feller et al., 2011). Results from previous studies showed, bHLH-coding genes have suggested that they are involved in regulating a diverse array of biological and biochemical processes, such as light signaling (Roig-Villanova et al. 2007 and Leivar et al. 2008), and abiotic stress responses (Chinnusamy et al. 2003 and Kiribuchi et al. 2004). Under salt stressed coffee seedlings, down regulated bHLH transcription factors genes existed (Table 3). These genes are putative transcription factor bHLH63 (Cc05_g16570) and transcription factor bHLH135 (Cc07_g03240). According to Mao et al. (2017), under salt stress condition some special bHLH TFs are activated and bind to the promoter of the key genes involved in various signaling pathways and regulate the stress tolerance of plants by regulating the transcription level of these target genes. In this experiment, two down regulated STP transcription factor genes were significantly expressed under salt stress condition. These genes are putative transcription elongation factor SPT5 homolog 1 (Cc02_g10740) and putative transcription elongation factor SPT6 (Cc06_g21410) (Table 3).
The genes that were previously studied and known to be involved in response to salt stress and existed in our study are described below. The Cc04_g06970 (Calmodulin binding protein 60) gene expression was higher in diluted deep sea water (salt water) irrigated treatments (Table 5). Recent research studies have indicated that calmodulin binding protein 60 (CBP60) family associated in response to both biotic and abiotic stresses (Wan et al. 2012). Some members of calmodulin binding protein (for example, Q8H6T7) are identified to be involved in plant defense mechanisms against stress conditions (Ali et al. 2003). The researcher also found a significantly expressed up-regulated auxin-responsive protein coding gene (Cc09_ g00710) in salt stress condition (Table 5). Subsequently, significantly expressed down regulated auxin response factor gene (Cc09_g08740) was identified. The response of plants to various environmental stress factors at the molecular level associated with the expression of many genes involved in different pathways. Plant hormones have been associated with several abiotic and biotic stress factors (abscisic acid, ethylene, salicylic acid and jasmonic acid). Some recent research suggests that auxin is also linked to abiotic and biotic stress signaling pathways (Wang et al. 2003). The result of our experiment is in line with the finding of Jain and Khurana (2009) who reported that during various abiotic stress conditions, several auxin- responsive genes showed differential expression, which indicated a crosstalk between auxin and abiotic stress signaling. The Cc00_g22460 (COBRA-like protein 1), gene was significantly expressed in salt stressed coffee seedlings. The cell wall plays important functions in establishing the morphology of the plant cell, defense response to biotic and abiotic stresses, and mechanical properties of organs. The COBRA gene encodes a putative glycosylphosphatidylinositol (GPI)-anchored protein that controls the ability to change cellulose deposition and determine cell development in the plant cell (Gao et al. 2013).
Plants use different mechanisms to tolerate salinity stress. Among them, accumulation of lignin or modification of the monomeric composition of lignin in the cell wall is one of the major mechanisms (Neves et al. 2010). The expression of Cc04_g05230 (Hydroxycinnamoyl-Coenzyme A) gene was significant in salt treated coffee seedlings (Table 5). The accumulation of hydroxycinnamoyl-CoA, shikimate hydroxycinnamoyl transferase, acid peroxidase, and cysteine protein, is associated with lignification and was induced by salt stress in xylem sap of
A significantly differentially expressed two aspartic protease coding genes (putative aspartic protease in guard cell 1) were identified. The expression of putative aspartic protease coding genes (Cc04_g07350 and Cc04_g07360) were upregulated in salt stressed coffee seedlings (Table 5). Some research studies indicated that genes encoding plant aspartic proteases have been identified from different plant species (Mutlu and Gal 1999 and Murakami et al. 2000). Several studies have reported the functions of aspartic proteases in different physiological processes during plant development such as seed germination (Belozersky et al. 1989 and Dunaevsky et al. 1989), leaf senescence (Kato et al. 2004), the immunity response (Xia, et al. 2004), cell death (Ge, et al., 2005) and reproduction (Chen et al. 2008), little is known that aspartic proteases involving in abiotic stress responses (Yao et al. 2012). The result from this experiment also can be an additional supporting information regarding the involvement of aspartic protease gene in response to salt stress.
Up regulated putative RING/U-box superfamily protein (PUB) coding gene was identified in salt treated coffee seedlings and its expression was significant (p < 0.05) (Table 5). This result is similar to the finding of Banzai et al. (2002), who reported that a study in mangrove (
Coffee is one of the most important commercial crops worldwide. Currently, there are a lot of constraints that decline its production including biotic and abiotic factors. From this research, we can suggest there are some genes that involve in different abiotic stress factor including salt stress. Detail investigations of genomic study are crucial to figuring out salt-responsive genes in coffee. The data generated in this study will help in understanding the response of coffee seedlings at the genomic level associated with abiotic stresses in general, salt stress in particular. This study will also provide resources for functional genomic studies.
The study was supported by 2015 research grant from Kangwon National University (No. 520150118).
The distribution of genes coverage in the leaves of the coffee seedling. The identified gene coverage is the percentage of a gene that is covered by reads and defined as the ratio of the number of bases in a gene covered by uniquely mapped reads to the number of total bases in the gene. The pie chart shows the percentage of the different gene coverage listing on the left of the pie chart (A. Control treatment. B. DSW treatment)
Gene Ontology (GO) analysis result
The percentage of genes involved in different GO sub- categories
Gene ontology (GO) enrichment analysis performed for significantly differentially expressed genes in salt-treated coffee seedlings
Significantly (p < 0.05) differentially expressed genes in salt-treated coffee seedlings were grouped in different sub-categories (A, Biological process; B, Cellular component; C, Molecular function)
Table 1 . Raw and trimmed data from both treatments.
Reads | Treatments | Total read bases | Total reads | GC (%) | Q20 (%) | Q30 (%) |
---|---|---|---|---|---|---|
Raw | Control | 12,863,176,853 | 127,358,187 | 46 | 99 | 97 |
DSW (5%) | 14,505,329,591 | 143,617,125 | 47 | 99 | 97 | |
Clean | Control | 12,550,884,035 | 125,080,102 | 46 | 99 | 98 |
DSW (5%) | 14,096,701,745 | 140,506,411 | 47 | 99 | 98 |
Note; Total read bases: total reads X read length.
GC (%): GC (Guanine-cytosine) content.
Q20 (%): phred quality score20, 99% certainty (1/100 chance of an incorrect base call).
Q30 (%): phred quality score30, 99.9% certainty (1/1,000 chance of an incorrect base call).
Table 2 . Summary of Illumina transcriptome reads mapped to the reference genes.
Reads mapping | Reads number (%) | |
---|---|---|
Control | DSW (5%) | |
Processed reads | 62,540,051 | 70,253,206 |
Total mapped reads | 47,537,918 (76.02) | 50,999,074 (72.68) |
Unique match | 46,136,502 (73.7) | 48,830,704 (69.51) |
Multiple position match | 1,401,416 (2.95) | 2,168,370 (4.23) |
Total unmapped reads | 15,002,133 (23.8) | 19,254,132 (27.32) |
Overall mapping ratio | 76% | 72.10% |
Note: DSW (Deep Sea Water) was used as salt treatment.
Table 3 . List of identified differentially expressed transcription factor genes in coffee seedlings under salt stress condition.
Locus_Tag | Gene Description | Fold change | E-Value | |
---|---|---|---|---|
Up-regulated | Cc00_g13890 | Double WRKY type transfactor | 2.1 | 2.00E-124 |
Cc10_g04710 | Ethylene-responsive transcription factor ERF011 | 2.0 | 9.00E-49 | |
Cc02_g14240 | Pathogenesis-related genes transcriptional activator PTI5 | 2.4 | 1.00E-25 | |
Cc04_g05080 | Probable WRKY transcription factor 40 | 2.2 | 7.00E-90 | |
Cc08_g11060 | Putative Probable WRKY transcription factor 50 | 2.4 | 1.00E-43 | |
Cc06_g01240 | Trihelix transcription factor GT-3a | 2.1 | 2.00E-61 | |
Down-regulated | Cc05_g16570 | Myb family transcription factor APL | -2.5 | 9E-99 |
Cc02_g10740 | Putative transcription elongation factor SPT5 homolog 1 | -2.3 | 0 | |
Cc06_g21410 | Putative Transcription elongation factor SPT6 | -2.0 | 0 | |
Cc02_g17440 | Putative Transcription factor bHLH63 | -2.4 | 3E-52 | |
Cc07_g03240 | Transcription factor bHLH135 | -2.4 | 2E-23 |
Table 4 . List of significantly (p < 0.05) differentially expressed down regulated genes under salt stress condition.
Accession ID | Locus_Tag | Description | Fold change |
---|---|---|---|
ID77370 | Cc01_g02340 | Hypothetical protein | -9.8 |
ID216235 | Cc04_g06680 | Putative Probable S-adenosylmethionine | -6.4 |
ID238605 | Cc05_g07560 | Hypothetical protein | -5.4 |
ID198486 | Cc03_g10470 | Putative disease resistance protein RGA4 | -4.3 |
ID133785 | Cc02_g20110 | Amino acid permease 6 | -2.9 |
ID221155 | Cc04_g10190 | Hypothetical protein | -2.7 |
ID251461 | Cc05_g16570 | Myb family transcription factor APL | -2.5 |
ID208311 | Cc04_g01570 | Putative NADH dehydrogenase | -2.5 |
ID412256 | Cc00_g11240 | Putative Protein of unknown function | -2.4 |
ID289967 | Cc07_g03240 | Transcription factor bHLH135 | -2.4 |
ID296423 | Cc07_g07990 | Putative unknown protein | -2.4 |
ID228137 | Cc04_g15750 | Probable peptide/nitrate transporter | -2.2 |
ID220611 | Cc04_g09840 | Putative NAC domain-containing protein 68 | -2.2 |
ID321865 | Cc08_g05640 | ABC transporter G family member 14 | -2.1 |
ID378315 | Cc11_g02080 | Acetylornithine aminotransferase, chloroplastic/mitochondrial | -2.1 |
ID349824 | Cc09_g08740 | Auxin response factor 6 | -2.1 |
Table 5 . List of significantly (p < 0.05) differentially expressed upregulated genes under salt stress condition.
Accession ID | Locus_Tag | Description | Fold change |
---|---|---|---|
ID196801 | Cc03_g08920 | Hypothetical protein | 160.3 |
ID197373 | Cc03_g09460 | Hypothetical protein | 27.5 |
ID262669 | Cc06_g07480 | Hypothetical protein | 12.8 |
ID197396 | Cc03_g09490 | Hypothetical protein | 8.7 |
ID128885 | Cc02_g16600 | Snakin-2 | 5 |
ID312625 | Cc07_g19850 | Bifunctional monodehydroascorbate reductase | 4.4 |
ID217289 | Cc04_g07360 | Putative Protein aspartic protease in guard cell 1 | 4.3 |
ID412790 | Cc00_g11630 | Probable pre-mRNA-splicing factor | 4.1 |
ID94194 | Cc01_g14620 | Putative Probable LRR receptor | 3.9 |
ID140324 | Cc02_g24340 | Hypothetical protein | 3.8 |
ID238942 | Cc05_g07810 | Glutaredoxin-C9 | 3.6 |
ID130154 | Cc02_g17510 | Hypothetical protein | 3.3 |
ID241357 | Cc05_g09770 | Putative uncharacterized protein | 3.2 |
ID432820 | Cc00_g30460 | Putative RING/U-box superfamily protein (PUB) | 3.2 |
ID143867 | Cc02_g26780 | Putative uncharacterized protein | 3.1 |
ID425123 | Cc00_g22460 | COBRA-like protein 1 | 3.1 |
ID161846 | Cc02_g39350 | Cytochrome b561/ferric reductase transmembrane | 3 |
ID349096 | Cc09_g08190 | Putative UDP-glycosyltransferase 85A2 | 3 |
ID217284 | Cc04_g07350 | Putative Protein aspartic protease in guard cell 1 | 2.9 |
ID220977 | Cc04_g10090 | Putative unknown protein | 2.9 |
ID213561 | Cc04_g05040 | Putative Bifunctional dihydroflavonol 4-reductase | 2.9 |
ID216639 | Cc04_g06970 | Calmodulin binding protein 60 | 2.7 |
ID111904 | Cc02_g04570 | Hexose carrier protein HEX6 | 2.7 |
ID421434 | Cc00_g19080 | Hypothetical protein | 2.6 |
ID329413 | Cc08_g11350 | Hypothetical protein | 2.6 |
ID241348 | Cc05_g09760 | Putative uncharacterized protein | 2.6 |
ID271405 | Cc06_g13370 | 4-coumarate--CoA ligase 1 | 2.4 |
ID328966 | Cc08_g11060 | Putative Probable WRKY transcription factor 50 | 2.4 |
ID324636 | Cc08_g07970 | Hypothetical protein | 2.2 |
ID371323 | Cc10_g13630 | Putative Probable calcium-binding protein CML44 | 2.2 |
ID271443 | Cc06_g13410 | Xyloglucan endotransglucosylase | 2.2 |
ID133388 | Cc02_g19820 | Putative Uncharacterized protein | 2.2 |
ID421361 | Cc00_g19040 | Hypothetical protein | 2.2 |
ID431183 | Cc00_g28570 | Putative Urease accessory protein | 2.2 |
ID198445 | Cc03_g10440 | Cytokinin riboside 5’-monophosphate phosphoribohydrolase | 2.1 |
ID213871 | Cc04_g05230 | Hydroxycinnamoyl-Coenzyme A | 2.1 |
ID415458 | Cc00_g13890 | Double WRKY type transfactor | 2.1 |
ID404570 | Cc00_g05030 | Putative Probably inactive leucine-rich repeat receptor- | 2.1 |
ID127357 | Cc02_g15570 | Putative uncharacterized protein | 2.1 |
ID338558 | Cc09_g00710 | Auxin-responsive family protein | 2.1 |
ID214818 | Cc04_g05850 | Protein Transporter, Pam16 | 2.1 |
ID244479 | Cc05_g11900 | Putative Early nodulin-like protein 2 | 2.1 |
ID313389 | Cc07_g20430 | Hypothetical protein | 2 |
ID156312 | Cc02_g35670 | Hypothetical protein | 2 |
Seon Ae Kim, and Hae Keun Yun
J Plant Biotechnol 2016; 43(2): 204-212Toan Khac Nguyen ・Jin Hee Lim
J Plant Biotechnol 2021; 48(3): 139-147Yu Jin Jung・Joung Soon Park ・Ji Yun Go ・Hyo Ju Lee ・Jin Young Kim・Ye Ji Lee ・Ki Hong Nam ・ Yong-Gu Cho ・Kwon Kyoo Kang
J Plant Biotechnol 2021; 48(3): 124-130
Journal of
Plant BiotechnologyThe distribution of genes coverage in the leaves of the coffee seedling. The identified gene coverage is the percentage of a gene that is covered by reads and defined as the ratio of the number of bases in a gene covered by uniquely mapped reads to the number of total bases in the gene. The pie chart shows the percentage of the different gene coverage listing on the left of the pie chart (A. Control treatment. B. DSW treatment)
|@|~(^,^)~|@|Gene Ontology (GO) analysis result
|@|~(^,^)~|@|The percentage of genes involved in different GO sub- categories
|@|~(^,^)~|@|Gene ontology (GO) enrichment analysis performed for significantly differentially expressed genes in salt-treated coffee seedlings
|@|~(^,^)~|@|Significantly (p < 0.05) differentially expressed genes in salt-treated coffee seedlings were grouped in different sub-categories (A, Biological process; B, Cellular component; C, Molecular function)