J Plant Biotechnol 2020; 47(1): 15-25
Published online March 31, 2020
https://doi.org/10.5010/JPB.2020.47.1.015
© The Korean Society of Plant Biotechnology
Correspondence to : e-mail: h.zarei1989@gmail.com
This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Allium L. is one of the largest genera of the Amaryllidaceae family, with more than 920 species including many economically important species used as vegetables, spices, medicines, or ornamental plants. Currently, DNA barcoding tools are being successfully used for the molecular taxonomy of Allium. A total of 46 Allium species were collected from their native areas, and DNA was extracted using the IBRC DNA extraction kit. We used specific primers to PCR amplify matK. DNA sequences were edited and aligned for homology, and a phylogenetic tree was constructed using the neighbor-joining method. The results show thymine (38.5%) was the most frequent and guanine (13.9%) the least frequent nucleotide. The matK regions of the populations were quite highly conserved, and the amount of C and CT was calculated at 0.162 and 0.26, respectively. Analysis of the nucleotide substitution showed C-T (26.22%) and A-G (8.08%) to have the highest and lowest percent, respectively. The natural selection process dN/dS was 1.16, and the naturality test results were -1.5 for Tajima’s D and -1.19 for Fu’s Fs. The NJ dendrogram generated three distinct clades: the first contained Allium austroiranicum and A. ampeloprasum; the second contained A. iranshahrii, A. bisotunense, and A. cf assadi; and the third contained A. rubellum and other species. In this study, we tested the utility of the matK region as a DNA barcode for discriminating Allium. species.
Keywords cpDNA, marker, Molecular phylogeny, matK, Allium, Taxa
The main focus of investigators has been on existing natural plants that are recently cultivated in investigations, and most of
To increase the number of qualitative traits for tightly related species, molecular markers have greater capability than morphologic characters, most of which present only quantitatively diverse attributes (Harpke et al. 2013). Linne von Berg pioneered in a publication for the organization of the
Currently, the DNA barcoding technique has been proven as an instructive and efficacious procedure for assessing plant phylogenies and presented successful applications in the molecular classification of
The
Totally, 46 species in 11 sections representative of five subgenera of
Table 1 .
Code | Species | Subgenus | Section | Accession number |
---|---|---|---|---|
1 | P1009858 | |||
2 | P1011312 | |||
3 | P1010457 | |||
4 | P1010913 | |||
5 | P1010908 | |||
6 | P1004474 | |||
7 | P1006775 | |||
8 | P1009666 | |||
9 | P1011153 | |||
10 | P1010598 | |||
11 | P1010598 | |||
12 | P1010909 | |||
13 | P1011015 | |||
14 | P1009998 | |||
15 | P1010022 | |||
16 | P1010382 | |||
17 | P1009475 | |||
18 | P1011306 | |||
19 | P1010910 | |||
20 | P1011119 | |||
21 | P1009733 | |||
22 | P1011271 | |||
23 | P1010341 | |||
24 | P1010423 | |||
25 | P1010355 | |||
26 | P1011289 | |||
27 | P1010524 | |||
28 | P1009955 | |||
29 | P1010986 | |||
30 | P1011021 | |||
31 | P1011232 | |||
32 | P1010545 | |||
33 | P1009877 | |||
34 | P1009972 | |||
35 | P1009837 | |||
36 | P1010653 | |||
37 | P1009668 | |||
38 | P1009668 | |||
39 | P1009964 | |||
40 | P1010429 | |||
41 | P1009965 | |||
42 | P1009676 | |||
43 | P1010544 | |||
44 | P1009439 | |||
45 | P1011079 | |||
46 | P1009928 |
The entire DNA was extracted from fresh leaf tissues of
The
Extraction of DNA sequences was performed from the chromatograms of the company by the use of Chromas v2, which were edited by the BioEdit program (Hall et al. 1999). The sequences were then put together with the CAP3 tool (Huang et al. 1999) and alignment of the homologous sequences was carried out using the EMBL-EBI CLUSTAL W tool (Edgar, 2004). The MEGA v6.0 software was used to analyze the multiple sequence alignment (MSA) file (Tamura et al. 2011). The genetic distance per loci among the accessions was estimated based on the number of base-pair replacement among the sequences. All the positions with missing data were eliminated by Kimura’s 2-parameter model. Moreover, the distance between matrices from the three loci by DNAsp was analyzed with Pearson’s correlation. The phylogenetic tree was developed by the Maximum Parsimony (MP) and Neighbor Joining (NJ) approach with a 1000 replicate bootstrap by the MEGA 6.0 software.
For estimating the resolution of DNA barcode, the percentage of produced monophyletic groups was determined using a bootstrap greater than 50% as a factor for defining the nodes, as recommended by Tripathi in DNAsp (Tripathi et al. 2013). The Tajima’s D and Fu’s Fs naturality tests were calculated by the DNAsp software. The amount of the dN/dS ratio was obtained numerically by the use of HIV databases. Haplotype network was scrutinized by the popART software (Leigh and Beryant, 2015).
The scientific progressive innovations in molecular science and sequencing approaches has empowered the recognition of organismal genomes. Besides, important data are provided by continuing variety of ongoing genome projects for multiple species concerning their classification, gene structure, and application scientifically. Here, nucleotide polymorphisms of the
According to our findings on the analyzed
Table 2 . Nucleotide abundances derived from the
G | C | T/U | A | Nucleotide |
---|---|---|---|---|
13.9 | 15.2 | 38.5 | 32.4 | Frequency |
Table 3 . Nucleotide substitution pattern estimation matrix of the
G | C | T | A | From/To |
---|---|---|---|---|
2.75 | 6.74 | - | A | |
2.49 | - | 5.92 | T | |
8.49 | - | 5.92 | C | |
- | 2.75 | 6.74 | G |
Each value represents the frequency of substitutions from one base (row) to another base (column). In this table, the percentage of transition mutations (purine-purine substitution, pyrimidine-pyrimidine substitution) and transversion mutations (purine-pyrimidine substitution and vice versa) are shown in bold and italics, respectively.
A total of 595 mutations was identified for genetic indicators of the
Table 4 . Gene polymorphism of
K | Eta | Pi | Hd | H | S | Population |
---|---|---|---|---|---|---|
80.112 | 595 | 0.160 | 0.9874 | 39 | 423 |
S: Number of polymorphic positions, H: number of haplotypes, Pi: nucleotide diversity, Eta: total number of mutations, K: number of nucleotide differences between populations or species (nucleotide divergence)
Table 5 . Conserved DNA regions of the
CT | MWL | C | Population |
---|---|---|---|
0.26 | 50 | 0.162 |
C: Sequence conservation, MWL: Minimum conservation Length, CT: Conservation threshold
An estimated value of 1.16 was obtained for the dN/dS proportion in
Table 6 . Identification of the
Numerical value | Parameter |
---|---|
2.3254 | dN |
1.9970 | dS |
1.1644 | dN/dS† |
†The numerical value of dn / ds represents the natural selection process
Table 7 . Results obtained from an evaluation of the natural evolution of the
Fu's Fs | Tajima's D | Gene |
---|---|---|
-1.19 | -1.50 |
It is of paramount importance to determine haplotype groups (by the popART software) for determination of the geographic regions of the examined breeds in comparison to other breeds. The entire 46 samples grouped in haplotype bunch B is the largest in various species around the world. Based on our observations, the haplogroup A is usually present in all continents, and the haplogroup B may similarly have originated from Asia (Fig. 3) (Ghanbari et al. 2018).
The NJ tree was drawn by the Kimura distance (determination of distances aimed at building NJ tree using the MEGA 6.0 with the program defaults). The nucleotide sequences of the
In the current investigation, the
In a previous study, the
Here, the efficacy of the
The length of
Phylogenetic associations among
These observations indicated a close relation between
Here,
As in the phylogenetic tree drawn inFriesen et al. (2006) in which subclades
A research confirmed the
J Plant Biotechnol 2020; 47(1): 15-25
Published online March 31, 2020 https://doi.org/10.5010/JPB.2020.47.1.015
Copyright © The Korean Society of Plant Biotechnology.
Hemadollah Zarei · Barat Ali Fakheri · Mohammad Reza Naghavi · Nafiseh Mahdinezhad
Department of Plant Breeding and Biotechnology, Faculty of Agriculture, University of Zabol, Zabol, Iran
Department of Agronomy and Plant Breeding, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
Plant Bank, Iranian Biological Resource Center (IBRC), ACECR, Tehran, Iran
Correspondence to:e-mail: h.zarei1989@gmail.com
This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Allium L. is one of the largest genera of the Amaryllidaceae family, with more than 920 species including many economically important species used as vegetables, spices, medicines, or ornamental plants. Currently, DNA barcoding tools are being successfully used for the molecular taxonomy of Allium. A total of 46 Allium species were collected from their native areas, and DNA was extracted using the IBRC DNA extraction kit. We used specific primers to PCR amplify matK. DNA sequences were edited and aligned for homology, and a phylogenetic tree was constructed using the neighbor-joining method. The results show thymine (38.5%) was the most frequent and guanine (13.9%) the least frequent nucleotide. The matK regions of the populations were quite highly conserved, and the amount of C and CT was calculated at 0.162 and 0.26, respectively. Analysis of the nucleotide substitution showed C-T (26.22%) and A-G (8.08%) to have the highest and lowest percent, respectively. The natural selection process dN/dS was 1.16, and the naturality test results were -1.5 for Tajima’s D and -1.19 for Fu’s Fs. The NJ dendrogram generated three distinct clades: the first contained Allium austroiranicum and A. ampeloprasum; the second contained A. iranshahrii, A. bisotunense, and A. cf assadi; and the third contained A. rubellum and other species. In this study, we tested the utility of the matK region as a DNA barcode for discriminating Allium. species.
Keywords: cpDNA, marker, Molecular phylogeny, matK, Allium, Taxa
The main focus of investigators has been on existing natural plants that are recently cultivated in investigations, and most of
To increase the number of qualitative traits for tightly related species, molecular markers have greater capability than morphologic characters, most of which present only quantitatively diverse attributes (Harpke et al. 2013). Linne von Berg pioneered in a publication for the organization of the
Currently, the DNA barcoding technique has been proven as an instructive and efficacious procedure for assessing plant phylogenies and presented successful applications in the molecular classification of
The
Totally, 46 species in 11 sections representative of five subgenera of
Table 1 .
Code | Species | Subgenus | Section | Accession number |
---|---|---|---|---|
1 | P1009858 | |||
2 | P1011312 | |||
3 | P1010457 | |||
4 | P1010913 | |||
5 | P1010908 | |||
6 | P1004474 | |||
7 | P1006775 | |||
8 | P1009666 | |||
9 | P1011153 | |||
10 | P1010598 | |||
11 | P1010598 | |||
12 | P1010909 | |||
13 | P1011015 | |||
14 | P1009998 | |||
15 | P1010022 | |||
16 | P1010382 | |||
17 | P1009475 | |||
18 | P1011306 | |||
19 | P1010910 | |||
20 | P1011119 | |||
21 | P1009733 | |||
22 | P1011271 | |||
23 | P1010341 | |||
24 | P1010423 | |||
25 | P1010355 | |||
26 | P1011289 | |||
27 | P1010524 | |||
28 | P1009955 | |||
29 | P1010986 | |||
30 | P1011021 | |||
31 | P1011232 | |||
32 | P1010545 | |||
33 | P1009877 | |||
34 | P1009972 | |||
35 | P1009837 | |||
36 | P1010653 | |||
37 | P1009668 | |||
38 | P1009668 | |||
39 | P1009964 | |||
40 | P1010429 | |||
41 | P1009965 | |||
42 | P1009676 | |||
43 | P1010544 | |||
44 | P1009439 | |||
45 | P1011079 | |||
46 | P1009928 |
The entire DNA was extracted from fresh leaf tissues of
The
Extraction of DNA sequences was performed from the chromatograms of the company by the use of Chromas v2, which were edited by the BioEdit program (Hall et al. 1999). The sequences were then put together with the CAP3 tool (Huang et al. 1999) and alignment of the homologous sequences was carried out using the EMBL-EBI CLUSTAL W tool (Edgar, 2004). The MEGA v6.0 software was used to analyze the multiple sequence alignment (MSA) file (Tamura et al. 2011). The genetic distance per loci among the accessions was estimated based on the number of base-pair replacement among the sequences. All the positions with missing data were eliminated by Kimura’s 2-parameter model. Moreover, the distance between matrices from the three loci by DNAsp was analyzed with Pearson’s correlation. The phylogenetic tree was developed by the Maximum Parsimony (MP) and Neighbor Joining (NJ) approach with a 1000 replicate bootstrap by the MEGA 6.0 software.
For estimating the resolution of DNA barcode, the percentage of produced monophyletic groups was determined using a bootstrap greater than 50% as a factor for defining the nodes, as recommended by Tripathi in DNAsp (Tripathi et al. 2013). The Tajima’s D and Fu’s Fs naturality tests were calculated by the DNAsp software. The amount of the dN/dS ratio was obtained numerically by the use of HIV databases. Haplotype network was scrutinized by the popART software (Leigh and Beryant, 2015).
The scientific progressive innovations in molecular science and sequencing approaches has empowered the recognition of organismal genomes. Besides, important data are provided by continuing variety of ongoing genome projects for multiple species concerning their classification, gene structure, and application scientifically. Here, nucleotide polymorphisms of the
According to our findings on the analyzed
Table 2 . Nucleotide abundances derived from the
G | C | T/U | A | Nucleotide |
---|---|---|---|---|
13.9 | 15.2 | 38.5 | 32.4 | Frequency |
Table 3 . Nucleotide substitution pattern estimation matrix of the
G | C | T | A | From/To |
---|---|---|---|---|
2.75 | 6.74 | - | A | |
2.49 | - | 5.92 | T | |
8.49 | - | 5.92 | C | |
- | 2.75 | 6.74 | G |
Each value represents the frequency of substitutions from one base (row) to another base (column). In this table, the percentage of transition mutations (purine-purine substitution, pyrimidine-pyrimidine substitution) and transversion mutations (purine-pyrimidine substitution and vice versa) are shown in bold and italics, respectively..
A total of 595 mutations was identified for genetic indicators of the
Table 4 . Gene polymorphism of
K | Eta | Pi | Hd | H | S | Population |
---|---|---|---|---|---|---|
80.112 | 595 | 0.160 | 0.9874 | 39 | 423 |
S: Number of polymorphic positions, H: number of haplotypes, Pi: nucleotide diversity, Eta: total number of mutations, K: number of nucleotide differences between populations or species (nucleotide divergence).
Table 5 . Conserved DNA regions of the
CT | MWL | C | Population |
---|---|---|---|
0.26 | 50 | 0.162 |
C: Sequence conservation, MWL: Minimum conservation Length, CT: Conservation threshold.
An estimated value of 1.16 was obtained for the dN/dS proportion in
Table 6 . Identification of the
Numerical value | Parameter |
---|---|
2.3254 | dN |
1.9970 | dS |
1.1644 | dN/dS† |
†The numerical value of dn / ds represents the natural selection process.
Table 7 . Results obtained from an evaluation of the natural evolution of the
Fu's Fs | Tajima's D | Gene |
---|---|---|
-1.19 | -1.50 |
It is of paramount importance to determine haplotype groups (by the popART software) for determination of the geographic regions of the examined breeds in comparison to other breeds. The entire 46 samples grouped in haplotype bunch B is the largest in various species around the world. Based on our observations, the haplogroup A is usually present in all continents, and the haplogroup B may similarly have originated from Asia (Fig. 3) (Ghanbari et al. 2018).
The NJ tree was drawn by the Kimura distance (determination of distances aimed at building NJ tree using the MEGA 6.0 with the program defaults). The nucleotide sequences of the
In the current investigation, the
In a previous study, the
Here, the efficacy of the
The length of
Phylogenetic associations among
These observations indicated a close relation between
Here,
As in the phylogenetic tree drawn inFriesen et al. (2006) in which subclades
A research confirmed the
Table 1 .
Code | Species | Subgenus | Section | Accession number |
---|---|---|---|---|
1 | P1009858 | |||
2 | P1011312 | |||
3 | P1010457 | |||
4 | P1010913 | |||
5 | P1010908 | |||
6 | P1004474 | |||
7 | P1006775 | |||
8 | P1009666 | |||
9 | P1011153 | |||
10 | P1010598 | |||
11 | P1010598 | |||
12 | P1010909 | |||
13 | P1011015 | |||
14 | P1009998 | |||
15 | P1010022 | |||
16 | P1010382 | |||
17 | P1009475 | |||
18 | P1011306 | |||
19 | P1010910 | |||
20 | P1011119 | |||
21 | P1009733 | |||
22 | P1011271 | |||
23 | P1010341 | |||
24 | P1010423 | |||
25 | P1010355 | |||
26 | P1011289 | |||
27 | P1010524 | |||
28 | P1009955 | |||
29 | P1010986 | |||
30 | P1011021 | |||
31 | P1011232 | |||
32 | P1010545 | |||
33 | P1009877 | |||
34 | P1009972 | |||
35 | P1009837 | |||
36 | P1010653 | |||
37 | P1009668 | |||
38 | P1009668 | |||
39 | P1009964 | |||
40 | P1010429 | |||
41 | P1009965 | |||
42 | P1009676 | |||
43 | P1010544 | |||
44 | P1009439 | |||
45 | P1011079 | |||
46 | P1009928 |
Table 2 . Nucleotide abundances derived from the
G | C | T/U | A | Nucleotide |
---|---|---|---|---|
13.9 | 15.2 | 38.5 | 32.4 | Frequency |
Table 3 . Nucleotide substitution pattern estimation matrix of the
G | C | T | A | From/To |
---|---|---|---|---|
2.75 | 6.74 | - | A | |
2.49 | - | 5.92 | T | |
8.49 | - | 5.92 | C | |
- | 2.75 | 6.74 | G |
Each value represents the frequency of substitutions from one base (row) to another base (column). In this table, the percentage of transition mutations (purine-purine substitution, pyrimidine-pyrimidine substitution) and transversion mutations (purine-pyrimidine substitution and vice versa) are shown in bold and italics, respectively..
Table 4 . Gene polymorphism of
K | Eta | Pi | Hd | H | S | Population |
---|---|---|---|---|---|---|
80.112 | 595 | 0.160 | 0.9874 | 39 | 423 |
S: Number of polymorphic positions, H: number of haplotypes, Pi: nucleotide diversity, Eta: total number of mutations, K: number of nucleotide differences between populations or species (nucleotide divergence).
Table 5 . Conserved DNA regions of the
CT | MWL | C | Population |
---|---|---|---|
0.26 | 50 | 0.162 |
C: Sequence conservation, MWL: Minimum conservation Length, CT: Conservation threshold.
Table 7 . Results obtained from an evaluation of the natural evolution of the
Fu's Fs | Tajima's D | Gene |
---|---|---|
-1.19 | -1.50 |
Ju-Ryeon Jo ・Tae-Ho Park
J Plant Biotechnol 2024; 51(1): 158-166Ju-Ryeon Jo ・Tae-Ho Park
J Plant Biotechnol 2024; 51(1): 143-151Seoyeon Son ・Tae-Ho Park
J Plant Biotechnol 2024; 51(1): 121-128
Journal of
Plant Biotechnology