b. Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China;
c. Haikou Duotan Wetlands Institute, Haikou, 570100, China;
d. Orchid Conservation & Research Center of Shenzhen, Shenzhen, 518114, China;
e. Research Centre of Ecological Sciences, College of Agronomy, Jiangxi Agricultural University, Nanchang, 330045, China
Wenchengia alternifolia C.Y. Wu & S. Chow (Lamiaceae) was described on the basis of two gatherings collected on Hainan Island, southern China, in the 1930s (Wu and Chow, 1965). It is characterized by having alternate leaves, racemose inflorescences, and a unique type of nutlet attachment described as vascular funicles with slender stalks (Wu and Chow, 1965; Li et al., 2012). Because these traits are rare or unique in the Lamiaceae, Wu and Chow (1965) established the monotypic genus, Wenchengia C.Y. Wu & S. Chow, and the novel subfamily, Wenchengioideae C.Y. Wu & S. Chow, to accommodate it. W. alternifolia was believed to be extinct (Harley et al., 2004).
The phylogenetic position of Wenchengia has been controversial due to the unusual morphological traits and the lack of available materials for further studies. In 2010, a small population was rediscovered at the type locality, Shuangximu Valley, Wanning County (Fig. 1), Hainan Province, China. Using materials from this population, Li et al. (2012) inferred the phylogenetic position of W. alternifolia using comprehensive evidence from molecular, morphological, anatomical and cytological data to confirm that Wenchengia is the first diverging lineage of the subfamily Scutellarioideae. The finding was further supported by phylogenomic analyses using plastome data (Zhao et al., 2020, 2021).
Li et al. (2014) assessed the conservation status of W. alternifolia based on the rediscovered population and proposed that it be listed as Critically Endangered (CR) under the IUCN Red List Criteria. Fragmentation of habitats and persistent disturbance caused by the rapid development of urbanization, mining, plantations and tourism in Wanning County were thought to be a threat to the survival of W. alternifolia in the wild (Li et al., 2014). In 2017, a second larger population was discovered by Chunlei Xiang in Ding'an County (personal communication; Fig. 2), Hainan Province. The individuals in this population differed morphologically from the population in Wanning County, but it was unclear whether the variance was due to genetic or environmental differences.
Wenchengia alternifolia has long been considered endemic to Hainan Island (Harley et al., 2004; Li et al., 2012). However, three old gatherings of W. alternifolia from Vietnam were found in the herbarium of the Muséum national d'Histoire naturelle in Paris (P) and in the herbarium at the Royal Botanic Gardens, Kew (K) (Paton et al., 2016). In the next year, three extant populations of W. alternifolia were discovered in Vietnam (Fig. 3) by Bo Li and his team. Thus, W. alternifolia probably had a wider distribution in the past. The species should be given high priority for conservation due to its distinct phylogenetic position and the limited number of known populations. Although previous studies have used plastomes to reconstruct phylogenetic relationships within the Scutellarioideae, including Wenchengia (Zhao et al., 2020), more information on the genomes and effective intraspecific molecular markers of W. alternifolia that could be used for population genetics and DNA barcoding were unavailable.
DNA barcoding and relative approaches can not only provide valuable tools for phylogenetic reconstruction and species identification (Calonje et al., 2009; Qin et al., 2020), but also offer insight into studies below the species level, including recognition of infraspecific variation, patterns of gene flow and habitat differentiation (Bockelmann et al., 2003; Kane et al., 2012; Ståhlberg, 2007). Up to now, many fragments of intergenic spacers, coding regions and introns of plastomes have been widely used for DNA barcoding, but whether these regions are useful in closely related species or lower taxonomic levels is unclear (Dong et al., 2012; Hollingsworth et al., 2011; Kress et al., 2005). Therefore, searching for additional plastid hotspots with higher evolutionary rates and divergence is important. Simple sequence repeats (SSRs), containing 1–6 bp repetitive sequences, are widely used in genetic studies. To detect higher levels of polymorphism, genomic SSR (gSSR) markers have recently gained more attention comparing to EST-SSRs because intergenic spacers or intron contain more variations than extron sequences (Bae et al., 2015; Blair et al., 2006; Liu et al., 2021). In addition, targeted sequence capture is an efficient and cost-effective approach for generating phylogenomic data sets after the boost of high-throughput sequencing technology. A set of probes was designed from 353 putative single-copy nuclear genes (SCNGs) over 600 angiosperms to hybridize with template sequences and to capture target genes for subsequent analyses. It would be effective for phylogenetic studies at various taxonomic levels from species to the entire angiosperm clade (Johnson et al., 2019). The polymorphism detected within those genes, in combination with plastid hotspots and gSSR markers, will provide useful molecular tools for further studies on population genetics and evolutionary history.
In this study, we aim to identify plastid hotspots, polymorphic gSSRs, and SCNGs within Wenchengia alternifolia based on genome skimming data. Our findings will provide powerful tools for future studies on its conservation biology and population genetics.
2. Materials and methods 2.1. Plant material, DNA extraction and sequencingFresh leaf materials of five individuals of Wenchengia alternifolia from the two populations in Hainan Province (Pan Li & Langxing Yuan LP197753, Waning, Hainan, China; Pan Li & Langxing Yuan LP197754, Ding'an, Hainan, China) and three populations in Vietnam (Bo Li LB0647, Hue, Vietnam; Bo Li LB0824, Da Nang, Vietnam; Bo Li LB0938, Quang Nam, Vietnam) were dried in silica gel. Voucher specimens were deposited in the herbarium of Zhejiang University (HZU) and the herbarium of Jiangxi Agricultural University (JXAU). Total DNA was extracted from 3 mg of leaf tissue using the DNA Plantzol Reagent (Shanghai, China) and following the manufacturer's protocol. The quality and quantity of the genomic DNA was checked on an Agilent BioAnalyzer 2100 (Agilent Technologies). After sample quality check, the genomic DNA was fragmented by ultrasound on a Covaris E220 (Covaris, Brighton, UK). Fragments 300~500 bp long were selected using Pippin Prep (Sage Science, Beverly, MA, USA). The selected DNA fragments were then repaired to obtain a blunt end and modified at the 3′ end to obtain a dATP sticky end. The dTTP tailed adaptor was ligated to both ends of the DNA fragments. The ligation product was amplified by PCR and circularized to produce a single stranded circular (ssCir) library. The ssCir library was then amplified through rolling circle amplification (RCA) to obtain a DNA nanoball (DNB). The DNB was then loaded to a flowcell, and sequenced on a DNBSEQ Platform (Drmanac et al., 2010).
2.2. Genome assembly and annotationThe quality check of the raw reads was carried out in FastQC, and reads with Phred score < 30 (0.001 probability error) were discarded. The resulting clean reads were used for de novo assembly of the plastomes through the GetOrganelle v1.6.2 pipeline (Jin et al., 2020). The complete plastomes were annotated according to reference genome downloaded from NCBI (GenBank accession number: MN128379; MN128378) in Geneious Prime (2020).0.5 (Kearse et al., 2012). The putative starts, stops, and intron positions were identified by comparison with homologous genes of the reference genome (Zhao et al., 2020). The graphical map of the annotated circular plastomes was drawn using the OrganellarGenomeDRAW program (OGDRAW, Lohse et al., 2013).
To improve the efficiency of assembling, the overly large sequencing dataset of Wenchengia alternifolia, 8 Gb of WN and DN were sampled using Seqkit v0.14.0 and assembled into contigs in SPAdes v3.13.0 with default settings (Bankevich et al., 2012). All clean reads of each Vietnam individuals (~8 Gb) were directly processed in the pipeline to generate contigs. Contigs quality were assessed by QUAST v5.0.2 (Gurevich et al., 2013).
2.3. Genomic resources development based on Wenchengia alternifolia plastomesFive plastomes were multiple aligned using Mafft Multiple Alignment plugin v1.4.0 (Katoh and Standley, 2013) in Geneious Prime. Variable regions more than 200 bp long were selected and generated in DnaSP 6 (Rozas et al., 2017) to calculate the total number of mutation (Eta) and average number of nucleotide differences (K), which were used to determine nucleotide diversity (Pi).
2.4. Genomic resources development based on Wenchengia alternifolia nuclear genomes 2.4.1. Polymorphic gSSRs identificationThe contigs generated by SPAdes were aligned to the plastome sequence assembled by GetOrganelle and the mitochondrial DNA scaffolds of Scutellaria amoena C.H. Wright (GenBank accession number: MT277281.1, MT277264.1, MT277230.1, MT277181.1) downloaded from NCBI. Using BLAST search (BLAST v2.11.0), we were able to remove the plastome and mitogenome sequences. Then we discarded sequences less than 300 bp long. Candidate polymorphic gSSRs within W. alternifolia were identified based on the remanent multiple assembled sequences using the CandiSSR v20170602 software set at default parameters (Xia et al., 2016). For each target SSR, primers were automatically designed based on the Primer3 package built-in installation to the pipeline (Untergasser et al., 2012).
2.4.2. Putative single-copy genes and single nucleotide polymorphism identificationTo capture targeted genes and analyze single nucleotide polymorphism (SNP) within them, we searched through the clean reads for each individual using HybPiper v1.3.1 (Johnson et al., 2016) with default settings. 353 putative single-copy nuclear genes from 42 angiosperms (Johnson et al., 2019) were used as probes in the pipeline. Sequences captured for the same gene in at least two individuals were multiple aligned using the Mafft plugin in Geneious Prime to identify SNPs. To improve the accuracy and reliability of the results, we discarded sequences with a gene recovery rate of less than 25% and abandoned the polymorphism detected in sequence regions with an alignment identity of less than 80%.
3. Results 3.1. Genome assembly and featuresFive complete plastome sequences were successfully constructed with no Ns or gaps in the GetOrganelle pipeline. Genome sizes varied from 150, 204 bp in Wenchengia alternifolia DN to 152, 961 bp in W. alternifolia DA. The plastome sequences have been submitted to GenBank (accession numbers shown in Table 1). All plastomes exhibited a quadripartite structure similar to other angiosperms (Li et al., 2017; Liu et al., 2017), comprising of two copies of IR regions separated by a LSC region and an SSC region (Fig. 4, Table 1). The GC content overall and in different regions was similar among five individuals. Plastomes of W. alternifolia WN, DA and QN contained 114 unique genes, including 80 protein-coding genes, 30 tRNA genes, and 4 rRNA genes with 18 genes duplicated in the IR regions, while plastomes of W. alternifolia H and DN lost the protein coding gene ndhF and contained only 113 unique genes (Table 2). The difference in length of the five plastomes was mainly due to the loss of the ndhF gene in W. alternifolia DN and H.
Characteristics | W. alternifolia WN | W. alternifolia DA | W. alternifolia H | W. alternifolia DN | W. alternifolia QN |
Location | Wanning, Hainan, China | Ding'an, Hainan, China | Hue, Vietnam | Da Nang, Vietnam | Quang Nam, Vietnam |
GenBank numbers | MW802188 | MW802189 | MZ905467 | MZ905468 | MZ905466 |
Total cpDNA Size (bp) | 152, 843 | 152, 961 | 150, 204 | 150, 219 | 152, 398 |
LSC length (bp) | 84, 807 | 84, 921 | 84, 314 | 84, 329 | 84, 345 |
SSC length (bp) | 16, 768 | 16, 774 | 13, 694 | 13, 694 | 16, 841 |
IR length (bp) | 25, 634 | 25, 633 | 26, 098 | 26, 098 | 25, 606 |
Total GC content (%) | 38.3 | 38.2 | 38.4 | 38.4 | 38.3 |
LSC | 36.3 | 36.3 | 36.4 | 36.3 | 36.4 |
SSC | 32.8 | 32.7 | 32.9 | 32.9 | 32.9 |
IR | 43.4 | 43.3 | 43.2 | 43.1 | 43.4 |
Total number of genes | 114 | 114 | 113 | 113 | 114 |
Protein-coding genes | 80 | 80 | 79 | 79 | 80 |
rRNAs genes | 4 | 4 | 4 | 4 | 4 |
tRNAs genes | 30 | 30 | 30 | 30 | 30 |
Duplicated genes | 20 | 20 | 20 | 20 | 20 |
Groups of gene | Name of gene |
Ribosomal RNAs | rrn16 (×2), rrn23 (×2), rrn4.5 (×2), rrn5 (×2)
trnK-UUUa, trnQ-UUG, trnS-GCU, trnG-GCCa, trnR-UCU trnC-GCA, trnD-GUC, trnY-GUA, trnE-UUC, trnT-GGU trnS-UGA, trnG-UCC, trnfM-CAU, trnS-GGA, trnT-UGU |
Transfer RNAs | trnL-UAAa, trnF-GAA, trnV-UACa, trnM-CAU, trnW-CCA trnP-UGG, trnH-GUG (×2), trnI-CAU (×2), trnL-CAA (×2) trnV-GAC (×2), trnI-GAUa (×2), trnA-UGCa (×2), trnR-ACG (×2) trnN-GUU (×2), trnL-UAG |
Photosystem I | psaB, psaA, psaI, psaJ, psaC, ycf3b |
Photosystem II | psbA, psbK, psbI, psbM, psbD, psbC, psbZ, psbJ, psbL, psbF, psbE, psbB, psbT, psbN, psbH |
Cytochrome | petN, petA, petL, petG, petBa, petDa |
ATP synthase | atpA, atpFa, atpH, atpI, atpE, atpB, |
Rubisco | rbcL |
NADH dehydrogenease | ndhJ, ndhK, ndhC, ndhBa (×2), ndhFc, ndhD, ndhE,
ndhG, ndhI, ndhAa, ndhH |
ATP-dependent protease subunit P | clpPb |
Envelope membrane protein | cemA |
Large units of ribosome | rpl33, rpl20, rpl36, rpl14, rpl16a, rpl22, rpl2a (×2), rpl23 (×2), rpl32 |
Small units of ribosome | rps16a, rps2, rps14, rps4, rps18, rps12, rps11, rps8, rps3, rps19, rps7 (×2), rps15 |
RNA polymerase subunits | rpoC2, rpoC1a, rpoB, rpoA, |
Maturase | matK |
Subunit of acetyl-CoAcarboxylase | accD |
Cytochrome c biogenesis protein |
ccsA |
Component of TIC complex | ycf1 |
Unknown function | ycf4, ycf2, ycf15 (×2) |
a Indicates genes containing a single intron. b Indicates genes containing two introns. c Indicates gene loss in W. alternifolia H and DN plastomes; (×2) indicates genes duplicated in IR regions; pseudogene is represented by Ψ. |
A total of 129, 404 (W. alternifolia WN), 126, 341 (DA), 110, 315 (H) 117, 076 (DN), 128, 838(QN) contigs with a length of more than 500 bp were generated after whole genome de novo assembly using SPAdes. The assembly results for each individual are shown in Table 3.
Characteristics | W. alternifolia WN | W. alternifolia DA | W. alternifolia H | W. alternifolia DN | W. alternifolia QN |
Clean reads dataset size (Gb) | 8 | 8 | 8 | 8 | 8 |
Number of contigs ≥ 500 bp | 129, 404 | 126, 341 | 110, 315 | 117, 076 | 128, 838 |
Total length (Mb) | 370 | 370 | 333 | 337 | 344 |
N50 | 5470 | 5669 | 5262 | 4911 | 4205 |
The divergence hotspots in the plastomes can provide effective phylogenetic information and serve as DNA barcodes. We screened 127 loci (48 inter-genic spacers, 62 coding genes, and 17 intron regions) with a length greater than 200 bp in the plastomes of five individuals of W. alternifolia (Fig. 5). Nucleotide diversity (Pi) values for each locus ranged from 0.000213 (rrn23 gene) to 0.020896 (ndhG-ndhI region). Six of the variable loci with Pi > 0.01, including four inter-genic spacers (trnH-psbA, psbA-trnK, ndhG-ndhI and rps15-ycf1) and two coding genes (rpl22 and ndhE), showed relatively high nucleotide diversity values. Among the variable region above, Pi value of trnH-psbA and ndhG-ndhI was more than 0.015. All of them can be used as highly informative molecular markers for W. alternifolia.
3.2.2. Genomic SSR (gSSR) markersWithin Wenchengia alternifolia, 20, 953 candidate polymorphic gSSRs were identified. After selecting the loci identified in all five individuals and discarding loci with either no available primers or sequence similarity < 99%, we obtained 1621 polymorphic gSSRs with a standard deviation between 0.4 and 3.32 (Table S1; Table S2). Among the polymorphic gSSRs, di- (1197), tri- (362), tetra- (52), penta- (9) and hexanucleotides (1) accounted for 73.84%, 22.33%, 3.21%, 0.56% and 0.06%, respectively (Fig. 6). The polymorphism within W. alternifolia for each type of gSSRs is shown in the results of the standard deviation of the repeats (Table S3).
3.2.3. Putative single-copy genes and SNPsClean reads of five individuals were generated in the HybPiper pipeline to identify putative single-copy genes. Among all the genes recovered in two Hainan individuals (W. alternifolia WN & W. alternifolia DA), 220 and 229 sequences were at least 75% as long as the target gene. Only 9 and 17 sequences were less than 25% as long. In the individuals from Vietnam (W. alternifolia H, W. alternifolia DN and W. alternifolia QN), only 66, 51 and 39 sequences were more than 25% as long (Table S4). 237 genes were recovered in at least two of the five individuals, among which only 23 genes were recovered in all five individuals. Within those 237 genes, 1657 candidate SNPs were identified. Some of the single copy genes were specifically polymorphic, containing more than 35 SNPs (Table S5).
4. Discussion 4.1. Development of genomic resources in Wenchengia alternifoliaPlastid hotspots are plastome regions containing relatively more variations. The average Pi in all variable loci was 0.004. We selected the six most variable loci with a Pi > 0.01, including four inter-genic spacers (trnH-psbA, psbA-trnK, ndhG-ndhI and rps15-ycf1) and two coding genes (rpl22 and ndhE), which can be used as molecular markers in population genetic studies. All of these loci exist in the LSC and SSC regions. As shown in Fig. 5, the average nucleotide diversity in IR is lower than in LSC and SSC. TrnH-psbA is highly divergent in many plant groups. Along with rps15-ycf1, it was also identified as a hyper-variable regions among 11 species of Scutellaria (Zhao et al., 2020). NdhG-ndhI has the highest Pi of 0.021, but it is rarely divergent in other plants. In addition, deletions and insertions are evident in the plastome, including the loss of the ndhF gene.
The CandiSSR pipeline has been used in many studies to develop abundant intergeneric and intrageneric polymorphic gSSR markers that can be successfully amplified in target populations. We developed 1621 candidate polymorphic gSSRs markers in W. alternifolia, which not only have available primers but are also clear of their polymorphic status. Most of the polySSRs are di- and tri-nucleotides. The mean standard deviation of di-nucleotide SSRs is higher than in tri- and tetra-nucleotides, but they also vary greatly among different single SSRs.
The gene recovery rate in individuals of W. alternifolia from Hainan was higher than the average level and much higher than in individuals from Vietnam (Johnson et al., 2019; Li et al., 2014), perhaps because the sequencing depth was much higher, with 23 Gb clean reads of each individual in total. Although few genes were recovered in all five individuals, in at least two individuals, 237 genes were recovered that contained adequate number of candidate SNPs that would be useful for genotyping and for other population genetics studies.
4.2. Polymorphism in genomes and morphologyW. alternifolia is a rheophyte, a riparian plants exposed to frequent submergence and strong currents during sporadic flooding events after heavy rain (Li et al., 2014; van Steenis, 1987). Plastid hotspots in other plants seldom contain protein coding genes. 353 putative single-copy nuclear genes used as targets in this study are suspected of being conserved in most angiosperms. However, polymorphisms in the plastome coding genes and in the recovered nuclear genes of W. alternifolia were relatively high, which might be due to long-term isolation and/or to environmental selection pressures (Mitsui and Setoguchi, 2012). The sampled populations in Hainan and Vietnam show significant variation in morphology and habitat (Fig. 1, Fig. 2, Fig. 3), which calls for further studies on population genetics and adaptive evolution.
5. ConclusionsIn this study, five individuals from distinct populations of W. alternifolia were sampled for genome skimming. Using these data, we assembled complete plastomes and characterized the plastid hotspots, polymorphic gSSRs, low or single copy gene fragments and candidate single nucleotide polymorphisms (SNPs). The rich genomic information presented here will be available for further studies on the population genetics, local adaptation and conservation biology of the critically endangered Wenchengia alternifolia.
Author contributionsPL, BL and LY conceived the ideas; PL and LXY contributed to the sampling; QYZ, HXC, ZHL and TY performed the experiments and analyzed the data. The manuscript was written by QYZ, BL and PL, then revised by all the other authors.
Data availabilityThe raw reads that support the findings of this study have been deposited into CNGB Sequence Archive (CNSA) of China National GeneBank DataBase (CNGBdb) with accession number CNP0001750.
Declaration of competing interestThe authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
AcknowledgmentsThis research was supported by the National Natural Science Foundation of China (grant number 31970225 and 31900181), the Zhejiang Provincial Natural Science Foundation (grant number LY19C030007) and the Open Fund of Shanghai Key Laboratory of Plant Functional Genomics and Resources (PFGR202104). We thank Dr. Enhua Xia, Ms. Meizhen Wang and Dr. Xinjie Jin for helping with the analyses, Dr. David E. Boufford for revising the manuscript. We thank the Guangdong Provincial Academician Workstation of BGI Synthetic Genomics, BGI-Shenzhen, Guangdong, China and the China National GeneBank for producing the sequencing data. We are grateful to anonymous reviewers for the constructive comments.
Appendix A. Supplementary dataSupplementary data to this article can be found online at https://doi.org/10.1016/j.pld.2021.09.006.
Bae, K.M., Sim, S.C., Hong, J.H., et al., 2015. Development of genomic SSR markers and genetic diversity analysis in cultivated radish (Raphanus sativus L.). Hortic. Environ. Biotechnol., 56: 216-224. DOI:10.1007/s13580-015-0089-y |
Bankevich, A., Nurk, S., Antipov, D., et al., 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol., 19: 455-477. DOI:10.1089/cmb.2012.0021 |
Blair, M.W., Giraldo, M.C., Buendía, H.F., et al., 2006. Microsatellite marker diversity in common bean (Phaseolus vulgaris L. ). Theor. Appl. Genet., 113: 100-109. DOI:10.1007/s00122-006-0276-4 |
Bockelmann, A.C., Reusch, T.B.H., Bijlsma, R., et al., 2003. Habitat differentiation vs. isolation-by-distance: the genetic population structure of Elymus athericus in European salt marshes. Mol. Ecol., 12: 505-515. DOI:10.1046/j.1365-294X.2003.01706.x |
Calonje, M., Martín-Bravo, S., Dobeš, C., et al., 2009. Non-coding nuclear DNA markers in phylogenetic reconstruction. Plant Systemat. Evol., 282: 257-280. DOI:10.1007/s00606-008-0031-1 |
Dong, W., Liu, J., Yu, J., et al., 2012. Highly variable chloroplast markers for evaluating plant phylogeny at low taxonomic levels and for DNA barcoding. PLoS One, 7: e35071. DOI:10.1371/journal.pone.0035071 |
Drmanac, R., Sparks, A.B., Callow, M.J., et al., 2010. Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays. Science, 327: 78-81. DOI:10.1126/science.1181498 |
Gurevich, A., Saveliev, V., Vyahhi, N., et al., 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics, 29: 1072-1075. DOI:10.1093/bioinformatics/btt086 |
Harley, R., Atkins, S., Budantsev, A., et al., 2004. The families and genera of vascular plants. Labiatae, 6: 241-242. |
Hollingsworth, P.M., Graham, S.W., Little, D.P., 2011. Choosing and using a plant DNA barcode. PLoS One, 6: e19254. DOI:10.1371/journal.pone.0019254 |
Jin, J. -J., Yu, W. -B., Yang, J. -B., et al., 2020. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol., 21: 241. DOI:10.1186/s13059-020-02154-5 |
Johnson, M.G., Gardner, E.M., Liu, Y., et al., 2016. HybPiper: extracting coding sequence and introns for phylogenetics from high-throughput sequencing reads using target enrichment. Appl. Plant Sci., 4: 1600016. DOI:10.3732/apps.1600016 |
Johnson, M.G., Pokorny, L., Dodsworth, S., et al., 2019. A Universal probe set for targeted sequencing of 353 nuclear genes from any flowering plant designed using k-Medoids clustering. Syst. Biol., 68: 594-606. DOI:10.1093/sysbio/syy086 |
Kane, N., Sveinsson, S., Dempewolf, H., et al., 2012. Ultra-barcoding in cacao (Theobroma spp.; Malvaceae) using whole chloroplast genomes and nuclear ribosomal DNA. Am. J. Bot., 99: 320-329. DOI:10.3732/ajb.1100570 |
Katoh, K., Standley, D.M., 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol., 30: 772-780. DOI:10.1093/molbev/mst010 |
Kearse, M., Moir, R., Wilson, A., et al., 2012. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics, 28: 1647-1649. DOI:10.1093/bioinformatics/bts199 |
Li, P., Lu, R. -S., Xu, W. -Q., et al., 2017. Comparative genomics and phylogenomics of East Asian tulips (Amana, Liliaceae). Front. Plant Sci., 8: 10.3389/fpls. 2017.00451. DOI:10.3389/fpls.2017.00451 |
Li, B., Xu, W., Tu, T., et al., 2012. Phylogenetic position of Wenchengia (Lamiaceae): a taxonomically enigmatic and critically endangered genus. Taxon, 61: 392-401. DOI:10.1002/tax.612010 |
Li, B., Zhang, Z., Zhang, D., 2014. Conservation status of the unique population of Wenchengia alternifolia, an enigmatic plant endemic to Hainan Island, China. Oryx, 48: 354-357. DOI:10.1017/S0030605313001373 |
Kress, W.J., Wurdack, K.J., Zimmer, E.A., et al., 2005. Use of DNA barcodes to identify flowering plants. Proc. Natl. Acad. Sci. U.S.A., 102: 8369-8374. DOI:10.1073/pnas.0503123102 |
Liu, L. -X., Li, R., Worth, J.R.P., et al., 2017. The complete chloroplast genome of Chinese bayberry (Morella rubra, myricaceae): implications for understanding the evolution of Fagales. Front. Plant Sci., 8: 968. DOI:10.3389/fpls.2017.00968 |
Liu, L.X., Zhang, Y.H., Li, P., 2021. Development of genomic resources for the genus Celtis (Cannabaceae) based on genome skimming data. Plant Divers, 43: 43-53. DOI:10.1016/j.pld.2020.09.005 |
Lohse, M., Drechsel, O., Kahlau, S., et al., 2013. OrganellarGenomeDRAW—a suite of tools for generating physical maps of plastid and mitochondrial genomes and visualizing expression data sets. Nucleic Acids Res., 41: W575-W581. DOI:10.1093/nar/gkt289 |
Mitsui, Y., Setoguchi, H., 2012. Recent origin and adaptive diversification of Ainsliaea (Asteraceae) in the Ryukyu Islands: molecular phylogenetic inference using nuclear microsatellite markers. Plant Systemat. Evol., 298: 985-996. DOI:10.1007/s00606-012-0608-6 |
Paton, A., Phillipson, P., Suddee, S., 2016. Records of Wenchengia (Lamiaceae) from Vietnam. Biodivers. Data J., 4: e9596. DOI:10.3897/BDJ.4.e9596 |
Qin, M., Zhu, C., Yang, J., et al., 2020. Comparative analysis of complete plastid genome reveals powerful barcode regions for identifying wood of Dalbergia odorifera and D. tonkinensis (Leguminosae). J. Syst. Evol.: 12598. DOI:10.1111/jse.12598 |
Rozas, J., Ferrer-Mata, A., Sánchez-DelBarrio, J.C., et al., 2017. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol. Biol. Evol., 34: 3299-3302. DOI:10.1093/molbev/msx248 |
Ståhlberg, D., 2007. Habitat differentiation, hybridization and gene flow patterns in mixed populations of diploid and autotetraploid Dactylorhiza maculata s. l. (Orchidaceae). Evol. Ecol., 1: 295. DOI:10.1007/s10682-007-9228-y |
Untergasser, A., Cutcutache, I., Koressaar, T., et al., 2012. Primer3—new capabilities and interfaces. Nucleic Acids Res., 40: e115. DOI:10.1093/nar/gks596 |
van Steenis, C.G.G.J., 1987. Rheophytes of the world: supplement. Allertonia, 4: 267-330. |
Wu, C.Y., Chow, S., 1965. Duo taxa nova Labiatarum. J. Systemat. Evol., 10: 249-256. |
Xia, E.H., Yao, Q.Y., Zhang, H.B., et al., 2016. CandiSSR: an efficient pipeline used for identifying candidate polymorphic SSRs based on multiple assembled sequences. Front. Plant Sci., 6: 1171. DOI:10.3389/fpls.2015.01171 |
Zhao, F., Chen, Y.P., Salmaki, Y., et al., 2021. An updated tribal classification of Lamiaceae based on plastome phylogenomics. BMC Biology., 19: 2. DOI:10.1186/s12915-020-00931-z |
Zhao, F., Li, B., Drew, B.T., et al., 2020. Leveraging plastomes for comparative analysis and phylogenomic inference within Scutellarioideae (Lamiaceae). PLoS One, 15: e0232602. DOI:10.1371/journal.pone.0232602 |