Chromosome-level genome and population resequencing reveal genetic diversity, demographic history, and climate risk in the East Asian relict tree Perkinsiodendron macgregorii
Jiaxin Lia,b,c, Lihua Yanga,c, Danqi Lid, Chen Fengd,**, Ming Kanga,b,c,*     
a. State Key Laboratory of Plant Diversity and Specialty Crops, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China;
b. University of Chinese Academy of Sciences, Beijing 100049, China;
c. Key Laboratory of National Forestry and Grassland Administration on Plant Conservation and Utilization in Southern China, Guangzhou 510650, China;
d. Jiangxi Provincial Key Laboratory of ex situ Plant Conservation and Utilization, Lushan Botanical Garden, Chinese Academy of Sciences, Jiujiang 332900, China
Abstract: Relict trees are key to understanding macroevolution, biogeography, and extinction risk under environmental change, yet genome-wide data on diversity and demography remain scarce. Perkinsiodendron macgregorii, a monotypic East Asian relict within Styracaceae, is of notable horticultural and conservation significance. Here, we assembled a chromosome-level genome (~1.15 Gb) and resequenced 167 individuals from 30 populations across its range. We performed a comparative analysis of single nucleotide polymorphisms (SNPs) and structural variants (SVs) to assess population structure, mutation load, climate-associated variation, and genomic vulnerability. Both SNP and SV datasets consistently resolved two evolutionarily cohesive lineages broadly separated by the Wuyi Mountains. Demographic reconstructions dated East–West divergence to ~0.16 Ma and revealed a long-term decline in effective population size (Ne) since the Late Pleistocene, with lineage-specific fluctuations through the Quaternary. Genome-wide diversity was moderate overall, but the West lineage showed elevated inbreeding and realized genetic load. Relative to SNPs, SVs showed higher proportions of HIGH-impact mutations (i.e., variants more likely to disrupt gene function), consistent with the larger genomic span of SVs and their greater potential deleterious effects. Functional enrichment of core adaptive variants (755 SNPs, 63 SVs) revealed divergent adaptive signals across marker classes. Genomic offset and RONA projections were concordant and highlighted western Jiangxi and southwestern Hunan as future vulnerability hotspots, with risk increasing under higher-emissions scenarios. Together, these results support recognizing East and West lineages as primary conservation units and prioritizing at-risk West populations (JXJGS, HNSHS) for management, while safeguarding genetically distinctive populations (e.g., JXGS, JXYJF) as secondary units to preserve evolutionary potential.
Keywords: Demographic history    Genetic load    Genomic vulnerability    Perkinsiodendron macgregorii    Relict tree    Structural variants    
1. Introduction

Relict species—lineages that have persisted as range-restricted remnants of formerly widespread clades—offer rare, valuable windows into macroevolution, historical biogeography, and extinction risk under contemporary environmental change (Grandcolas et al., 2014). They are striking because many retain conservative morphologies reminiscent of their fossil relatives, yet continue to evolve genetically and ecologically (Grandcolas et al., 2014; Shear and Werth, 2014). In contrast to their broad Tertiary distributions across the Northern Hemisphere, most extant Tertiary relicts are now confined to refugial regions in East Asia, North America, and southwestern Eurasia, typically occurring as small, isolated populations shaped by complex geology and Quaternary climate oscillations (Milne and Abbott, 2002; Milne, 2006; Qiu et al., 2011; Tang et al., 2018). Among these regions, East Asia harbors an exceptional concentration of relict trees and “living fossil” genera—such as Metasequoia glyptostroboides, Ginkgo biloba, Davidia involucrata, and Tetracentron sinense—that occur in forests notable for high biodiversity and endemism (Manchester et al., 2009; Lu et al., 2018; Zhu et al., 2020). Understanding how such relicts diversified, persisted, and currently respond to rapid climate change remains a central challenge in evolutionary biology and conservation genomics.

Over the past two decades, phylogeographic studies in China and adjacent regions have collectively shown that repeated glacial–interglacial cycles reshaped ranges and connectivity, yielding genetically structured lineages associated with long-term refugia and dispersal corridors shaped by mountainous topography and monsoon dynamics (Wang et al., 2009; Qiu et al., 2011; Sakaguchi et al., 2012; Sun et al., 2014; Zhang et al., 2016). Complementary ecological niche modeling has further highlighted “long-term stable refugia” that likely buffered relict plants during Pleistocene climate variability, leaving a legacy of divergent intraspecific lineages that persist in parapatry or secondary contact (Tang et al., 2018). However, much of this work has relied on limited molecular markers, which often lack power to resolve genome-wide patterns of diversity, selection and adaptation, and provide limited insight into genetic basis of environmental adaptation and genetic load (Ellegren, 2014; Song et al., 2023; Wang et al., 2023).

The genomics era has transformed what can be inferred about relict trees. Chromosome-level reference genomes and dense resequencing now enable precise estimation of genome-wide diversity, population structure, runs of homozygosity (ROH) and inbreeding, mutational load, and demographic history (Ellegren, 2014; Song et al., 2023; Lin et al., 2025). Whole-genome resequencing of multiple East Asian relict trees has revealed cryptic intraspecific divergences, recurrent hybridization at lineage contact zones, and heterogeneous lineage-specific demography—features with direct implications for delimiting conservation units and assessing vulnerability. In the “living fossil” dove tree Davidia involucrata, population-genomic analysis has uncovered three deeply divergent local lineages (emerging ca. 3.1–0.3 million years ago; Ma) and frequent hybridization among them, highlighting the interplay of geography, climate oscillations, and secondary contact in structuring diversity (Ren et al., 2024). Failure to recognize such cryptic lineages biases predictions of environmental responses and misallocates conservation resources (Ren et al., 2024). Comparable conclusions have been reported for Tetracentron sinense, which shows multiple divergent lineages and hybridization and lineage-specific demography. Genotype–environment association and genomic offset analyses have further delineated the populations most vulnerable under future climates (Jing et al., 2025). Broadly, Pleistocene climate fluctuations have strongly shaped divergence and demography across East Asian relicts, with repeated expansions and contractions aligned with glacial–interglacial cycles and refugial dynamics (Jing et al., 2025). Nevertheless, the evolutionary consequences of Quaternary change are far from uniform across species, and for many relicts, genome-scale assessments of demographic history, genetic diversity, inbreeding, and genomic vulnerability are still lacking—information essential for both robust evolutionary inference and conservation planning.

While most plant population-genomic inferences have historically relied on single-nucleotide polymorphisms (SNPs), mounting evidence indicates that structural variants (SVs)—including insertions, deletions, copy-number variants (CNVs), inversions, and translocations—constitute a pervasive, functionally consequential layer of standing variation that can strongly influence adaptation, demography, and genetic load (Feulner et al., 2013; Collins et al., 2020; Sang et al., 2022; Stuart et al., 2024; Zhang et al., 2024; Lin et al., 2025; Long et al., 2025). SVs can rewire gene dosage and regulation, alter recombination landscapes, and generate large-effect alleles underlying ecological differentiation and reproductive barriers (Wellenreuther et al., 2019). Empirical evidence increasingly links SVs to local adaptation and multiple components of vulnerability. For example, CNVs can surpass SNPs in detecting environment–genotype associations in marine systems and outperform them in capturing temperature-associated genomic variation (Dorant et al., 2020). In plants, SVs have been shown to both constrain and facilitate adaptation in natural populations, consistent with their dual capacity to generate beneficial novelty while also perturbing genome integrity (Hämälä et al., 2021). Pan-genome studies now underscore the role of SVs as major drivers of gene-content diversity and environmental adaptation across plant lineages, strengthening the case for routine SV interrogation in conservation genomics (Liang et al., 2025). Critically, SVs contribute to realized genetic load and fitness variation: in the inbred Scandinavian wolf population, structural genomic variation measurably contributed to realized genetic load, underscoring the need to quantify SV burden alongside deleterious SNPs when assessing population viability (Smeds et al., 2024). Because SNPs and SVs capture complementary facets of genomic variation, integrating both data types is critical for a comprehensive view of diversity, demography, adaptation, and vulnerability in relict trees.

Perkinsiodendron macgregorii (Styracaceae) is a monotypic East Asian relict that exemplifies these needs. The species was originally described within Halesia, a classic East Asia–North America disjunct genus comprising two species in the eastern United States (H. carolina, H. diptera) and a single species in China (H. macgregorii) (Fritsch et al., 2016). However, combined morphological evidence and multilocus phylogenies show that the East Asian taxon formerly known as H. macgregorii is sister to Rehderodendron, whereas the two North American Halesia species cluster with Pterostyrax hispidus. Accordingly, Fritsch et al. (2016) erected the monotypic genus Perkinsiodendron to reflect these evolutionary relationships, and subsequent family-level treatments and plastid phylogenies in Styracaceae clarified generic boundaries and corroborated this revision (Fritsch, 2001; Yan et al., 2018). Perkinsiodendron macgregorii occurs in montane forests and forest margins at elevations of ~700–1200 m in southeastern China (Chen and Chen, 1996; Fritsch et al., 2016) and is valued horticulturally for its straight trunk, abundant and fragrant white blossoms, uniquely shaped fruits, and striking red autumn foliage (Fig. 1a). Despite its apparently broad regional distribution, populations are typically small and narrowly localized, with limited regeneration linked to defective embryo development, low germination rates, and recalcitrant seeds (Liao et al., 2024). Although prior work has addressed seed biology and germination (Liao et al., 2024), phylogenetic placement within Styracaceae (Fritsch, 2001; Fritsch et al., 2016; Yan et al., 2018), and ecological niche modeling (Yan et al., 2022), genomic resources for P. macgregorii remain limited. No genome-wide studies have examined the genetic diversity, demographic history, or genomic vulnerability of P. macgregorii—information essential for evaluating genetic health and predicting adaptive potential under future climates.

Fig. 1 Morphology, chromosome-level assembly, and comparative genomics of Perkinsiodendron macgregorii. (a) Floral and seed morphology of P. macgregorii. (b) Circos view of the chromosomes showing (outer to inner tracks) chromosome length, LTR-Copia coverage, LTR-Gypsy coverage, gene density, GC content, and intragenomic synteny. Darker shading indicates higher values; lighter shading indicates lower values. (c) Maximum-likelihood species tree from low-copy orthologs across 15 angiosperms with divergence times (Ma); numbers denote gene-family expansions (red) and contractions (blue). (d) Macrosynteny between P. macgregorii and Sinojackia xylocarpa. (e) Distributions of synonymous substitution rates (Ks) for syntenic paralogs and orthologs.

Here, we combine a chromosome-level reference genome with population resequencing to address these questions across the species' range. We generate a high-quality genome assembly for Perkinsiodendron macgregorii and analyze whole-genome resequencing data from 167 individuals representing 30 populations spanning its distribution. Specifically, we aim to: (ⅰ) produce and annotate a reference genome suitable for joint SNP and SV discovery; (ⅱ) resolve lineage divergence and demographic history to contextualize Perkinsiodendron within the context of East Asian relict diversification and potential secondary contact among lineages; (ⅲ) quantify genome-wide diversity, inbreeding, and mutational load at both SNPs and SVs to assess the current genetic health at lineage and population scales; and (ⅳ) evaluate SNP- and SV-based signals of local adaptation and project genomic vulnerability under future climates. Our results provide an integrative population-genomic baseline for P. macgregorii and demonstrate that integrating SNPs and SVs sharpens evolutionary inference and improves conservation assessments in East Asia's relict flora.

2. Materials and methods 2.1. Genome assembly and quality assessment

Fresh young leaves were collected from a wild Perkinsiodendron macgregorii individual in Shaoguan City, Guangdong Province, China (113°2′3″E, 24°52′54″N). For PacBio long-read sequencing, HiFi libraries were sequenced on the PacBio Sequel Ⅱ platform (Pacific Biosciences, Menlo Park, CA, USA). Short-read libraries (350-bp insert) were constructed and sequenced on the Illumina NovaSeq 6000 platform (Illumina, San Diego, CA, USA), generating 150 bp paired-end reads. To achieve chromosome-scale assembly, a Hi-C library was prepared using the DpnII restriction enzyme and sequenced on the Illumina NovaSeq 6000 platform. To facilitate gene annotation, total RNA was extracted from four tissues (flower, branch, leaf, and bud), and subsequent transcriptome libraries were sequenced on the Illumina NovaSeq 6000 platform.

Genome characteristics (size, heterozygosity, and repeat content) were surveyed using GCE v.1.0.2 (Liu et al., 2013). HiFi reads were assembled using hifiasm v.0.20.0 (Cheng et al., 2021), and redundant sequences were removed with purge_dups v.1.2.5 (Guan et al., 2020). Chromosome-scale scaffolding used the HapHiC v.1.0.6 (Zeng et al., 2024) pipeline, and scaffolds were then adjusted for scaffolding errors in Juicebox v.2.15 (Durand et al., 2016). To evaluate the quality of the genome assembly, we estimated the Benchmarking Universal Single-Copy Orthologs (BUSCO) score using BUSCO v.5.8.0 (Manni et al., 2021) with the embryophyta_odb10 database, and calculated the Illumina short-read mapping rate and coverage using BWA v.0.7.18 (Li and Durbin, 2009) and SAMtools v.1.20 (Li et al., 2009). Additionally, to assess the quality of repetitive genome regions, LTR_FINDER v.1.0.7 (Xu and Wang, 2007) and GenomeTools v.1.6.5 (Gremme et al., 2013) were used to predict LTR-RTs, and LTR_retriever v.3.0.1 (Ou and Jiang, 2018) was then used to calculate LTR Assembly Index (LAI).

2.2. Genome annotation

Prior to gene prediction, repetitive elements were identified using EDTA v.2.2.2 (Ou et al., 2019). Protein-coding genes were predicted utilizing BRAKER3 (Gabriel et al., 2024) by integrating homolog-based, ab initio, and RNA-seq-assisted evidence. Input data included protein sequences from five plant species (Actinidia chinensis, Arabidopsis thaliana, Nyssa sinensis, Rhododendron simsii, Sinojackia xylocarpa) and RNA-seq reads generated herein. Predicted gene models encoding proteins <100 amino acids were removed. The longest transcript per gene was retained, and annotation completeness was assessed using BUSCO with the embryophyta_odb10 database. Gene functions were inferred using eggNOG-mapper, Pannzer2, and SwissProt/TrEMBL.

2.3. Comparative genomic analyses

To infer whole-genome duplication (WGD) events, we employed WGDI (Sun et al., 2022) to calculate the distribution of synonymous substitutions per synonymous site (Ks). Genome synteny between Perkinsiodendron macgregorii and Sinojackia xylocarpa was identified using MCScanX (Wang et al., 2012) and visualized using JCVI (Tang et al., 2024). OrthoFinder v.3.1.0 (Emms and Kelly, 2019) was applied to identify single-copy and low-copy orthologous (≤ 2 copies) orthogroups across P. macgregorii and 14 other angiosperms. Multiple sequence alignments of low-copy orthologous proteins were generated with MAFFT v.7.525 (Katoh and Standley, 2013), and poorly aligned regions were filtered out using trimAL v.1.5.0 (Capella-Gutiérrez et al., 2009). IQ-Tree v.3.0.1 (Wong et al., 2025) was used to construct a maximum-likelihood (ML) phylogenetic tree. Divergence times were estimated using MCMCTree in PAML v.4.10.9 (Yang, 2007). Three calibration points were chosen from the TimeTree website (http://www.timetree.org/), including Oryza sativaTetracentron sinense (142.1–163.5 million years ago; Ma), V. viniferaDipteronia sinensis (109.8–124.4 Ma), and S. xylocarpaRhododendron simsii (82.8–106.0 Ma). Gene family expansion and contraction were analyzed using CAFE v.5.1.0 (Mendes et al., 2021).

2.4. Genome resequencing and variant calling

Leaf samples from 167 individuals were collected across 30 populations of Perkinsiodendron macgregorii. Genomic DNA was extracted from silica-dried leaves using the CTAB method (Doyle and Doyle, 1987) and sequenced on the DNBSEQ-T7 platform to generate 150-bp paired-end reads. Raw reads were filtered using fastp v.0.23.4 (Chen et al., 2018) to remove adaptors and low-quality bases, and then the clean reads were mapped to the reference genome with BWA v.0.7.18 and sorted using SAMtools v.1.20. The MarkDuplicate and addOrReplaceReadGroup tools from Picard v.2.18.29 (https://github.com/broadinstitute/picard) were used to mark PCR duplicates and assign read group information, respectively. The resulting BAM files were used for downstream variant calling. SNP calling and joint genotype calling were performed using HaplotypeCaller and CombineGVCFs in the Genome Analysis Toolkit (GATK) v.4.5 (McKenna et al., 2010), respectively. Hard filters were applied, and variants were further refined with VCFtools v.0.1.16 (Danecek et al., 2011) by removing sites with > 20% missing data and mean depth < 5 or > 100, yielding 767,554,148 variant sites (Dataset 2). Subsequently, SNPs with a minor allele frequency (MAF) > 0.05 were retained using VCFtools, resulting in 22,282,414 sites (Dataset 3). Finally, linkage disequilibrium (LD) pruning was performed using PLINK (Chang et al., 2015) (r2 > 0.2), generating 2,563,601 SNPs (Dataset 4).

Structural variants (SVs) were identified using SMOOVE v.0.2.8 (https://github.com/brentp/smoove), DELLY v.1.3.2 (Rausch et al., 2012), and MANTA v.1.6.0 (Chen et al., 2016). For DELLY and SMOOVE, SVs were first called independently per individual and then merged into one a cross-sample, nonredundant SV set, which was subsequently used for joint genotyping across all samples. BCF files generated by DELLY were converted to VCF format using bcftools. Due to computational constraints, MANTA was run per individual using the default workflow (configManta.py; runWorkflow.py), and inversions were additionally extracted from breakend calls using convertInversion.py. Following Jeffares et al. (2017), calls from the three methods were then merged with SURVIVOR v.1.0.7 (parameters: 1000 2 1 1 0 50), harmonizing 0/0, 0/1, and 1/1 classes; retained variants were then merged into a joint sample file using SURVIVOR with the same parameters. To ensure high-quality SVs, we filtered sites with quality scores ≤ 30 and a missing rate >20% using VCFtools. After filtering, a total of 10,357 SVs were retained. Then, VCFtools was used to filter SVs with MAF < 0.02, generating 8549 SVs. LD pruning was performed using PLINK v.1.9 (r2 > 0.2), resulting in 5575 SVs for downstream analyses.

2.5. Population structure and genetic diversity

After LD pruning, we excluded the core adaptive variants (i.e., SNPs and SVs identified by genotype–environment association analyses) from the pruned SNP and SV VCFs using VCFtools. The resulting “neutral” SNP and SV datasets were then analyzed separately to infer population structure within the species. We investigated population structure using ADMIXTURE v.1.3.0 (Alexander and Lange, 2011) with the number of genetic clusters (K) ranging from 1 to 10. Principal component analysis (PCA) was performed on all individuals using GCTA v.1.26.0 (Yang et al., 2011). Subsequently, VCF2Dis (Xu et al., 2025) was used to calculate pairwise genetic distances based on Dataset 3, and a neighbor-joining (NJ) tree was then constructed using FastME (Lefort et al., 2015). Next, the identity-by-state (IBS) kinship matrix between individuals was analyzed using VCF2PCACluster v.1.41 (He et al., 2024) based on Dataset 2. Additionally, the D-statistic was estimated using Dsuite v.0.5 r58 (Malinsky et al., 2021) to perform the ABBA–BABA test among populations.

To evaluate genetic diversity, we calculated unbiased nucleotide diversity (π), absolute diversity (DXY), and fixation statistics (FST) with PIXY v.1.2.11 (Korunes and Samuk, 2021) using Dataset 2 in a sliding window of 200 kb. Whole-genome heterozygosity (He) was estimated using realSFS implemented in ANGSD v.1.9 (Korneliussen et al., 2014) for each individual. Linkage disequilibrium (LD) decay was analyzed using PopLDdecay v.3.43 (Zhang et al., 2019).

2.6. Demographic history

We used three complementary approaches to reconstruct the demographic history of Perkinsiodendron macgregorii. First, the pairwise sequentially Markovian coalescent (PSMC) model (Li and Durbin, 2011) was applied to infer effective population size (Ne) trajectories from genome-wide heterozygosity, using the highest-coverage individuals from representative populations of each lineage. Due to no direct data are available on age at first reproduction or generation time for P. macgregorii, generation time was set to 10 years and the mutation rate to 2.5 × 10−8 per site per generation, following estimates of Sinojackia xylocarpa (Zhu et al., 2024). To capture more recent dynamics, we additionally inferred Ne fluctuations using Stairway Plot v.2.1.2 (Liu and Fu, 2020) and SMC++ (Terhorst et al., 2017), applying the same generation time and mutation rate.

2.7. Inbreeding and mutation load

Runs of homozygosity (ROHs) were identified using PLINK. ROHs were defined as segments > 10 kb and classified as short (10 < ROH ≤ 100 kb), medium (100 < ROH ≤ 200 kb), and long (> 200 kb). For each individual, the inbreeding coefficient based on ROHs (FROH; Kardos et al., 2015) was calculated as the proportion of the genome covered by ROHs > 10 kb.

To estimate genetic load, SnpEff v.5.2e (Cingolani et al., 2012) was used to annotate SNPs and SVs for each individual. An individual of Sinojackia xylocarpa was used as the outgroup to polarize alleles (ancestral vs. derived). For SNPs, variants annotated as synonymous were defined as synonymous (SYN) mutations. Nonsynonymous mutations were categorized by Grantham score (GS; Grantham, 1974) as tolerated (TOL; 5 < GS < 150) or deleterious (DEL; GS ≥ 150). Putative loss-of-function (LOF) mutations were identified with SnpSift using the criterion “LOF[*].PERC = 1.00”. For SVs, SnpEff impact categories were used to classify variants by predicted effect, and we summarized per-individual proportions across HIGH, MODERATE, and LOW impact classes. For all SNP classes (TOL, DEL, LOF), we tabulated per-individual proportions of derived homozygotes and heterozygotes. We refer to the proportion of derived homozygotes as “realized load” and to the proportion of derived heterozygotes as “masked load”.

2.8. Genomic vulnerability analysis

We identified climate-associated variants using two complementary methods applied to SV and LD-filtered SNP datasets, with SVs phased and imputed using Beagle v.5.4 (Browning and Browning, 2009). Nineteen bioclimatic variables were downloaded from WorldClim 2.1 at 30-arc-second (~1 km) resolution (Fick and Hijmans, 2017). First, latent factor mixed models (LFMMs) were run in the R package LEA (Frichot and François, 2015), with the number of latent factors set to the K inferred with ADMIXTURE; associations were retained at a Benjamini–Hochberg FDR ≤ 0.05. Second, redundancy analysis (RDA) was used to test multilocus genotype–environment associations with the R package vegan (Dixon, 2003). To reduce collinearity, predictors were ranked with gradientForest (Ellis et al., 2012), and five variables with pairwise correlation |r| < 0.6 were retained (Table S1). RDA outliers were defined as variants with loading beyond the mean ± 3 SD. Variants detected by both LFMM and RDA were retained as “core adaptive variants” for genomic vulnerability analysis. Core adaptive variants were annotated using SnpEff, and gene ontology (GO) enrichment was performed with the R package clusterProfile (Wu et al., 2021).

To partition the roles of geography and environment in shaping genetic variation of core adaptive and neutral variants, we performed Mantel and partial Mantel tests to assess isolation by distance (IBD) and isolation by environment (IBE), with significance determined from 999 permutations in the R package vegan. For partial Mantel tests, we tested IBE by correlating genetic distance with environmental distance while controlling for geographic distance, and tested IBD by correlating genetic distance with geographic distance while controlling for environmental distance.

The potential distribution range of Perkinsiodendron macgregorii was inferred using ecological niche modeling (ENM) with MaxEnt v.3.4.4 (Phillips and Dudík, 2008). For genomic offset under future climate change, we considered two Shared Socioeconomic Pathway (SSP) scenarios (SSP126, SSP585) for two periods (2021–2040, 2081–2100) under the BCC-CSM2_MR model. We used the R package gradientForest to estimate genetic offset under the different future scenarios with five environmental variables, and then plotted them in ArcGIS v.10.8 (ESRI, Redlands). Genetic offset was calculated as the Euclidean distance in gradientForest-transformed genomic composition between present and future climates. Risk of non-adaptation (RONA) was computed as a weighted average across core adaptive variants and the five climate variables, following the methods outlined by Sang et al. (2022). Local, forward, and reverse offsets were quantified with gradientForest following Gougherty et al. (2021): local offset is the minimal displacement cost within the current range; forward offset is the minimal future location matching each population's current genomic–environmental composition; reverse offset is the set of present-day locations matching projected future composition.

3. Results 3.1. Genome assembly and comparative genomics

We generated a chromosome-level assembly of the Perkinsiodendron macgregorii genome using 73.54 Gb of Illumina short reads (65 ×), 48.5 Gb of PacBio HiFi reads (43 ×), and 207.14 Gb of Hi-C data (182 ×) (Table S2). A 17-mer analysis of the Illumina reads estimated a genome size of ~1.20 Gb (Table S3 and Fig. S1). The final assembly spans ~1.15 Gb (contig N50 = 7 Mb; scaffold N50 = 86 Mb), with 92.33% of the assembly anchored to 12 pseudochromosomes (Table S4; Figs. 1b and S1). Quality metrics indicate a reference-grade genome assembly (Illumina read-mapping rate = 91.5%; LAI = 17.9; BUSCO completeness = 98.1%; Table S5). Repeats account for 60.65% of the genome, including Gypsy (18.37%) and Copia (3.69%) elements (Table S6). Gene prediction identified 31,851 protein-coding genes, of which 30,767 (96.60%) were functionally annotated; BUSCO recovered 1572 complete genes (97.4%) in the annotation set (Tables S7 and S8), indicating a high level of annotation completeness.

Comparative analyses support conserved collinearity within Styracaceae. Synteny with Sinojackia xylocarpa revealed extensive chromosome-scale collinearity (Figs. 1d and S2). Ks distributions showed a peak at ~1.35 that is shared with Vitis vinifera, consistent with the core eudicot whole-genome triplication (WGT), and a second peak at ~0.47 shared by Perkinsiodendron macgregorii and S. xylocarpa, indicating that there is no lineage-specific WGD in P. macgregorii (Fig. 1e). Orthologous syntenic blocks between P. macgregorii and S. xylocarpa showed an orthologous Ks peak at ~0.04, suggesting relatively recent divergence between the two species (Fig. 1e). Phylogenetic analyses based on 655 low-copy orthologs from 15 angiosperms recovered P. macgregorii and S. xylocarpa as sister lineages that diverged ~10.0 Ma (95% HPD: 17.0–4.1 Ma) (Figs. 1c and S3).

Gene family analysis identified 1305 expansions and 526 contractions in Perkinsiodendron macgregorii (Tables S9 and S10). Expanded families were enriched for epigenetic regulation of gene expression (GO:0040029), post-embryonic plant morphogenesis (GO:0090698), nuclear speck (GO:0016607), cellular hormone metabolic process (GO:0034754). These functional categories plausibly contribute to survival in heterogeneous and stressful environments by promoting regulatory plasticity and developmental flexibility under fluctuating conditions (Liu and Zhong, 2024; Srikant and Drost, 2026). Consistent with this interpretation, several members within expanded families have documented roles in flowering and seasonal regulation (VIL1; Sung et al., 2006), RNA-mediated regulation and immunity (PRP16; Qiao et al., 2015), DNA demethylation and genotoxic-stress responses (ROS1; Qüesta et al., 2013), nutrient recycling via autophagy under limitation (ATG11; Li et al., 2014), and specialized metabolism linked to defense (C71Z1; Krieger et al., 2018). In contrast, contracted families were enriched for anthocyanin and flavonoid biosynthesis (GO:0009718; GO:0009813), nucleotidyltransferase activity (GO:0016779), and auxin polar transport (GO:0009926), which may reflect lineage-specific shifts in secondary metabolism and growth regulation, although their ecological consequences require functional validation.

3.2. Population structure, gene flow and genomic diversity

Whole-genome resequencing of 167 individuals yielded an average mapping rate of 92.6% and a mean coverage of 31 × (Table S11). ADMIXTURE (K = 2) resolved two primary lineages that were consistent with their sampling geography—East and West—with finer substructure within each lineage in both SNP and SV datasets (Figs. 2a, b and S4). This clustering pattern was also corroborated by PCA, a maximum-likelihood phylogeny, and identity-by-state (IBS) analysis (Figs. 3a and S4). Notably, the JXGS and JXYJF populations were strongly differentiated across methods (Figs. 3a and S4). Mantel (r = 0.388, P = 0.001) and partial Mantel (r = 0.359, P = 0.001) tests supported isolation by distance (IBD) across the species’ range (Fig. 2c and d), with significant IBD within the West lineage (r = 0.490, P = 0.012) but not within the East lineage (r = 0.033, P = 0.407) (Fig. S5). D-statistics (Dsuite) indicated inter-lineage introgression between East and West, as well as more frequent intra-lineage gene flow among West populations than among East populations (Fig. S6).

Fig. 2 Population distribution and genetic structure of Perkinsiodendron macgregorii. (a) Sampling locations of 30 populations; the colored pies show ancestral genetic components inferred by ADMIXTURE at K = 2. (b) Individual ancestry bar plots based on SNP and SV datasets at K = 2. (c) Mantel tests of environment-genetic distance for neutral (red) and adaptive (blue) variants. (d) Mantel tests of geographic-genetic distance for neutral (red) and adaptive (blue) variants.

Fig. 3 Demographic history and linkage patterns of Perkinsiodendron macgregorii. (a) Phylogenetic relationship diagram of all resequenced individuals. (b) Linkage disequilibrium (LD) decay for the East and West lineages. (c) Effective population size (Ne) trajectories inferred with the pairwise sequentially Markovian coalescent (PSMC); lines show East (blue) and West (red). (d) Stairway plot 2 reconstructions of recent Ne dynamics.

Genome-wide nucleotide diversity (π) for Perkinsiodendron macgregorii was 0.0102, with higher diversity in the East lineage (0.0103) than the West lineage (0.0086). Population-level π ranged from 0.0048 (JXJGS) to 0.0098 (FJYX), and per-individual heterozygosity ranged from 0.0073 (JXJGS) to 0.0132 (FJXY) (Table S12). Pairwise differentiation among populations ranged from FST = 0.0284 to 0.5222 and from DXY = 0.0076 to 0.0113 (Fig. S7), reflecting pronounced East–West divergence (mean FST = 0.1171; mean DXY = 0.0107). Linkage disequilibrium (LD) decay was faster in the East than in the West, suggesting a larger effective population size (Ne) and/or higher recombination in the East lineage (Fig. 3b).

3.3. Demographic history

PSMC analyses for both lineages indicated modest growth during the late Miocene followed by a long-term decline (Fig. 3c). The East lineage exhibited a slight expansion from ~0.1 to 0.07 Ma, followed by a bottleneck at ~0.07 Ma and a subsequent expansion centered at ~0.03 Ma before stabilization (Fig. 3c). By contrast, the West lineage continued declining until ~20 thousand years ago (Ka), followed by an expansion around ~10 Ka, and then stabilized (Fig. 3c). At the onset of the Naynayxungla Glaciation (NG), the East lineage had a larger Ne than the West. SMC++ dated East–West divergence to ~0.16 Ma (Fig. S8). Stairway Plot 2 inferred post-LGM (26.5–19 Ka) stability followed by a continuous decline beginning ~3 Ka, with a more pronounced decline in the West (Fig. 3d).

3.4. Inbreeding and genetic load

Short ROHs were more frequent than medium and long ROHs across the species (Fig. S9), indicating predominantly ancient inbreeding with additional evidence for recent inbreeding. Across lineages, the West lineage harbored significantly more short, medium, and long ROHs than the East lineage (Fig. S9). The genome-wide inbreeding coefficient based on ROHs (FROH) ranged from 0.0116 to 0.1937; lineage means were higher in the West (0.054) than in the East (0.030), with JXJGS showing the highest values, followed by HNSHS (Fig. S9). Consistent with the expectation, FROH and per-individual heterozygosity were negatively correlated (Fig. S10).

To estimate genetic load, SNPs were partitioned into synonymous (SYN), tolerated (TOL; 5 < GS < 150), deleterious (DEL; GS ≥ 150), and loss-of-function (LOF) classes, and SVs were summarized by predicted impact (LOW/MODERATE/HIGH). Relative to the East, the West lineage showed higher proportions of homozygous TOL and DEL sites and higher realized (homozygous) load overall, despite smaller heterozygous burdens (Fig. 4a). JXJGS (followed by HNSHS) had the highest proportions across mutation categories and exhibited the highest realized load with the lowest masked load, consistent with conversion of masked to realized load under strong inbreeding (Fig. 4b). Both JXJGS and HNSHS had elevated proportions of homozygous LOF mutations within ROHs, implicating recent inbreeding in elevating LOF homozygosity (Fig. 4c). SVs showed a higher proportion of HIGH-impact sites than SNPs (Fig. S11), highlighting their potential to disproportionately affect gene function and fitness.

Fig. 4 Mutation load and homozygous LOF within ROHs for Perkinsiodendron macgregorii. (a) Ratio of homozygous and heterozygous to synonymous sites and the metric 2hom/(2hom + hete) representing the proportion of homozygous mutations in the coding region of derived mutations for tolerated (TOL), deleterious (DEL), and loss-of-function (LOF) mutations in the East and West lineages. (b) Per population summaries of TOL, DEL, and LOF burdens and partitioning of masked (heterozygous) and realized (homozygous) load. (c) Proportion of homozygous LOF variants occurring within runs of homozygosity (ROH) segments.
3.5. Genotype–environment associations and genomic vulnerability

LFMM analysis identified 1884 SNPs and 317 SVs associated with one or more of 19 bioclimatic variables (FDR ≤ 0.05) (Fig. S12 and S13). RDA using five uncorrelated variables (Bio3, Bio8, Bio9, Bio18, Bio19) recovered 9801 SNPs and 170 SVs (Fig. S14). The intersection of the two approaches yielded 755 SNPs and 63 SVs as “core” adaptive variants (Fig. S12 and S13). Most adaptive SNPs and SVs were located in non-coding regions (96.29% and 96.83%, respectively). Among the coding SNPs (3.71%), non-synonymous changes accounted for 35.71% (1.32% of all adaptive variants). Genes associated with core adaptive variants were enriched, for SNP-linked genes (within gene bodies or ±2 kb flanks region), in post-transcriptional regulation of gene expression, response to nematodes, and changes in DNA conformation (Fig. S12), and for SV-linked genes (overlapping gene bodies), in immune response, innate immune response, thylakoid components, and magnesium ion binding (Fig. S13).

Patterns of spatial genetic structure differed between neutral and adaptive markers. Neutral variants (r = 0.388, P = 0.001) exhibited slightly stronger isolation-by-distance (IBD) than adaptive variants (r = 0.191, P = 0.049). For adaptive variants, IBD disappeared after controlling for environmental distance (Fig. 2c and d). Within the West lineage, adaptive variants (r = 0.618, P = 0.001) showed stronger IBD than neutral variants (r = 0.49, P = 0.012), while neither IBD nor isolation-by-environment (IBE) was significant in the East lineage, likely reflecting the broader geographic range and more complex topography of the West lineage (Fig. S5).

Gradient Forest projected increasing genetic offset under higher-emission scenarios and later time windows, with high-offset areas expanding markedly (Fig. 5ad). High-risk regions were concentrated in western Jiangxi, western Fujian, and southwestern Hunan, with broadly concordant patterns between SNP and SV datasets (Figs. 5 and S15). Risk of non-adaptation (RONA) likewise increased for most environmental variables under more severe climate change scenarios (Fig. S16 and S17). Using two representative variables (Bio9 and Bio19), we found that populations in regions with greater environmental change were expected to show higher RONA values (Fig. S18). A decrease in precipitation during the coldest quarter increased RONA in the GDDDS population. Forward-offset analyses highlighted western Jiangxi, the Hunan-Guangxi border region, and the Fujian-Zhejiang border region, mirroring patterns of local genetic offset (Fig. 5e). Reverse-offset analyses indicated that southwestern portions of the current suitable area would likely no longer support population survival under the SSP585 scenario in 2081–2100 (Fig. 5f). Consistent results were obtained from both SNP and SV datasets (Figs. 5 and S15).

Fig. 5 Genomic vulnerability to future climate change in Perkinsiodendron macgregorii. (a, b) Local genetic offset under SSP126 for 2021–2040 and 2081–2100. (c, d) Local genetic offset under SSP585 for 2021–2040 and 2081–2100. (e) Forward offset under SSP585 in 2081–2100. (f) Reverse genomic offset under SSP585 in 2081–2100. Higher values indicate greater genomic change required for persistence; estimates are based on Gradient Forest models using five uncorrelated bioclimatic predictors.
4. Discussion 4.1. Lineage divergence and demographic history

Mountain regions are widely recognized as crucial refugia and centers of endemism across central and southern China, where long-term environmental stability can promote persistence, divergence, and speciation (López-Pujol et al., 2011; Ren et al., 2025). In Perkinsiodendron macgregorii, we detected a strong pattern of isolation by distance (IBD), indicating that geographic continuity contributes to broad-scale population structure. Superimposed on this clinal signal is a discrete East–West split that aligns with the Wuyi Mountains. Similar East–West phylogeographic breaks have been observed in the Opsariichthys acutipinnis-evolans complex (Gao et al., 2023) and in Deinagkistrodon acutus (Huang et al., 2007), supporting a general role of the Wuyi Mountains as a barrier to dispersal. The Wuyi Mountains (~27°–28°N and 117°–118°E) extend northeast–southwest across southeastern China and have likely impeded gene flow historically, coincident with the East–West lineage split observed in P. macgregorii. Previous studies similarly identified the Wuyi Mountains as both a glacial refugium and a biogeographic barrier for many organisms (Huang et al., 2007; Guo et al., 2025; Ren et al., 2025). At the same time, alternative processes could reinforce this divergence. First, the Wuyi-associated boundary may also track contemporary environmental contrasts (e.g., monsoon-related precipitation and seasonality gradients), raising the possibility of isolation by environment (IBE), whereby environmentally mediated selection reduces effective gene flow even across moderate geographic distances. Second, ongoing habitat loss and fragmentation can reduce landscape connectivity, erode genetic diversity, and amplify population differentiation (Johnson et al., 2017; Exposito-Alonso et al., 2022); thus, recent anthropogenic disturbance—especially in low-elevation corridors between mountain blocks—may have further weakened contemporary dispersal and strengthened the apparent East–West split. Together, these patterns suggest that the divergence across the Wuyi Mountains likely reflects the combined effects of historical topographic barriers, environmentally structured selection, and human-driven fragmentation, rather than a single driver acting in isolation.

By contrast, the Nanling Mountains connect eastward to the Wuyi Mountains and northward to the Luoxiao Mountains, and together they function as major east–west (Nanling) and north–south (Luoxiao) dispersal corridors (Qiu et al., 2011; Zhang et al., 2015; Tian et al., 2018; Yang et al., 2019; Li et al., 2023; Lin et al., 2024). These corridors are consistent with the admixture we observed between western Jiangxi and Guangdong populations of the West lineage and populations in the East lineage. Notably, the predominantly N–S orientation of the Luoxiao Mountains may simultaneously act as a partial barrier between the East and West lineages—an effect also inferred in Oreocharis auricula (Sun et al., 2024). Chloroplast data further suggest that both the Wuyi and Nanling Mountains harbored glacial refugia for P. macgregorii (Yan et al., 2025), a view supported by elevated genetic diversity within Wuyi populations. Together, our results suggest that the three mountain ranges (Wuyi, Nanling, and Luoxiao) have jointly shaped the phylogeographic pattern of P. macgregorii, highlighting the profound influence of mountainous terrain on lineage divergence.

Demographic history analyses revealed a long-term decline in effective population size (Ne) since the Late Pliocene, with East–West divergence dated to ~0.16 Ma. This split likely reflects intensified climatic oscillations during the Middle Pleistocene (Pisias and Moore, 1981; Hofreiter and Stewart, 2009). Notably, this divergence coincides with pronounced climatic instability and monsoon variability in East Asia, which could have increased habitat fragmentation and promoted lineage isolation in this relict tree. Since the onset of the Naynayxungla Glaciation (NG), the West lineage has experienced more severe population contraction and weaker postglacial expansion than the East lineage. Consistent with this, the results of Stairway Plot 2 showed a more pronounced post-LGM (26.5–19 Ka) decline in the West lineage. Continued post-LGM contraction may further reflect limited natural regeneration of P. macgregorii and ongoing anthropogenic disturbance (Cao et al., 2015). Although PSMC suggests the East lineage maintained larger Ne around the onset of the NG, SMC++ infers a more recent divergence (~0.16 Ma). We interpret this difference as primarily methodological rather than truly contradictory: PSMC is a single-genome, within-lineage coalescent-based method with limited resolution for recent events and can inflate ancient Ne in recently diverged lineages by treating between-lineage coalescence as within-population events (Mazet et al., 2016). By contrast, SMC++ leverages multiple genomes and site-frequency information to better resolve more recent demographic changes and split times (Terhorst et al., 2017). In addition, absolute timing in both frameworks is sensitive to assumed mutation rates and generation times, so divergence estimates should be viewed as approximate; however, the qualitative inference of long-term decline and stronger recent contraction in the West lineage is consistent across methods.

4.2. Complementary insights from sequence and structural variation

Structural variants (SVs) represent a major—and often underappreciated—fraction of genomic variation and can exert disproportionate functional effects relative to SNPs (Wellenreuther et al., 2019; Stuart et al., 2024). Analyses of SNP and SV datasets yielded congruent population structure, jointly supporting a robust East–West split. In the SV dataset, one individual from HNCZ displayed a unique genetic signature, potentially reflecting localized rearrangements or calling artifacts and thereby warranting targeted validation. Compared with SNPs, SVs showed a significantly higher proportion of high-impact sites (HIGH) and fewer moderate- and low-impact sites (MODERATE and LOW), consistent with their larger mutational targets (deletions, duplications, inversions) and higher potential deleteriousness (Catanach et al., 2019; Collins et al., 2020; Hämälä et al., 2021).

Genotype–environment association results were complementary across marker classes: 755 candidate SNPs and 63 candidate SVs associated with bioclimatic variation mapped to largely distinct functional processes, implying partially distinct molecular routes to environmental response. SNP-linked genes were enriched for post-transcriptional regulation, DNA conformation, and defense processes, whereas SV-linked genes were enriched for immune functions, thylakoid components, and magnesium ion binding. Similar SNP–SV complementarity has been reported in Atlantic salmon (Lecomte et al., 2024). One plausible implication is that SNPs may often contribute to fine-scale tuning of gene expression and RNA metabolism—consistent with enrichment for post-transcriptional regulation and chromatin/DNA-conformation terms—thereby modulating developmental and stress-response programs in a more incremental manner. In contrast, because SVs can encompass larger genomic segments and alter gene dosage, exon structure, or local regulatory architecture, they may disproportionately affect stress-response and organellar/metabolic functions that are central to abiotic tolerance. In Perkinsiodendron macgregorii, the enrichment of SV-linked signals in immune pathways and chloroplast/thylakoid-related functions (and magnesium ion binding) is consistent with the idea that structural changes could influence photosynthetic performance, redox homeostasis, and stress-associated signaling, traits that are often sensitive to drought and temperature extremes and therefore relevant to the persistence and projected vulnerability of relict lineages. Representative SV-linked candidates include stress- and defense-related regulators such as ERF4 and ABA-associated signaling components (e.g., MKK3), along with immune-associated genes (e.g., NDR1, PAT1, CDC5) and jasmonate-related pathways (e.g., LPA1), supporting a potential role for SVs in coordinating broad stress-defense networks. Importantly, the widespread occurrence of non-coding adaptive variants highlights the importance of regulatory elements in shaping gene expression responses to environmental stress. By influencing key regulatory networks, they may enable phenotypic plasticity and rapid adjustment to heterogeneous or changing habitats (Romero et al., 2012; Marand et al., 2023). Overall, SNPs and SVs may contribute to adaptation via distinct molecular mechanisms, with SNPs potentially fine-tuning gene expression and regulatory processes, while SVs may exert larger-scale effects on metabolic and immune-related pathways, collectively enhancing the species’ capacity to cope with heterogeneous environments.

Despite these mechanistic differences, projections of genomic vulnerability (genetic offset and RONA) exhibited the same qualitative pattern for both datasets: maladaptation risk increases under more severe emission scenarios and over longer time horizons. This trend has also been observed in many other species (Sang et al., 2022; Jiang et al., 2025; Lin et al., 2025). However, high-offset regions were more extensive for SNPs than for SVs in Perkinsiodendron macgregorii, suggesting that SNP-based signals may be more spatially diffuse. Notably, some taxa show tight SNP–SV concordance (e.g., Oncorhynchus nerka; Tigano et al., 2024), suggesting that marker behavior can be system-specific.

4.3. Genetic load, diversity and vulnerability

As a relict tree species, Perkinsiodendron macgregorii has comparatively high genome-wide diversity (mean π = 0.0102; Fig. S19), exceeding values reported for several East Asian relicts—including Sinojackia xylocarpa (π = 0.0050; S. Zhu et al., 2024), the Chinese lineage of Cercidiphyllum japonicum (π = 0.0011; Zhu et al., 2020), Liquidambar formosana (π = 0.0044; Xu et al., 2024), and Davidia involucrata (π = 0.0047; Ren et al., 2024)—and comparable to Tetracentron sinense (π = 0.0115; Jing et al., 2025), though lower than Bretschneidera sinensis (π = 0.0636; X.-L. Zhu et al., 2024) and Euptelea pleiosperma (π = 0.0890; Cao et al., 2020). While cross-study comparisons should be interpreted cautiously (e.g., due to sequencing depth, filtering, reference genomes), these benchmarks suggest that P. macgregorii is not characterized by extremely low diversity, which is consistent with its current Vulnerable (VU) status rather than imminent extinction risk. Gene flow among populations likely contributes to maintaining genetic diversity and adaptive potential (Weeks et al., 2016, 2017; Clarke et al., 2024).

At the same time, small, isolated populations are prone to converting masked load into realized load via drift and inbreeding, while purifying selection can purge strongly deleterious alleles (Yang et al., 2018; van der Valk et al., 2019; Dussex et al., 2021; Khan et al., 2021; Kardos et al., 2023; Mooney et al., 2023; Robinson et al., 2023; Feng et al., 2024; Yi et al., 2024; Bemmels et al., 2025). Compared with the East lineage, the West lineage exhibited higher inbreeding, fewer heterozygous deleterious sites (DEL and TOL), and more homozygous deleterious sites. LOF burdens do not differ markedly between lineages in homozygotes, but the West retained fewer heterozygous LOF variants than the East. Together with severe population contraction since the onset of the Naynayxungla Glaciation (NG), these patterns suggest that bottleneck-associated inbreeding may have exposed (and partially purged) strongly deleterious LOF alleles while allowing weakly deleterious mutations to accumulate. Consistently, realized load, FROH, and the proportion of homozygous LOF sites within ROH were higher in the JXJGS and HNSHS populations, indicating active conversion of masked load to realized load via inbreeding and heightened vulnerability in these populations.

Genetic offset and RONA analyses have been used to predict genomic vulnerability in many species (Sang et al., 2022; Jiang et al., 2025; Long et al., 2025). Multiple offset metrics consistently pinpoint the same regions as genomic vulnerability hotspots. When considered together with elevated realized load and extensive runs of homozygosity (ROHs), these signals indicate that certain western populations face the greatest near-term risk, whereas several eastern populations exhibit distinctive variation consistent with incipient divergence. Overall, the concordance between climate-associated variation and patterns of genetic load highlights spatial heterogeneity in both risk and evolutionary potential across the species’ range, providing a results-based foundation for the management recommendations.

4.4. Conservation implications

Genomic evidence from SNP and SV datasets, demographic inference, and IBD jointly support the East and West lineages. These two evolutionarily cohesive lineages should be designated as the primary conservation units (CUs). Within each lineage, we further delineate secondary CUs to capture contrasting levels of risk and evolutionary distinctiveness. For example, JXJGS and HNSHS show elevated inbreeding, evidence of conversion of masked load to realized load, and occur within future genomic-offset hotspots in western Jiangxi and southwestern Hunan. These populations should be prioritized for habitat protection, demographic reinforcement, and cautious within-lineage assisted gene flow to reduce inbreeding while avoiding outbreeding depression. In contrast, JXGS and JXYJF exhibit consistent genetic differentiation and harbor unique variants indicative of incipient divergence; they should be managed as distinct secondary CUs to preserve evolutionary potential. Reverse-offset analyses indicate that parts of the southwestern range may become unsuitable under late-century high-emission scenarios, arguing for ex situ safeguards alongside in situ actions.

Acknowledgements

This work was supported by the Guangdong S&T Program (2022B1111230001) and National Key R&D Program of China (2024YFF1307400).

Data availability

The data supporting the findings of this work are available within the paper and Supporting Information. All of the raw sequence reads and assemblies described in this paper have been submitted to the National Genomics Data Center (NGDC; https://bigd.big.ac.cn/bioproject) under BioProject accession PRJCA050730 (genome assembly) and PRJCA051433 (whole-genome resequencing).

CRediT authorship contribution statement

Jiaxin Li: Writing–original draft, Visualization, Methodology, Investigation, Formal analysis, Data curation. Lihua Yang: Investigation, Writing–review & editing. Danqi Li: Investigation. Chen Feng: Supervision, Resources, Investigation, Writing–review & editing. Ming Kang: Supervision, Resources, Project administration, Funding acquisition, Conceptualization, Writing – review & editing.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Appendix A. Supplementary data

Supplementary data to this article can be found online at https://doi.org/10.1016/j.pld.2026.03.001.

References
Alexander, D.H., Lange, K., 2011. Enhancements to the ADMIXTURE algorithm for individual ancestry estimation. BMC Bioinformatics, 12: 246. DOI:10.1186/1471-2105-12-246
Bemmels, J.B., Starko, S., Weigel, B.L., et al., 2025. Population genomics reveals strong impacts of genetic drift without purging and guides conservation of bull and giant kelp. Curr. Biol., 35: 688-698.e8. DOI:10.1016/j.cub.2024.12.025
Browning, B.L., Browning, S.R., 2009. A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. Am. J. Hum. Genet., 84: 210-223. DOI:10.1016/j.ajhg.2009.01.005
Cao, Y.-N., Zhu, S.-S., Chen, J., et al., 2020. Genomic insights into historical population dynamics, local adaptation, and climate change vulnerability of the East Asian tertiary relict Euptelea (Eupteleaceae). Evol. Appl., 13: 2038-2055. DOI:10.1111/eva.12960
Cao, Z., Li, X., Lin, H., et al., 2015. Analysis on growing process of Halesia macgregorii in Jiulianshan Mountain of Jiangxi Province. J. Agric., 5: 92.
Capella-Gutiérrez, S., Silla-Martínez, J.M., Gabaldón, T., 2009. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics, 25: 1972-1973. DOI:10.1093/bioinformatics/btp348
Catanach, A., Crowhurst, R., Deng, C., et al., 2019. The genomic pool of standing structural variation outnumbers single nucleotide polymorphism by threefold in the marine teleost Chrysophrys auratus. Mol. Ecol., 28: 1210-1223. DOI:10.1111/mec.15051
Chang, C.C., Chow, C.C., Tellier, L.C., et al., 2015. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience, 4: s13742-15-47–8.
Chen, S., Zhou, Y., Chen, Y., et al., 2018. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics, 34: i884-i890. DOI:10.1093/bioinformatics/bty560
Chen, T., Chen, G., 1996. The geographical distribution of Styracaceae. Bull. Bot. Res., 16: 57-66. DOI:10.1017/S0022112096008245
Chen, X., Schulz-Trieglaff, O., Shaw, R., et al., 2016. Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics, 32: 1220-1222. DOI:10.1093/bioinformatics/btv710
Cheng, H., Concepcion, G.T., Feng, X., et al., 2021. Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm. Nat. Methods, 18: 170-175. DOI:10.1038/s41592-020-01056-5
Cingolani, P., Platts, A., Wang, L.L., et al., 2012. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly, 6: 80-92. DOI:10.4161/fly.19695
Clarke, J.G., Smith, A.C., Cullingham, C.I., 2024. Genetic rescue often leads to higher fitness as a result of increased heterozygosity across animal taxa. Mol. Ecol., 33: e17532. DOI:10.1111/mec.17532
Collins, R.L., Brand, H., Karczewski, K.J., et al., 2020. A structural variation reference for medical and population genetics. Nature, 581: 444-451. DOI:10.1038/s41586-020-2287-8
Danecek, P., Auton, A., Abecasis, G., et al., 2011. The variant call format and VCFtools. Bioinformatics, 27: 2156-2158. DOI:10.1093/bioinformatics/btr330
Dixon, P., 2003. VEGAN, a package of R functions for community ecology. J. Veg. Sci., 14: 927-930. DOI:10.1111/j.1654-1103.2003.tb02228.x
Dorant, Y., Cayuela, H., Wellband, K., et al., 2020. Copy number variants outperform SNPs to reveal genotype–temperature association in a marine species. Mol. Ecol., 29: 4765-4782. DOI:10.1111/mec.15565
Doyle, J.J., Doyle, J.L., 1987. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull, 19: 11-15.
Durand, N.C., Shamim, M.S., Machol, I., et al., 2016. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst., 3: 95-98. DOI:10.1016/j.cels.2016.07.002
Dussex, N., Van Der Valk, T., Morales, H.E., et al., 2021. Population genomics of the critically endangered kākāpō. Cell Genom., 1: 100002. DOI:10.1016/j.xgen.2021.100002
Ellegren, H., 2014. Genome sequencing and population genomics in non-model organisms. Trends Ecol. Evol., 29: 51-63. DOI:10.1016/j.tree.2013.09.008
Ellis, N., Smith, S.J., Pitcher, C.R., 2012. Gradient forests: calculating importance gradients on physical predictors. Ecology, 93: 156-168. DOI:10.1890/11-0252.1
Emms, D.M., Kelly, S., 2019. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol., 20: 238. DOI:10.1186/s13059-019-1832-y
Exposito-Alonso, M., Booker, T.R., Czech, L., et al., 2022. Genetic diversity loss in the anthropocene. Science, 377: 1431-1435. DOI:10.1126/science.abn5642
Feng, Y., Comes, H.P., Chen, J., et al., 2024. Genome sequences and population genomics provide insights into the demographic history, inbreeding, and mutation load of two ‘living fossil’ tree species of Dipteronia. Plant J., 117: 177-192. DOI:10.1111/tpj.16486
Feulner, P.G.D., Chain, F.J.J., Panchal, M., et al., 2013. Genome-wide patterns of standing genetic variation in a marine population of three-spined sticklebacks. Mol. Ecol., 22: 635-649. DOI:10.1111/j.1365-294X.2012.05680.x
Fick, S.E., Hijmans, R.J., 2017. WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. Int. J. Climatol., 37: 4302-4315. DOI:10.1002/joc.5086
Frichot, E., François, O., 2015. LEA: an R package for landscape and ecological association studies. Methods Ecol. Evol., 6: 925-929. DOI:10.1111/2041-210X.12382
Fritsch, P.W., 2001. Phylogeny and biogeography of the flowering plant genus Styrax (Styracaceae) based on chloroplast DNA restriction sites and DNA sequences of the internal transcribed spacer region. Mol. Phylogenet. Evol., 19: 387-408. DOI:10.1006/mpev.2001.0933
Fritsch, P.W., Yao, X., Simison, W.B., et al., 2016. Perkinsiodendron, a new genus in the styracaceae based on morphology and DNA sequences. J. Bot. Res. Inst. Tex., 10: 109-117.
Gabriel, L., Brůna, T., Hoff, K.J., et al., 2024. BRAKER3: fully automated genome annotation using RNA-seq and protein evidence with GeneMark-ETP, AUGUSTUS, and TSEBRA. Genome Res., 34: 769-777. DOI:10.1101/gr.278090.123
Gao, J., Yu, D., Liu, H., 2023. Phylogeographic analysis revealed allopatric distribution pattern and biogeographic processes of the widespread pale chub Opsariichthys acutipinnis-evolans complex (Teleostei: Cyprinidae) in southeastern China. Front. Ecol. Evol., 11: 1142810. DOI:10.3389/fevo.2023.1142810
Gougherty, A.V., Keller, S.R., Fitzpatrick, M.C., 2021. Maladaptation, migration and extirpation fuel climate change risk in a forest tree species. Nat. Clim. Change, 11: 166-171. DOI:10.1038/s41558-020-00968-6
Grandcolas, P., Nattier, R., Trewick, S., 2014. Relict species: a relict concept?. Trends Ecol. Evol., 29: 655-663. DOI:10.1016/j.tree.2014.10.002
Grantham, R., 1974. Amino acid difference formula to help explain protein evolution. Science, 185: 862-864. DOI:10.1126/science.185.4154.862
Gremme, G., Steinbiss, S., Kurtz, S., 2013. GenomeTools: a comprehensive software library for efficient processing of structured genome annotations. IEEE/ACM Trans. Comput. Biol. Bioinformatics, 10: 645-656. DOI:10.1109/TCBB.2013.68
Guan, D., McCarthy, S.A., Wood, J., et al., 2020. Identifying and removing haplotypic duplication in primary genome assemblies. Bioinformatics, 36: 2896-2898. DOI:10.1093/bioinformatics/btaa025
Guo, P., Lu, A., Zheng, J., et al., 2025. The past, present, and future distribution of Calanthe graciliflora: implications for conservation and phylogeography. Nat. Conserv., 60: 21-38. DOI:10.3897/natureconservation.60.156661
Hämälä, T., Wafula, E.K., Guiltinan, M.J., et al., 2021. Genomic structural variants constrain and facilitate adaptation in natural populations of Theobroma cacao, the chocolate tree. Proc. Natl. Acad. Sci. U.S.A., 118: e2102914118. DOI:10.1073/pnas.2102914118
He, W., Xu, L., Wang, J., et al., 2024. VCF2PCACluster: a simple, fast and memory-efficient tool for principal component analysis of tens of millions of SNPs. BMC Bioinformatics, 25: 173. DOI:10.1186/s12859-024-05770-1
Hofreiter, M., Stewart, J., 2009. Ecological change, range fluctuations and population dynamics during the Pleistocene. Curr. Biol., 19: R584-R594. DOI:10.1016/j.cub.2009.06.030
Huang, S., He, S., Peng, Z., et al., 2007. Molecular phylogeography of endangered sharp-snouted pitviper (Deinagkistrodon acutus; Reptilia, Viperidae) in mainland China. Mol. Phylogenet. Evol., 44: 942-952. DOI:10.1016/j.ympev.2007.05.019
Jeffares, D.C., Jolly, C., Hoti, M., et al., 2017. Transient structural variations have strong effects on quantitative traits and reproductive isolation in fission yeast. Nat. Commun., 8: 14061. DOI:10.1038/ncomms14061
Jiang, Q., Shen, Y., Wu, L., et al., 2025. Genomic signatures of local adaptation to precipitation and solar radiation in kiwifruit. Plant Divers., 47: 733-745. DOI:10.1016/j.pld.2025.02.003
Jing, Z.-Y., Zhang, R.-G., Liu, Y., et al., 2025. Genomic insights into the evolutionary history and conservation of the living fossil Tetracentron sinense. Plant Divers., 47: 759-771. DOI:10.1016/j.pld.2025.05.008
Johnson, C.N., Balmford, A., Brook, B.W., et al., 2017. Biodiversity losses and conservation responses in the Anthropocene. Science, 356: 270-275. DOI:10.1126/science.aam9317
Kardos, M., Luikart, G., Allendorf, F.W., 2015. Measuring individual inbreeding in the age of genomics: marker-based measures are better than pedigrees. Heredity, 115: 63-72. DOI:10.1038/hdy.2015.17
Kardos, M., Zhang, Y., Parsons, K.M., et al., 2023. Inbreeding depression explains killer whale population dynamics. Nat. Ecol. Evol., 7: 675-686. DOI:10.1038/s41559-023-01995-0
Katoh, K., Standley, D.M., 2013. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol., 30: 772-780. DOI:10.1093/molbev/mst010
Khan, A., Patel, K., Shukla, H., et al., 2021. Genomic evidence for inbreeding depression and purging of deleterious genetic variation in Indian tigers. Proc. Natl. Acad. Sci. U.S.A., 118: e2023018118. DOI:10.1073/pnas.2023018118
Korneliussen, T.S., Albrechtsen, A., Nielsen, R., 2014. ANGSD: analysis of next generation sequencing data. BMC Bioinformatics, 15: 356. DOI:10.1186/s12859-014-0356-4
Korunes, K.L., Samuk, K., 2021. pixy: unbiased estimation of nucleotide diversity and divergence in the presence of missing data. Mol. Ecol. Resour., 21: 1359-1368. DOI:10.1111/1755-0998.13326
Krieger, C., Roselli, S., Kellner-Thielmann, S., et al., 2018. The CYP71AZ P450 subfamily: a driving factor for the diversification of coumarin biosynthesis in apiaceous plants. Front. Plant Sci., 9: 820. DOI:10.3389/fpls.2018.00820
Lecomte, L., Árnyasi, M., Ferchaud, A., et al., 2024. Investigating structural variant, indel and single nucleotide polymorphism differentiation between locally adapted Atlantic salmon populations. Evol. Appl., 17: e13653. DOI:10.1111/eva.13653
Lefort, V., Desper, R., Gascuel, O., 2015. FastME 2.0: a comprehensive, accurate, and fast distance-based phylogeny inference program. Mol. Biol. Evol., 32: 2798-2800. DOI:10.1093/molbev/msv150
Li, F., Chung, T., Vierstra, R.D., 2014. AUTOPHAGY-RELATED11 plays a critical role in general autophagy- and senescence-induced mitophagy in Arabidopsis. Plant Cell, 26: 788-807. DOI:10.1105/tpc.113.120014
Li, H., Durbin, R., 2011. Inference of human population history from individual whole-genome sequences. Nature, 475: 493-496. DOI:10.1038/nature10231
Li, H., Durbin, R., 2009. Fast and accurate short read alignment with burrows–wheeler transform. Bioinformatics, 25: 1754-1760. DOI:10.1093/bioinformatics/btp324
Li, H., Handsaker, B., Wysoker, A., et al., 2009. The sequence alignment/map format and SAMtools. Bioinformatics, 25: 2078-2079. DOI:10.1093/bioinformatics/btp352
Li, J., Fu, C., Ai, Q., et al., 2023. Whole-genome resequencing reveals complex effects of geographical–palaeoclimatic interactions on diversification of moustache toads in East Asia. Mol. Ecol., 32: 644-659. DOI:10.1111/mec.16781
Liang, Y.-Y., Liu, H., Lin, Q.-Q., et al., 2025. Pan-genome analysis reveals local adaptation to climate driven by introgression in oak species. Mol. Biol. Evol., 42: msaf088. DOI:10.1093/molbev/msaf088
Liao, H., Guo, C., Guo, J., et al., 2024. Observation and analysis of the growth pattern of one-year-old seedlings of Perkinsiodendron macgregorii. J. Temp. For. Res., 3: 57-60.
Lin, A.-L., Zou, M.-M., Cao, L.-J., et al., 2024. Synergistic effects of Pleistocene geological and climatic events on complex phylogeographic history of widespread sympatric species of Megaloptera in East Asia. Zool. Res., 45: 1131-1146. DOI:10.24272/j.issn.2095-8137.2024.056
Lin, N., Wang, Y., Landis, J.B., et al., 2025. Genome sequencing and population genomics provide insights into the demographic history, genetic load, and local adaptation of an endangered tertiary relict. Plant J., 123: e70425. DOI:10.1111/tpj.70425
Liu, B., Shi, Y., Yuan, J., et al., 2013. Estimation of Genomic Characteristics by Analyzing k-mer Frequency in De Novo Genome Projects. Preprint arXiv: 1308.2012.
Liu, J., Zhong, X., 2024. Population epigenetics: DNA methylation in the plant omics era. Plant Physiol., 194: 2039-2048. DOI:10.1093/plphys/kiae089
Liu, X., Fu, Y.-X., 2020. Stairway plot 2: demographic history inference with folded SNP frequency spectra. Genome Biol., 21: 280. DOI:10.1007/s12250-020-00244-z
Long, Z., Sang, Y., Feng, J., et al., 2025. Evolutionary genomics unravels the responses and adaptation to climate change in a key alpine forest tree species. Mol. Biol. Evol., 42: 1-21. DOI:10.1080/09500693.2025.2503493
López-Pujol, J., Zhang, F.-M., Sun, H.-Q., et al., 2011. Mountains of southern China as “plant museums” and “plant cradles”: evolutionary and conservation insights. Mt. Res. Dev., 31: 261-269. DOI:10.1659/MRD-JOURNAL-D-11-00058.1
Lu, L.-M., Mao, L.-F., Yang, T., et al., 2018. Evolutionary history of the angiosperm flora of China. Nature, 554: 234-238. DOI:10.1038/nature25485
Malinsky, M., Matschiner, M., Svardal, H., 2021. Dsuite-fast D-statistics and related admixture evidence from VCF files. Mol. Ecol. Resour., 21: 584-595. DOI:10.1111/1755-0998.13265
Manchester, S.R., Chen, Z.-D., Lu, A.-M., et al., 2009. Eastern Asian endemic seed plant genera and their paleogeographic history throughout the northern hemisphere. J. Syst. Evol., 47: 1-42. DOI:10.1111/j.1759-6831.2009.00001.x
Manni, M., Berkeley, M.R., Seppey, M., et al., 2021. BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol. Biol. Evol., 38: 4647-4654. DOI:10.1093/molbev/msab199
Marand, A.P., Eveland, A.L., Kaufmann, K., et al., 2023. cis-Regulatory elements in plant development, adaptation, and evolution. Annu. Rev. Plant Biol., 74: 111-137. DOI:10.1146/annurev-arplant-070122-030236
Mazet, O., Rodríguez, W., Grusea, S., et al., 2016. On the importance of being structured: instantaneous coalescence rates and human evolution—lessons for ancestral population size inference?. Heredity, 116: 362-371. DOI:10.1038/hdy.2015.104
McKenna, A., Hanna, M., Banks, E., et al., 2010. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res., 20: 1297-1303. DOI:10.1101/gr.107524.110
Mendes, F.K., Vanderpool, D., Fulton, B., et al., 2021. CAFE 5 models variation in evolutionary rates among gene families. Bioinformatics, 36: 5516-5518. DOI:10.1093/bioinformatics/btaa1022
Milne, R.I., 2006. Northern hemisphere plant disjunctions: a window on tertiary land bridges and climate change?. Ann. Bot., 98: 465-472. DOI:10.1093/aob/mcl148
Milne, R.I., Abbott, R.J., 2002. The origin and evolution of tertiary relict floras. In: Advances in Botanical Research. Elsevier, pp. 281-314.
Mooney, J.A., Marsden, C.D., Yohannes, A., et al., 2023. Long-term small population size, deleterious variation, and altitude adaptation in the Ethiopian wolf, a severely endangered canid. Mol. Biol. Evol., 40: msac277. DOI:10.1093/molbev/msac277
Ou, S., Jiang, N., 2018. LTR_retriever: a highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol., 176: 1410-1422. DOI:10.1104/pp.17.01310
Ou, S., Su, W., Liao, Y., et al., 2019. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol., 20: 275. DOI:10.1186/s13059-019-1905-y
Phillips, S.J., Dudík, M., 2008. Modeling of species distributions with maxent: new extensions and a comprehensive evaluation. Ecography, 31: 161-175. DOI:10.1111/j.0906-7590.2008.5203.x
Pisias, N.G., Moore, T.C., 1981. The evolution of pleistocene climate: a time series approach. Earth Planet Sci. Lett., 52: 450-458. DOI:10.1016/0012-821X(81)90197-7
Qiao, Y., Shi, J., Zhai, Y., et al., 2015. Phytophthora effector targets a novel component of small RNA pathway in plants to promote infection. Proc. Natl. Acad. Sci. U.S.A., 112: 5851-5855.
Qiu, Y.-X., Fu, C.-X., Comes, H.P., 2011. Plant molecular phylogeography in China and adjacent regions: tracing the genetic imprints of quaternary climate and environmental change in the world's most diverse temperate flora. Mol. Phylogenet. Evol., 59: 225-244.
Qüesta, J.I., Fina, J.P., Casati, P., 2013. DDM1 and ROS1 have a role in UV-B induced- and oxidative DNA damage in A. thaliana. Front. Plant Sci., 4: 420.
Rausch, T., Zichner, T., Schlattl, A., et al., 2012. DELLY: structural variant discovery by integrated paired-end and split-read analysis. Bioinformatics, 28: i333-i339. DOI:10.1093/bioinformatics/bts378
Ren, J.-L., Li, R.-H., Jia, R.-W., et al., 2025. Quaternary glaciation accelerates speciation in aquatic snakes through recent bottlenecks. Mol. Ecol., 34: e70071. DOI:10.1111/mec.70071
Ren, Y., Zhang, L., Yang, X., et al., 2024. Cryptic divergences and repeated hybridizations within the endangered “living fossil” dove tree (Davidia involucrata) revealed by whole genome resequencing. Plant Divers., 46: 169-180. DOI:10.1016/j.pld.2024.02.004
Robinson, J., Kyriazis, C.C., Yuan, S.C., et al., 2023. Deleterious variation in natural populations and implications for conservation genetics. Ann. Rev. Anim. Biosci., 11: 93-114. DOI:10.1146/annurev-animal-080522-093311
Romero, I.G., Ruvinsky, I., Gilad, Y., 2012. Comparative studies of gene expression and the evolution of gene regulation. Nat. Rev. Genet., 13: 505-516. DOI:10.1038/nrg3229
Sakaguchi, S., Qiu, Y.-X., Liu, Y.-H., et al., 2012. Climate oscillation during the Quaternary associated with landscape heterogeneity promoted allopatric lineage divergence of a temperate tree Kalopanax septemlobus (Araliaceae) in East Asia. Mol. Ecol., 21: 3823-3838. DOI:10.1111/j.1365-294X.2012.05652.x
Sang, Y., Long, Z., Dan, X., et al., 2022. Genomic insights into local adaptation and future climate-induced vulnerability of a keystone forest tree in East Asia. Nat. Commun., 13: 6541. DOI:10.1038/s41467-022-34206-8
Shear, W., Werth, A., 2014. The evolutionary truth about living fossils. Am. Sci., 102: 434. DOI:10.1511/2014.111.434
Smeds, L., Huson, L.S.A., Ellegren, H., 2024. Structural genomic variation in the inbred Scandinavian wolf population contributes to the realized genetic load but is positively affected by immigration. Evol. Appl., 17: e13652. DOI:10.1111/eva.13652
Song, B., Ning, W., Wei, D., et al., 2023. Plant genome resequencing and population genomics: current status and future prospects. Mol. Plant, 16: 1252-1268.
Srikant, T., Drost, H.-G., 2026. Defining the epigenetic toolkit as an evolvable trait. Trends Plant Sci., 31: 153-164. DOI:10.1016/j.tplants.2025.11.010
Stuart, K.C., Tan, H.Z., Whibley, A., et al., 2024. Both structural variant and single nucleotide polymorphism load impact lifetime fitness in a threatened bird species. Mol. Ecol., 34: e17631.
Sun, P., Jiao, B., Yang, Y., et al., 2022. WGDI: a user-friendly toolkit for evolutionary analyses of whole-genome duplications and ancestral karyotypes. Mol. Plant, 15: 1841-1851.
Sun, Y., Moore, M.J., Yue, L., et al., 2014. Chloroplast phylogeography of the East Asian arcto-tertiary relict Tetracentron sinense (Trochodendraceae). J. Biogeogr., 41: 1721-1732. DOI:10.1111/jbi.12323
Sun, Z., Yang, L., Kong, H., et al., 2024. Phylogeographical patterns match the floristic subdivisions: the diversification history of a widespread herb in subtropical China. Ann. Bot., 134: 1263-1276. DOI:10.1093/aob/mcae157
Sung, S., Schmitz, R.J., Amasino, R.M., 2006. A PHD finger protein involved in both the vernalization and photoperiod pathways in arabidopsis. Genes Dev., 20: 3244-3248. DOI:10.1101/gad.1493306
Tang, C.Q., Matsui, T., Ohashi, H., et al., 2018. Identifying long-term stable refugia for relict plant species in East Asia. Nat. Commun., 9: 4488.
Tang, H., Krishnakumar, V., Zeng, X., et al., 2024. JCVI: a versatile toolkit for comparative genomics analysis. iMeta, 3: e211.
Terhorst, J., Kamm, J.A., Song, Y.S., 2017. Robust and scalable inference of population history from hundreds of unphased whole genomes. Nat. Genet., 49: 303-309. DOI:10.1038/ng.3748
Tian, S., Kou, Y., Zhang, Zhirong, et al., 2018. Phylogeography of Eomecon chionantha in subtropical China: the dual roles of the Nanling Mountains as a glacial refugium and a dispersal corridor. BMC Evol. Biol., 18: 20.
Tigano, A., Weir, T., Ward, H.G., et al., 2024. Genomic vulnerability of a freshwater salmonid under climate change. Evol. Appl., 17: e13602.
van der Valk, T., Díez-del-Molino, D., Marques-Bonet, T., et al., 2019. Historical genomes reveal the genomic consequences of recent population decline in eastern gorillas. Curr. Biol., 29: 165-170.e6.
Wang, J., Gao, P., Kang, M., Huang, H., 2009. Refugia within refugia: the case study of a canopy tree Eurycorymbus cavaleriei in subtropical China. J. Biogeogr., 36: 2156-2164. DOI:10.1111/j.1365-2699.2009.02165.x
Wang, T.-R., Meng, H.-H., Wang, N., et al., 2023. Adaptive divergence and genetic vulnerability of relict species under climate change: a case study of Pterocarya macroptera. Ann. Bot., 132: 241-254. DOI:10.1093/aob/mcad083
Wang, Y., Tang, H., DeBarry, J.D., et al., 2012. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res., 40: e49. DOI:10.1093/nar/gkr1293
Weeks, A.R., Heinze, D., Perrin, L., et al., 2017. Genetic rescue increases fitness and aids rapid recovery of an endangered marsupial population. Nat. Commun., 8: 1071.
Weeks, A.R., Stoklosa, J., Hoffmann, A.A., 2016. Conservation of genetic uniqueness of populations may increase extinction likelihood of endangered species: the case of Australian mammals. Front. Zool., 13: 31.
Wellenreuther, M., Mérot, C., Berdan, E., et al., 2019. Going beyond SNPs: the role of structural genomic variants in adaptive evolution and species diversification. Mol. Ecol., 28: 1203-1209. DOI:10.1111/mec.15066
Wong, T.K., Ly-Trong, N., Ren, H., et al., 2025. IQ-TREE 3: Phylogenomic Inference Software Using Complex Evolutionary Models. https://doi.org/10.32942/X2P62N. Preprint at EcoEvoRxiv.
Wu, T., Hu, E., Xu, S., et al., 2021. clusterProfiler 4.0: a universal enrichment tool for interpreting omics data. Innovation, 2: 100141.
Xu, L., He, W., Tai, S., et al., 2025. VCF2Dis: an ultra-fast and efficient tool to calculate pairwise genetic distance and construct population phylogeny from VCF files. GigaScience, 14: giaf032.
Xu, W.-Q., Ren, C.-Q., Zhang, X.-Y., et al., 2024. Genome sequences and population genomics reveal climatic adaptation and genomic divergence between two closely related sweetgum species. Plant J., 118: 1372-1387. DOI:10.1111/tpj.16675
Xu, Z., Wang, H., 2007. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res., 35: W265-W268. DOI:10.1093/nar/gkm286
Yan, M., Fritsch, P.W., Moore, M.J., et al., 2018. Plastid phylogenomics resolves infrafamilial relationships of the Styracaceae and sheds light on the backbone relationships of the Ericales. Mol. Phylogenet. Evol., 121: 198-211.
Yan, M.-H., Du, Y.-R., Zhao, J.-Y., et al., 2025. Phylogeography of the endangered endemic Perkinsiodendron macgregorii based on chloroplast genome data. Diversity, 17: 439. DOI:10.3390/d17070439
Yan, M.-H., Si, J.-Y., Dong, N.-C., et al., 2022. Assessing the potential distribution of a vulnerable tree under climate change: Perkinsiodendron macgregorii (Chun) P.W. Fritsch (Styracaceae). Sustainability, 15: 666. DOI:10.3390/su15010666
Yang, A., Zhong, Y., Liu, S., et al., 2019. New insight into the phylogeographic pattern of Liriodendron chinense (Magnoliaceae) revealed by chloroplast DNA: east–west lineage split and genetic mixture within western subtropical China. PeerJ, 7: e6355. DOI:10.7717/peerj.6355
Yang, J., Lee, S.H., Goddard, M.E., et al., 2011. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet., 88: 76-82. DOI:10.5815/ijeme.2011.06.12
Yang, Y., Ma, T., Wang, Z., et al., 2018. Genomic effects of population collapse in a critically endangered ironwood tree Ostrya rehderiana. Nat. Commun., 9: 5449.
Yang, Z., 2007. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol., 24: 1586-1591. DOI:10.1093/molbev/msm088
Yi, H., Wang, J., Dong, S., et al., 2024. Genomic signatures of inbreeding and mutation load in tree ferns. Plant J., 120: 1522-1535. DOI:10.1111/tpj.17064
Zeng, X., Yi, Z., Zhang, X., et al., 2024. Chromosome-level scaffolding of haplotype-resolved assemblies using Hi-C data without reference genomes. Nat. Plants, 10: 1184-1200. DOI:10.1038/s41477-024-01755-3
Zhang, C., Dong, S.-S., Xu, J.-Y., et al., 2019. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics, 35: 1786-1788. DOI:10.1093/bioinformatics/bty875
Zhang, J., Li, Z., Fritsch, P.W., et al., 2015. Phylogeography and genetic structure of a tertiary relict tree species, Tapiscia sinensis (Tapisciaceae): implications for conservation. Ann. Bot., 116: 727-737. DOI:10.1093/aob/mcv112
Zhang, Y., Yang, Z., He, Y., et al., 2024. Structural variation reshapes population gene expression and trait variation in 2,105 Brassica napus accessions. Nat. Genet., 56: 2538-2550. DOI:10.1038/s41588-024-01957-7
Zhang, Y.-H., Wang, I.J., Comes, H.P., et al., 2016. Contributions of historical and contemporary geographic and environmental factors to phylogeographic structure in a Tertiary relict species, Emmenopterys henryi (Rubiaceae). Sci. Rep., 6: 24041.
Zhu, S., Chen, J., Zhao, J., et al., 2020. Genomic insights on the contribution of balancing selection and local adaptation to the long-term survival of a widespread living fossil tree, Cercidiphyllum japonicum. New Phytol., 228: 1674-1689. DOI:10.1111/nph.16798
Zhu, S., Wei, X.-F., Lu, Y.-X., et al., 2024. The jacktree genome and population genomics provides insights for the mechanisms of the germination obstacle and the conservation of endangered ornamental plants. Hortic. Res., 11: uhae166.
Zhu, X.-L., Wang, J., Chen, H.-F., et al., 2024. Lineage differentiation and genomic vulnerability in a relict tree from subtropical forests. Evol. Appl., 17: e70033.