Insights into the phylogenetic, structural and functional evolution of LHC superfamily in photosynthetic organisms
Kexin Cai, Runjie Diao, Zhihang Zhao, Yannan Liu, Zhenhua Zhang*, Bojian Zhong**     
College of Life Sciences, Ministry of Education Key Laboratory of NSLSCS, Nanjing Normal University, Nanjing 210023, China
Abstract: Photosynthesis is the fundamental process for the survival of photosynthetic organisms. The light-harvesting complexes (LHCs) play a crucial role in light energy capture and transfer. Despite their conserved role in energy transfer, the evolutionary basis for functional diversification of LHCs across photosynthetic lineages remains largely unclear. In this study, we performed phylogenetic analyses based on 65 genomes covering all major linages of photosynthetic organisms and identified 1922 LHC proteins, which were classified into four groups: LHCA, LHCB, LHC-like, and FCP. The LHC gene family has undergone dynamic diversification through lineage-specific mechanisms in which green algae primarily utilized dispersed duplications, while land plants increasingly relied on whole-genome duplications (WGDs). Structural analyses revealed key adaptive modifications: PSI-LHCI transitioned from a double-layer crescent antenna in green algae to a simplified single-layer configuration in land plants, while PSII-LHCII in angiosperms replaced the N-LHCII trimer with CP24 to optimize light harvesting. Expression profiling revealed the functional diversification of different LHC members, with LHCA/LHCB members showing constitutive expression in photosynthetic tissues and strong induction of abiotic stresses, while LHC-like exhibited specialized responses to specific environmental challenges. Our study enhances better understanding of the evolutionary scenario of LHC superfamily during the evolution of photosynthetic organisms, and provides a foundation for future research on the molecular mechanisms underlying the adaptation of LHCs to extreme environments.
Keywords: Light-harvesting complex    Gene family    Evolution    Photosynthetic organisms    
1. Introduction

Photosynthesis is a fundamental process that converts solar energy into chemical energy, thereby supporting life on Earth. It not only provides essential materials for growth, development, and reproduction, but also sustains energy flow and material cycling in both aquatic and terrestrial ecosystems (Jiang et al., 2014; Neilson and Durnford, 2010a). Photosystem (including light-harvesting complex and reaction center) serves as the core functional unit to absorb, convert, and transfer light energy through a series of complex electron transfer reactions (Buchel, 2015; Chmeliov et al., 2016; Chukhutsina et al., 2020). The primary endosymbiotic event, wherein a eukaryotic host cell engulfed a cyanobacterial ancestor, gave rise to three photosynthetic lineages: glaucophytes, red algae, and green plants (Hohmann-Marriott and Blankenship, 2011; Keeling, 2010). The secondary endosymbiosis involving a green alga gave rise to the euglenids (Keeling, 2010). The secondary and tertiary endosymbiosis of red algae led to the origin of the red plastid lineage, including cryptophytes and haptophytes and SAR supergroup: stramenopiles (e.g., diatoms and brown algae), alveolates (e.g., dinoflagellates) and rhizarians (Hohmann-Marriott and Blankenship, 2011; Keeling, 2010). These endosymbiotic events have contributed to the extensive diversity of photosynthetic organisms.

Cyanobacteria primarily depend on phycobilisomes (PBSs) to harvest light (Bag, 2021). Additionally, photosynthetic eukaryotes have evolved the light-harvesting complex (LHC) superfamily (Durnford et al., 1999; Green, 2019; Stadnichuk and Kusnetsov, 2023). The LHC superfamily encodes the light-harvesting antennas of photosystems, effectively capturing and directing light energy to the photosynthetic reaction center. In green plants, the LHC superfamily is categorized into four subfamilies: LHC, LHC-like (light-harvesting-like), PsbS (photosystem Ⅱ subunit S), and FCII (ferrochelatase Ⅱ) (Engelken et al., 2010; Jansson, 1999; Lan et al., 2022; Zou and Yang, 2019). The LHC subfamily associated with PSI and PSII is separately referred to as LHCA (LHCI) and LHCB (LHCII) (Huang et al., 2021; Jansson et al., 1999). The LHC-like subfamily includes four groups: OHP (one-helix protein), SEP (stress-enhanced protein), ELIP (early light-induced protein), and Psb33 (photosystem Ⅱ protein 33) (Engelken et al., 2010; Lan et al., 2022; Zou, 2018). In diatoms, the light-harvesting complexes incorporate fucoxanthin as the major carotenoid and are thus called fucoxanthin-chlorophyll proteins (FCPs) (Eppard et al., 2000; Nagao et al., 2019). The FCPs include the Lhcf, Lhcr, Lhcx and Lhcz subfamilies (Koziol et al., 2007; Neilson and Durnford, 2010b). Although the LHC orthologs among various photosynthetic lineages has been identified, the evolutionary scenarios of the LHC superfamily across photosynthetic organisms remains poorly understood.

The composition and architecture of both LHCI and LHCII antenna systems exhibit diversity across various photosynthetic lineages. The PSI-LHCI supercomplex of the green alga Chlamydomonas reinhardtii incorporates ten LHCA subunits organized into an extensive two-layered semicircular array (Merchant et al., 2007; Su et al., 2019). In Arabidopsis thaliana, its PSI-LHCI supercomplex comprises only four core LHCA proteins (Lhca1/Lhca4 and Lhca2/Lhca3 forming heterodimers) that assemble into a single-layered, belt-like structure (Ben-Shem et al., 2003; Pan et al., 2018). Beyond the four core antennae, Lhca5 and Lhca6 mediate the interaction between the NADH dehydrogenase-like complex and PSI, thereby assembling the PSI-NDH complexes to facilitate cyclic electron flow (Opatíková and Kouřil, 2024; Peng and Shikanai, 2011; Shen et al., 2021). In addition, the PSII-LHCII complex of C. reinhardtii forms a twofold-symmetric homodimer, comprising two copies of the PSII core alongside strongly bound (S-), moderately bound (M-), and naked (N-) LHCII trimers (Shen et al., 2019). Each of the PSII core contains one CP26 (Lhcb5) and one CP29 (Lhcb4) subunit. The S-LHCII and CP26 subunits bind directly to the CP43 side of the core, whereas M-LHCII attaches via CP29 on the CP47 side (Shen et al., 2019). In Pisum sativum, the CP24 (Lhcb6) subunit occupies a position equivalent to that of the N-LHCII trimer in green algae (Shen et al., 2019; Su et al., 2017).

Beyond light harvesting, the LHC superfamily plays a crucial role in managing light energy to optimize photosynthesis and prevent damage (Scholes et al., 2011). LHC-like proteins are specifically responsible for mitigating photooxidative stress (Bassi and Dall'Osto, 2021). Under low-light conditions, the LhcSR protein of Chlamydomonas reinhardtii localizes to the stromal thylakoid membranes, distal to PSII, and remains inactive (Peers et al., 2009). Upon exposure to excessive light, thylakoid acidification and zeaxanthin accumulation activate LhcSR, prompting its migration to the grana margins and assembly into an LhcSR-CP26-LHCII quenching module for photoprotection (Bassi and Dall'Osto, 2021; Shen et al., 2019; Su et al., 2017). In Arabidopsis thaliana, PsbS remains inactive under low light but is activated upon acidification of the thylakoid lumen under high light (Li et al., 2000). The activated PsbS then binds to the CP29-CP24-M-LHCII complex, inducing its dissociation and the release of M-LHCII for energy dissipation (Bassi and Dall'Osto, 2021; Betterle et al., 2009; Drop et al., 2014). Notably, the early-branching land plant Physcomitrium patens uniquely retains both PsbS and LhcSR proteins for driving nonphotochemical quenching (NPQ) and providing robust photoprotection, crucial for its adaptation to terrestrial environments (Alboresi et al., 2010). In addition to rapid photoprotection, state transition-dependent phosphorylation of PSII-LHCII fine-tunes the photosynthetic balance between efficiency and protection. The STN7 kinase facilitates this process by phosphorylating LHCII, promoting the transfer of excitation energy towards PSI (Wunder et al., 2013). This redistribution equilibrates excitation between the photosystems and ensures a smooth delivery of electrons to PSI acceptors (Bellafiore et al., 2005; Rochaix, 2014).

Although previous studies have established the evolutionary framework of the LHC gene family in green plants, a comprehensive understanding of its functional evolution and structural variation across photosynthetic organisms remains unclear. Our study elucidates the evolutionary diversity of the LHC gene family within photosynthetic organisms and identifies four distinct subfamilies: LHCA, LHCB, LHC-like, and FCP. We demonstrate that the evolution of photosynthetic antennae from green algae to land plants involved two major shifts: a transition in gene duplication from dispersed to whole-genome duplication, and a distinct trend of structural simplification in photosynthetic antennae. Furthermore, expression profiles of LHC genes exhibit tissue-specific expression patterns. Members of the LHCA and LHCB subfamilies are upregulated in response to both biotic and abiotic stresses, indicating an adaptive regulatory strategy. This study enhances our understanding of the role of the LHC gene family in the evolution of photosynthesis.

2. Materials and methods 2.1. Identification and phylogenetic analysis of LHC superfamily

To systematically identify the homology of LHC genes across various photosynthetic organisms, we selected genomic data from 65 representative species, covering Cyanobacteria, Dinophyta, Chlorarachniophyta, Cryptophyta, Haptophyta, Ochrophyta (Phaeophyceae), Bacillariophyta, Chlorophyta, Charophyta, Glaucophyta, Rhodophyta, Pteridophyta, Lycopodiophyta, Bryophyta, Gymnosperms, and Angiosperms. The 65 genomic data were primarily obtained from Plant JGI portal and National Center for Biotechnology Information (https://ncbi.nlm.nih.gov/genome/) (Table S1). To identify homologous genes of LHC genes, we retrieved the PF00504 (Chlorophyll A–B binding protein) seed file from Pfam database (Mistry et al., 2021) and performed Hidden Markov Model (HMM) searches across the 65 genomes. We further utilized the InterProScan (Quevillon et al., 2005) and CDD database (https://ncbi.nlm.nih.gov/cdd) (Marchler-Bauer et al., 2015) to verify the presence of conserved domain. The homologous sequences were aligned using MAFFT (Katoh et al., 2005), and trimmed (-gt = 0.2) using TrimAL (Capella-Gutierrez et al., 2009). Based on the Bayesian information criterion, the optimal model LG + F + G4 was selected for constructing the maximum likelihood (ML) phylogenetic tree using IQ-TREE (v.1.6.11) (Nguyen et al., 2015), with 1000 bootstrap replicates. Protein sequences of the four monophyletic clades: LHCA, LHCB, LHC-like, and FCP, were extracted and reconstructed their orthologous trees using the above mentioned pipeline. The phylogenetic trees were visualized using iTOL (Letunic and Bork, 2021). The physicochemical properties of LHC members were analyzed using the “Protein Parameter Calc (based on ProtParam)” tool in TBtools (Chen et al., 2020).

2.2. Expansion mechanism analysis of LHC in green plants

To elucidate the expansion mechanism of LHC gene family, we conducted gene duplication and synteny analyses across eight representatives of green plants with chromosome-level genomes: green algae (Chlamydomonas reinhardtii, Chromochloris zofingiensis and Prasinoderma colonial), bryophytes (Physcomitrella patens), ferns (Alsophila spinulosa), gymnosperms (Cycas panzhihuaensis) and angiosperms (Arabidopsis thaliana and Oryza sativa). The MCScanX (Wang et al., 2012) was utilized for collinearity analysis, and genes were classified into five categories: single, dispersed, proximal, tandem, and segmental/whole-genome duplication (WGD). The genomic locations of LHC genes were visualized using TBtools (Chen et al., 2020).

2.3. Structural analyses of PSI-LHCI and PSII-LHCII between green algae and land plants

The structural models of PSI-LHCI and PSII-LHCII were obtained from the RCSB Protein Data Bank (https://www.rcsb.org/). The retrieved structural models of representatives include PSI-LHCI (PDB ID: 6IJO) and PSII-LHCII (PDB ID: 6KAF) from Chlamydomonas reinhardtii, PSI-LHCI (PDB ID: 6L35) and PSII-LHCII (PDB ID: 5XNM) from Physcomitrium patens, and PSI-LHCI from Pisum sativum (PDB ID: 4XK8). Comparison and visualization of these complex structures was performed using the UCSF Chimera tool (Pettersen et al., 2004).

2.4. Gene expression analysis

The FPKM values for members of the LHC gene family in Arabidopsis thaliana and Oryza sativa were obtained from the PlantRNADB database (https://plantrnadb.com/) (Zhang et al., 2020). This dataset encompasses expression profiles across various tissues (roots, leaves, seedlings, stems, etc.) under different stress conditions (salt, drought, heat, etc.). For each gene, expression values from all biological replicates under the same condition were averaged to generate the final dataset for plotting. Using the pheatmap package in R, we analyzed the expression patterns of LHC gene family members in both tissue-specific contexts and under biotic and abiotic stress conditions.

3. Results and discussion 3.1. Phylogenetic and structural analyses of LHCA and LHCB

Among 65 photosynthetic organisms, a total of 1922 LHC proteins were identified and classified into four clades: LHCA, LHCB, LHC-like and FCP (Fig. S1). LHCA subfamily only existed in green plants and was categorized into 12 branches. Among these, Lhca7, Lhca8, and Lhca9 are green algae-specific clades (Fig. 1A). The Lhcp clade, located at the base of LHCA subfamily, is exclusively found in chlorophytes and charophytes (Mesostigma viride) (Fig. 1A). The LHCB subfamily is also unique to green plants and has 7 branches. Among them, the LHCBM in green algae forms a stable monophyletic group (Fig. 1B). CP29 and CP24 phylogenetically clustered within the LHCA clade, sharing conserved gene structures and regulatory sequences with LHCA members, suggesting that CP29, CP24, and LHCA members shared a common ancestors (Chukhutsina et al., 2020).

Fig. 1 Phylogeny and structural analysis of PSI and PSII. (A). The phylogenetic tree of the LHCA subfamily. (B). The phylogenetic tree of the LHCB subfamily. The detailed information, including gene IDs and bootstrap values, for Fig. 1A and B is provided in Figs. S2 and S3, respectively. (C).The left PSI-LHCI structure represents Chlamydomonas reinhardtii from the Chlorophyta (PDB ID: 6IJO). The middle PSI-LHCI structure corresponds to Physcomitrium patens from the Bryophyta (PDB ID: 6L35). The right PSI-LHCI structure pertains to Pisum sativum from the Angiospermae (PDB ID: 4XK8). the Cyanidioschyzon merolae PSI-LHCI supercomplex is provided in Fig. S4 (D). The PSII-LHCII complexes from C (PDB ID: 6KAF) and P. sativum (PDB ID: 5XNM).

To determine how structure relates to function in these LHC members, we conducted comparative structural analyses of PSI-LHCI and PSII-LHCII complexes. In Chlamydomonas reinhardtii, PSI-LHCI forms a two-layer crescent structure, with the P1–P4 sites occupied by Lhca1, 8, 7, 3 (inner) and Lhca1, 4, 6, 5 (outer) (Fig. 1C). The C. reinhardtii Lhca2 (hereafter CrLhca2) and CrLhca9 proteins, located at the P5 and P6 sites respectively, forming a heterodimer (Fig. 1C) (Huang et al., 2021). CrLhca3, CrLhca8, and CrLhca7 were classified in the Lhca3/5/7/8 clade, while CrLhca4, CrLhca6, and CrLhca5 were assigned to the Lhca2/4/6 clade (Fig. 1C), The sister-clade divergence of LHCA proteins in the phylogeny corresponds directly to their spatial layering in the PSI peripheral antenna complex. In contrast, the PSI-LHCI complex in Physcomitrium patens and Pisum sativum only possess a single-layer LHCA protein occupying the P1–P4 sites (Fig. 1C). Notably, the P2 site exhibits differences between the two species. In P. sativum, this position is occupied by Lhca4 (Qin et al., 2015). In P. patens, earlier work suggested it was Lhca5, but recent structural evidence identifies it as a unique paralogue, Lhca2b, forming a distinctive Lhca1-Lhca2b-Lhca2a-Lhca3 antenna. The other three subunits remain conserved (Gorski et al., 2022; Iwai et al., 2024; Liu et al., 2025; Yan et al., 2021). These two dimers exhibit distinct functional characteristics: the Lhca1-Lhca4 dimer in P. sativum specifically binds chlorophyll a (Chla 614) and lutein (LUT 320), whereas the Lhca1-Lhca5 dimer in P. patens lacks this ability (Mazor et al., 2017). Although both dimers contain proteins from the Lhca1 clade, Lhca4 is classified under the Lhca2/4/6 clade, whereas Lhca5 belongs to the Lhca3/5/7/8 clade. We hypothesize that clade-specific residue variations in the pigment-binding pockets could underlie the observed differences in pigment binding between Lhca4-and Lhca5-containing dimers. In contrast to the LHCI complexes found in green plants and algae, the PSI-LHCI complex in the red alga Cyanidioschyzon merolae exhibits a significantly simpler architecture. Comparative studies indicate that it lacks PsaG and retains only three Lhcr subunits, resulting in a vacancy at the Lhca1 binding site (Fig. S4) (Antoshvili et al., 2019; Kumazawa and Ifuku, 2024).

The LHCII antenna system, the primary light-harvesting complex of PSII, is organized as trimers at the periphery of core PSII complex (Fig. 1D). These trimers are predominantly composed of Lhcb1, Lhcb2, and Lhcb3, conserved across land plants. However, other peripheral antenna organization differs fundamentally between green algae and angiosperms. In Chlamydomonas reinhardtii, the antenna comprises three LHCII trimers associated with CP26 and CP29, while in angiosperms, the N-LHCII trimer is replaced by CP24, leading to a simplified monomeric arrangement (Fig. 1D). This divergence in peripheral antenna architecture likely reflects adaptations to distinct environments. The more complex antenna in C. reinhardtii, which features an additional trimer with CP26/CP29, may enhance light harvesting under fluctuating aquatic light conditions (Li and Dohlman, 2023). Conversely, in angiosperms, the incorporation of CP24 and the loss of trimers potentially optimize energy transfer and photoprotection in stable, high light intensities characteristic of terrestrial habitats (Suga and Shen, 2020). These variations highlight how the organization of PSII antennae is tailored to meet specific environmental demands across different photosynthetic lineages.

3.2. Phylogeny of LHC-like and FCP subfamily among photosynthetic organisms

The single-helix HLIP in cyanobacteria serves as the structural prototype of LHC family, from which eukaryotes have evolved more complex variants: the dual-helix SEP, the triple-helix ELIP, and the quadruple-helix PsbS (Fig. 2A and C). Notably, OHP1 and OHP2, the eukaryotic homologs of HLIP, have retained the single-helix architecture and diverged into two functionally distinct clades (Fig. 2A). SEP is widely distributed within the green lineage and facilitates NPQ by promoting direct energy transfer from chlorophyll a to lutein (Levin and Schuster, 2023). ELIP, the most abundant member of LHC-like branch, accumulates under high light stress, binding to chlorophyll a and lutein, effectively eliminating excited-state chlorophyll and thereby preventing the formation of ROS (Hutin et al., 2003). PsbS is present in green plants, typically existing as a single copy in most species (Fig. 2A). It acts as a key regulator of pH-dependent photoprotection in land plants, undergoing conformational changes that enhance its interaction with other LHC proteins, such as CP29, thus facilitating energy quenching (Levin and Schuster, 2023). This indicates that PsbS likely originated prior to the aquatic-terrestrial split, but its central role in photoprotection was refined during the evolution of land plants.

Fig. 2 The phylogenetic tree and evolutionary roadmap of LHC proteins. (A). Phylogenetic tree of LHC-like subfamily. (B). Phylogenetic tree of FCP subfamily. The detailed information of A and B is provided in Figs. S5 and S6, respectively. (C and D). Structural diagrams of members of LHC-like and FCP subfamilies. (E). Evolutionary roadmap of LHC genes. The arrows indicate endosymbiotic relationships. Cyanobacteria evolved into plastids in three major groups: red algae, glaucophytes, and green algae through primary endosymbiosis. Green algae produce plastids in chlorarachniophytes via secondary endosymbiosis. Red algae generate cryptophyte plastids through secondary endosymbiosis. Cryptophytes produce plastids in haptophytes and stramenopiles through tertiary endosymbiosis. Haptophytes produce plastids in dinoflagellates through quaternary endosymbiosis. The arrows denote the direction of endosymbiotic events. The color of the arrows corresponds to distinct endosymbiotic origins: blue for cyanobacteria, green for green algae, purple for rhodophytes, light blue for cryptophytes, and orange for haptophytes. Thick lines denote major lineages: dark/light green, green lineage; gray, glaucophyte lineage; red/orange, red lineage.

The FCP clade comprises four subtypes: Lhcx (LhcSR), Lhcf, Lhcr, and Lhcz. Lhcx (LhcSR) is widely distributed among Chlorophytes, Charophytes, Bryophytes, and the SAR group, as well as Haptophytes. This subtype forms a stable monophyletic group and serves as the executor of NPQ in green algae. However, it is absent in angiosperms, where its function is replaced by PsbS. In contrast, Lhcf, Lhcr, and Lhcz are specifically found in unequal flagellates, fixed flagellates, and cryptophytes (Fig. 2B). These organisms contain fucoxanthin, as FCP specifically bind this pigment to facilitate light harvesting, and these species possess multiple copies of FCP to support this function. All members of the FCP family exhibit a triple helix structure (Fig. 2D). This structural combination provides the FCP complex with a significant competitive advantage, enabling it to efficiently absorb and utilize the most penetrating blue light (450–550 nm) and green light (500–580 nm) wavelengths in marine surface environments, where chlorophyll b and carotenoids in green plants exhibit relatively weak absorption capabilities (Wang et al., 2019).

The phylogeny of the LHC superfamily elucidates its intricate evolutionary trajectory, characterized by multiple endosymbiotic events, gene duplications, and gene losses (Fig. 2E). The LHC-like proteins represent the earliest diverging members within the LHC superfamily. Red algae acquired ancestral OHP, SEP, and ELIP proteins from their cyanobacterial endosymbiont, evolving from HLIP precursors. Through gene duplication, these proteins diversified into two functional groups: the OHP/SEP/ELIP proteins for photoprotection, and specialized Lhcr proteins for light harvesting (Fig. 2E). In contrast, green algae acquired OHP, SEP, ELIP, and PsbS proteins, while their repertoire of light-harvesting genes further diversified into Lhcx, Lhcf, Lhca1-9, and Lhcb4/5/7. Among the secondary endosymbiotic groups of green algae, the chlorarachniophyta acquired the Lhcb4/5 proteins while losing the Lhcb7 protein. The secondary endosymbiotic group of red algae, known as cryptoalgae, obtained ELIP, Lhcr, and Lhcz proteins. For instance, Lhcf and Lhcr originated from red algal endosymbionts, whereas Lhcz is specific to cryptophytes and may have been transferred to haptophytes. Ultimately, OHP, ELIP, Lhcf, and Lhcr proteins were acquired from dinoflagellates that originated through symbiosis across four levels (Fig. 2E). Within the LHC-like subfamily, the nuclear-encoded OHP gene evolved from the ancestral HLIP gene of cyanobacteria. Through gene duplication, proteins such as SEP, PsbS, and ELIP emerged, exhibiting diverse structures and functions. This phenomenon illustrates the functional transitions followed by structural evolution (Hohmann-Marriott and Blankenship, 2011; Neilson and Durnford, 2010a).

3.3. High diversity of LHC revealed by molecular characteristics

We conducted a statistical analysis to investigate the molecular variation of LHC superfamily across photosynthetic organisms. LHCA and LHCB are predominantly found in green algae, charophytes, and vascular plants, with LHCB consistently outnumbering LHCA in these lineages (Fig. 3A). The coefficient of variation (CV) for LHC-like subfamily is 91.5%, which is more than twice that of LHCA (42%) and LHCB (58%), reflecting differential evolutionary constraints among these subfamilies. The FCP subfamily exhibits a more extreme distribution pattern: it is particularly found in SAR superclusters. The coefficient of variation for FCP is 191%, indicating the highest variability among all LHC family members (Fig. 3A).

Fig. 3 Sequence characteristics and protein properties of LHCs. (A). The distribution of normalized LHC family members across the selected species is illustrated. The heat map has undergone Z-score standardization processing, and the numbers represent the original LHC quantities. (B). Characteristic features, including molecular weight, isoelectric point, and the number of amino acids for the four subfamilies are presented. The calculation formula of the Coefficient of Variation (CV) is: CV = (Standard deviation/mean) × 100%. Comparison of typical values: The typical molecular weight of LHCB is 27.8 kDa. The typical molecular weight of LHCA is 25.2 kDa. Calculation of absolute difference:27.8 kDa–25.2 kDa = 2.6 kDa. Percentage difference calculation: (2.6/25.2) × 100% ≈ 10.3%. (C). Within the LHC proteins sequences, two conserved motifs are identified, with ExxNxR being the most highly conserved.

Physicochemical property characteristics of LHC superfamily demonstrated evident differences in amino acid length, molecular weight, and isoelectric point (pI) among distinct subfamilies (Fig. 3B). LHCB subfamily exhibits molecular weight ranging from 24 to 31 kDa. In contrast, the LHC-like subfamily displays the broadest range of molecular weights, spanning from 18 to 35 kDa, and includes the highest observed molecular weight of 35 kDa among all groups. This observation is consistent with its sequence divergence. Both LHCA (pI 5.2–6.0) and LHCB (pI 4.8–5.7) subfamily members are classified as weakly acidic, which facilitates their binding to PSI and PSII. In contrast, FCP subfamily has the lowest pI (3.9–5.1). The LHC-like family exhibits the largest variation in pI (from 7.3 to 10.0, with a difference of 2.7) (Fig. 3B). In terms of protein length, LHCB exhibits the longest mean length, whereas LHC-like demonstrates the shortest mean length (Fig. 3B).

We identified two highly conserved motifs within LHC superfamily, which share a common conserved region characterized by ‘ExxNxRxAM’ (Fig. 3C). These two highly conserved motifs mainly localize in functional domains involved in chlorophyll ligation and exciton transfer. Purifying selection maintains the conservation of these motifs because they constitute key pigment-binding sites indispensable for light-harvesting function (Li and Dohlman, 2023). Comparative sequence analysis of Lhca1-4 and Lhcb1/Lhcb4-6 proteins revealed highly conserved structural features at most pigment-binding sites. Specific variations were observed at certain positions. For instance, in M. commoda Lhca1, glutamine is replaced by glutamate (Q to E) at the Chl613 ligand position. Similarly, in C. reinhardtii Lhca2, histidine is substituted with asparagine (H to N) at the Chl603 coordination site (Figs. S8–15 and Tables S2–S8). These specific modifications represent adaptive changes that maintain core pigment-binding functionality while enabling localized structural optimization. Such variations facilitate precise tuning of protein-pigment interactions without compromising the fundamental light-harvesting capabilities of the complex (Su et al., 2019).

3.4. Gene duplication and synteny analysis of LHC superfamily

To elucidate the evolutionary dynamics of LHC superfamily across green plants, we systematically analyzed their genomic distribution and duplication patterns in six representative species: Arabidopsis thaliana, Oryzia sativa, Cycas panzhihuaensis, Alsophila spinulosa, Physcomitrium patens and Chlamydomonas. Ranging from green algae to angiosperms, all identified LHC family members showed widespread and uneven distribution patterns across chromosomes (Fig. 4). Synteny analysis indicated that WGD events were absent in green algae, dispersed duplication accounts for the majority of LHC gene copies (Fig. 4A and Fig. S5). In P. patens and A. spinulosa, WGD events resulted in an overall doubling of its genome, simultaneously increasing the number of WGD-type LHC genes, meanwhile, the number of dispersed duplicate genes also increased (Fig. 4B and C and Table 1). In C. panzhihuaensis, LHC genes predominantly experienced dispersed-type and tandem-type duplications, with the tandem-type duplications significantly enriched on chromosome 7 (Fig. 4D and Table 1). Eight of the nine tandem-type LHC gene copies are localized on chromosome 7, underscoring a chromosomal preference for tandem duplication events. These results suggested that chromosome 7 of C. panzhihuaensis may serve as a lineage-specific hotspot for the expansion of LHC genes through localized replication mechanisms. Although WGD events have been observed in angiosperms, dispersed type LHC genes were the most prevalent, separately comprising 48% and 67% of their LHC gene copies (Fig. 4E and F; Table 1). This observation indicates that, following these WGD events, dispersed-type LHC genes have maintained a quantitative dominance.

Fig. 4 Synteny analysis of LHC genes across six representative species (A–F). The gene names adjacent to each chromosome correspond to the approximate locations of the respective LHC genes. Gray lines represent all synteny blocks within the entire genome of each species. The blue, green, light blue, purple, and black lines denote singleton, dispersed, proximal, tandem, and whole genome duplication WGD/segmental duplication, respectively. The collinearity among LHC genes in Chromochloris zofingiensis and Prasinoderma colonial is illustrated in Fig. S16.

Table 1 The number of LHC genes from different sources in representative species.
Group Species Gene numbers Singleton Dispersed Proximal Tandem WGD or segmental
Green alga Chlamydomonasreinhardtii 37 1 28 3 5 0
Chromochloris zofingiensis 36 5 29 2 0 0
Prasinoderma colonial 39 3 26 2 8 0
Bryophytes Physcomitrium patens 75 1 24 17 26 7
Ferns Alsophila spinulosa 89 1 57 4 8 19
Gymnosperms Cycas panzhihuaensis 23 1 9 4 9 0
Angiosperms Arabidopsis thaliana 29 4 14 0 5 6
Oryza sativa 24 2 16 0 2 4
352 18 203 32 63 36

Our analysis revealed a distinct expansion pattern: green algae primarily rely on dispersed duplication, whereas vascular plants synergistically employ WGD and dispersed duplication. Especially, the dispersed patterns found in angiosperms may represent an evolutionary mode aimed at preserving functional diversity following WGD (Wang et al., 2011). These differences likely reflect distinct selective pressures encountered during terrestrial adaptation (Van de Peer et al., 2017).

3.5. Expression patterns of LHC in Arabidopsis thaliana and Oryza sativa

To explore the potential functions of LHC superfamily members, we analyzed the expression patterns of LHC genes under various stress conditions in A. thaliana and O. sativa. The LHC superfamily members exhibited tissue-specific expression patterns (Fig. 5A and D). LHCA and LHCB subfamilies are constitutively highly expressed in photosynthetic tissues (such as leaf, meristem), reflecting their functions in formation photosynthetic antenna systems. The variability in expression levels among different tissues for LHCB subfamily members is greater than LHCA subfamily members. In contrast, LHC-like subfamily members displayed unique pattern: specifically expressed in seedlings and stems of A. thaliana, and predominantly expressed in stem and panicle tissues of O. sativa. The expression abundance of LHC-like subfamily members was not directly correlating with the photosynthetically active tissues (Fig. 5A and D).

Fig. 5 Expression profiles of LHC genes in representative species. (A–C) The tissue-specific expression, abiotic stress expression, and biotic stress expression profiles of LHC genes in Arabidopsis thaliana. (D–F). The tissue-specific expression, abiotic stress expression, and biotic stress expression profiles of LHC genes in Oryza sativa. The expression pattern of the LHC gene in Selaginella moellendorffii is similar to that in A. thaliana and O. sativa, and the results are shown in Fig. S17.

LHCA and LHCB subfamily members exhibited consistent upregulation under salt, drought, and heat stresses (Fig. 5B and E), whereas LHC-like subfamily showed species-specific patterns in response to different stresses. In Arabidopsis thaliana, LHC-like subfamily members were specifically induced by oxidative and irradiated stress. In Oryza sativa, only water deficit triggered the induction of LHC-like subfamily members, with no significant response to dark, shade and cold stressors (Fig. 5E). This contrast reveals that the core photosynthetic components (LHCA/B) and LHC-like subfamily of the LHC superfamily have evolved distinct functional roles. In response to biotic stress, both LHCA and LHCB subfamilies showed marked upregulation, with their expression levels increased by approximately 2- to 3-fold compared to constitutive expression levels. Conversely, LHC-like genes maintained consistently lower expression levels compared to LHCA and LHCB, mirroring their minimal responsiveness to abiotic stressors observed previously (Fig. 5C and F). This persistent expression dichotomy suggests distinct functional specialization within the LHC superfamily. These results demonstrate that LHCA and LHCB subfamily maintain stable photosynthetic protection across abiotic and biotic stresses, while LHC-like members exhibit species-specific responses. These findings offer molecular evidence for comprehending the functional diversity of the LHC superfamily in the context of plant evolution and adaptations.

In summary, this study systematically investigates the evolutionary dynamics of the LHC superfamily across 65 photosynthetic organisms, revealing new insights into phylogenetic diversification, structural adaptation, and functional specialization. Our phylogenetic analyses classify LHC proteins into four major clades: LHCA, LHCB, LHC-like, and FCP, with LhcSR belonging to the FCP subfamily. Primary endosymbiosis introduced ancestral proteins, and secondary endosymbiosis drove lineage-specific gains and losses. Green algae primarily rely on dispersed duplication for the expansion of LHC, whereas land plants increasingly employ WGD during terrestrial adaptation. Expression profiling further demonstrates functional diversification: core LHCA and LHCB genes exhibit constitutive expression in photosynthetic tissues with strong induction under biotic and abiotic stresses, confirming their conserved role in maintaining photosynthetic activity. These findings elucidate how the structural and functional diversification of LHC underpin photosynthetic adaptations to various environments, providing a framework for future studies on LHC-mediated stress tolerance mechanism and photosynthetic efficiency enhancement.

Acknowledgements

This work was supported by the National Natural Science Foundation of China (32370228, W2511024 and 32470232) (B.Z. and Z.Z.); the Natural Science Foundation of Jiangsu Province (BK20250004) (B.Z.); the Collaborative Innovation Center for Modern Crop Production co-sponsored by Province and Ministry (B.Z.); the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD) (B.Z.).

CRediT authorship contribution statement

Kexin Cai: Writing-review & editing, Writing-original draft, Visualization, Project administration, Funding acquisition, Formal analysis, Data curation, Conceptualization. Runjie Diao, Zhihang Zhao, Yannan Liu and Zhenhua Zhang: Writing-original draft, Resources, Formal analysis. Bojian Zhong: Writing-original draft, Project administration, Investigation, Conceptualization.

Data availability statement

The datasets presented in this study can be found in online repositories: https://figshare.com/articles/dataset/LHC_phylogeny_data/29939495?file=57277457.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Appendix A. Supplementary data

Supplementary data to this article can be found online at https://doi.org/10.1016/j.pld.2026.01.004.

References
Alboresi, A., Gerotto, C., Giacometti, G.M., et al., 2010. Physcomitrella patens mutants affected on heat dissipation clarify the evolution of photoprotection mechanisms upon land colonization. Proc. Natl. Acad. Sci. U.S.A., 107: 11128-11133. DOI:10.1073/pnas.1002873107
Antoshvili, M., Caspy, I., Hippler, M., et al., 2019. Structure and function of photosystem I in Cyanidioschyzon merolae. Photosynth. Res., 139: 499-508. DOI:10.1007/s11120-018-0501-4
Bag, P., 2021. Light harvesting in fluctuating environments: Evolution and function of antenna proteins across photosynthetic lineage. Plants, 10: 1184. DOI:10.3390/plants10061184
Bassi, R., Dall’Osto, L., 2021. Dissipation of light energy absorbed in excess: the molecular mechanisms. Annu. Rev. Plant Biol., 72: 47-76. DOI:10.1146/annurev-arplant-071720-015522
Bellafiore, S., Barneche, F., Peltier, G., et al., 2005. State transitions and light adaptation require chloroplast thylakoid protein kinase STN7. Nature, 433: 892-895. DOI:10.1038/nature03286
Ben-Shem, A., Frolow, F., Nelson, N., 2003. Crystal structure of plant photosystem I. Nature, 426: 630-635. DOI:10.1038/nature02200
Betterle, N., Ballottari, M., Zorzan, S., et al., 2009. Light-induced dissociation of an antenna hetero-oligomer is needed for non-photochemical quenching induction. J. Biol. Chem., 284: 15255-15266. DOI:10.1074/jbc.M808625200
Buchel, C., 2015. Evolution and function of light harvesting proteins. J. Plant Physiol., 172: 62-75. DOI:10.1016/j.jplph.2014.04.018
Capella-Gutierrez, S., Silla-Martinez, J.M., Gabaldon, T., 2009. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics, 25: 1972-1973. DOI:10.1093/bioinformatics/btp348
Chen, C., Chen, H., Zhang, Y., et al., 2020. TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol. Plant, 13: 1194-1202. DOI:10.1016/j.molp.2020.06.009
Chmeliov, J., Gelzinis, A., Songaila, E., et al., 2016. The nature of self-regulation in photosynthetic light-harvesting antenna. Nat. Plants, 2: 16045. DOI:10.1038/nplants.2016.45
Chukhutsina, V.U., Liu, X., Xu, P., et al., 2020. Light-harvesting complex Ⅱ is an antenna of photosystem I in dark-adapted plants. Nat. Plants, 6: 860-868. DOI:10.1038/s41477-020-0693-4
Drop, B., Yadav S., K.N., Boekema, E.J., et al., 2014. Consequences of state transitions on the structural and functional organization of photosystem I in the green alga Chlamydomonas reinhardtii. Plant J., 78: 181-191. DOI:10.1111/tpj.12459
Durnford, D.G., Deane, J.A., Tan, S., et al., 1999. A phylogenetic assessment of the eukaryotic light-harvesting antenna proteins, with implications for plastid evolution. J. Mol. Evol., 48: 59-68. DOI:10.1007/PL00006445
Engelken, J., Brinkmann, H., Adamska, I., 2010. Taxonomic distribution and origins of the extended LHC (light-harvesting complex) antenna protein superfamily. BMC Evol. Biol., 10: 233. DOI:10.1186/1471-2148-10-233
Eppard, M., Krumbein, W.E., von Haeseler, A., et al., 2000. Characterization of fcp4 and fcp12, two additional genes encoding light harvesting proteins of Cyclotella cryptica (Bacillariophyceae) and phylogenetic analysis of this complex gene family. Plant Biol., 2: 283-289. DOI:10.1055/s-2000-3702
Gorski, C., Riddle, R., Toporik, H., et al., 2022. The structure of the Physcomitrium patens photosystem I reveals a unique Lhca2 paralogue replacing Lhca4. Nat. Plants, 8: 307-316. DOI:10.1038/s41477-022-01099-w
Green, B.R., 2019. What happened to the phycobilisome?. Biomolecules, 9: 748. DOI:10.3390/biom9110748
Hohmann-Marriott, M.F., Blankenship, R.E., 2011. Evolution of photosynthesis. Annu. Rev. Plant Biol., 62: 515-548. DOI:10.1146/annurev-arplant-042110-103811
Huang, Z., Shen, L., Wang, W., et al., 2021. Structure of photosystem I-LHCI-LHCII from the green alga Chlamydomonas reinhardtii in state 2. Nat. Commun., 12: 1100. DOI:10.1038/s41467-021-21362-6
Hutin, C., Nussaume, L., Moise, N., et al., 2003. Early light-induced proteins protect Arabidopsis from photooxidative stress. Proc. Natl. Acad. Sci. U.S.A., 100: 4921-4926. DOI:10.1073/pnas.0736939100
Iwai, M., Patel-Tupper, D., Niyogi, K.K., 2024. Structural diversity in eukaryotic photosynthetic light harvesting. Annu. Rev. Plant Biol., 75: 119-152. DOI:10.1146/annurev-arplant-070623-015519
Jansson, S., 1999. A guide to the Lhc genes and their relatives in Arabidopsis. Trends Plant Sci., 4: 236-240. DOI:10.1016/S1360-1385(99)01419-3
Jansson, S., Green, B., Grossman, A.R., et al., 1999. A proposal for extending the nomenclature of light-harvesting proteins of the three transmembrane helix type. Plant Mol. Biol. Rep., 17: 221-224. DOI:10.1023/A:1007620508007
Jiang, Q., Xu, Z.S., Wang, F., et al., 2014. Effects of abiotic stresses on the expression of Lhcb1 gene and photosynthesis of Oenanthe javanica and Apium graveolens. Biol. Plant. (Prague), 58: 256-264. DOI:10.1007/s10535-014-0396-7
Katoh, K., Kuma, K., Toh, H., et al., 2005. MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res., 33: 511-518. DOI:10.1093/nar/gki198
Keeling, P.J., 2010. The endosymbiotic origin, diversification and fate of plastids. Philos. Trans. R. Soc. B-Biol. Sci., 365: 729-748. DOI:10.1098/rstb.2009.0103
Koziol, A.G., Borza, T., Ishida, K.-I., et al., 2007. Tracing the evolution of the light-harvesting antennae in chlorophyll a/b-Containing organisms. Plant Physiol., 143: 1802-1816. DOI:10.1104/pp.106.092536
Kumazawa, M., Ifuku, K., 2024. Unraveling the evolutionary trajectory of LHCI in red-lineage algae: conservation, diversification, and neolocalization. iScience, 27: 110897. DOI:10.1016/j.isci.2024.110897
Lan, Y., Song, Y., Zhao, F., et al., 2022. Phylogenetic, structural and functional evolution of the LHC gene family in plant species. Int. J. Mol. Sci., 24: 488. DOI:10.3390/ijms24010488
Letunic, I., Bork, P., 2021. Interactive tree of life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res., 49: W293-W296. DOI:10.1093/nar/gkab301
Levin, G., Schuster, G., 2023. LHC-like proteins: the guardians of photosynthesis. Int. J. Mol. Sci., 24: 2503. DOI:10.3390/ijms24032503
Li, S., Dohlman, H.G., 2023. Evolutionary conservation of sequence motifs at sites of protein modification. J. Biol. Chem., 299: 104617. DOI:10.1016/j.jbc.2023.104617
Li, X.P., BjoÈrkman, O., Shih, C., et al., 2000. A pigment-binding protein essential for regulation of photosynthetic light harvesting. Nature, 403: 391-395. DOI:10.1038/35000131
Liu, D., Yan, Q., Qin, X., et al., 2025. Ultrafast kinetics of PSI-LHCI super-complex from the moss Physcomitrella patens. Biochim. Biophys. Acta Bioenerg., 1866: 149526. DOI:10.1016/j.bbabio.2024.149526
Marchler-Bauer, A., Derbyshire, M.K., Gonzales, N.R., et al., 2015. CDD: ncbi’s conserved domain database.. Nucleic Acids Res., 43: D222-D226. DOI:10.1093/nar/gku1221
Mazor, Y., Borovikova, A., Caspy, I., et al., 2017. Structure of the plant photosystem I supercomplex at 2.6 Å resolution. Nat. Plants, 3: 17014. DOI:10.1038/nplants.2017.14
Merchant, S.S., Prochnik, S.E., Vallon, O., et al., 2007. The chlamydomonas genome reveals the evolution of key animal and plant functions. Science, 318: 245-250. DOI:10.1126/science.1143609
Mistry, J., Chuguransky, S., Williams, L., et al., 2021. Pfam: the protein families database in 2021. Nucleic Acids Res., 49: D412-D419. DOI:10.1093/nar/gkaa913
Nagao, R., Kato, K., Suzuki, T., et al., 2019. Structural basis for energy harvesting and dissipation in a diatom PSII-FCPII supercomplex. Nat. Plants, 5: 890-901. DOI:10.1038/s41477-019-0477-x
Neilson, J.A., Durnford, D.G., 2010a. Evolutionary distribution of light-harvesting complex-like proteins in photosynthetic eukaryotes. Genome, 53: 68-78. DOI:10.1139/G09-081
Neilson, J.A., Durnford, D.G., 2010b. Structural and functional diversification of the light-harvesting complexes in photosynthetic eukaryotes. Photosynth. Res., 106(1–2): 57-71. DOI:10.1007/s11120-010-9576-2
Nguyen, L.T., Schmidt, H.A., von Haeseler, A., et al., 2015. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol., 32: 268-274. DOI:10.1093/molbev/msu300
Opatíková, M., Kouřil, R., 2024. Unique structural attributes of the PSI-NDH supercomplex in Physcomitrium patens. Plant J., 120: 2226-2237. DOI:10.1111/tpj.17116
Pan, X., Ma, J., Su, X., et al., 2018. Structure of the maize photosystem Ⅰ supercomplex with light-harvesting complexes Ⅰ and Ⅱ. Science, 8360: 1109-1113. DOI:10.1126/science.aat1156
Peers, G., Truong, T.B., Ostendorf, E., et al., 2009. An ancient light-harvesting protein is critical for the regulation of algal photosynthesis. Nature, 462: 518-521. DOI:10.1038/nature08587
Peng, L., Shikanai, T., 2011. Supercomplex formation with photosystem I is required for the stabilization of the chloroplast NADH dehydrogenase-like complex in arabidopsis. Plant Physiol., 155: 1629-1639. DOI:10.1104/pp.110.171264
Pettersen, E.F., Goddard, T.D., Huang, C.C., et al., 2004. UCSF Chimera--a visualization system for exploratory research and analysis. J. Comput. Chem., 25: 1605-1612. DOI:10.1002/jcc.20084
Qin, X., Suga, M., Kuang, T., et al., 2015. Structural basis for energy transfer pathways in the plant PSI-LHCI supercomplex. Science, 348: 989-995. DOI:10.1126/science.aab0214
Quevillon, E., Silventoinen, V., Pillai, S., et al., 2005. InterProScan: protein domains identifier. Nucleic Acids Res., 33: W116-W120. DOI:10.1093/nar/gki442
Rochaix, J.-D., 2014. Regulation and dynamics of the light-harvesting system. Annu. Rev. Plant Biol., 65: 287-309. DOI:10.1146/annurev-arplant-050213-040226
Scholes, G.D., Fleming, G.R., Olaya-Castro, A., et al., 2011. Lessons from nature about solar light harvesting. Nat. Chem., 3: 763-774. DOI:10.1038/nchem.1145
Shen, L., Huang, Z., Chang, S., et al., 2019. Structure of a C2S2M2N2-type PSII–LHCII supercomplex from the green alga Chlamydomonas reinhardtii. Proc. Natl. Acad. Sci. U.S.A., 116: 21246-21255. DOI:10.1073/pnas.1912462116
Shen, L., Tang, K., Wang, W., et al., 2021. Architecture of the chloroplast PSI–NDH supercomplex in Hordeum vulgare. Nature, 601: 649-654. DOI:10.1109/i-smac52330.2021.9640942
Stadnichuk, I.N., Kusnetsov, V.V., 2023. Phycobilisomes and phycobiliproteins in the pigment apparatus of oxygenic photosynthetics: from cyanobacteria to tertiary endosymbiosis. Int. J. Mol. Sci., 24, 2290.
Su, Ma, Wei, et al., 2017. Structure and assembly mechanism of plant C2S2M2-type PSII-LHCII supercomplex. Science, 357: 815-820. DOI:10.1126/science.aan0327
Su, X., Ma, J., Pan, X., et al., 2019. Antenna arrangement and energy transfer pathways of a green algal photosystem-I-LHCI supercomplex. Nat. Plants, 5: 273-281. DOI:10.1038/s41477-019-0380-5
Suga, M., Shen, J.R., 2020. Structural variations of photosystem I-antenna supercomplex in response to adaptations to different light environments. Curr. Opin. Struct. Biol., 63: 10-17. DOI:10.1016/j.sbi.2020.02.005
Van de Peer, Y., Mizrachi, E., Marchal, K., 2017. The evolutionary significance of polyploidy. Nat. Rev. Genet., 18: 411-424. DOI:10.1038/nrg.2017.26
Wang, W., Yu, L.J., Xu, C., et al., 2019. Structural basis for blue-green light harvesting and energy dissipation in diatoms. Science, 363, eaav0365.
Wang, X., Wang, H., Wang, J., et al., 2011. The genome of the mesopolyploid crop species Brassica rapa. Nat. Genet., 43: 1035-1039. DOI:10.1038/ng.919
Wang, Y., Tang, H., Debarry, J.D., et al., 2012. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res., 40: e49. DOI:10.1093/nar/gkr1293
Wunder, T., Xu, W., Liu, Q., et al., 2013. The major thylakoid protein kinases STN7 and STN8 revisited: effects of altered STN8 levels and regulatory specificities of the STN kinases. Front. Plant Sci., 4: 417.
Yan, Q., Zhao, L., Wang, W., et al., 2021. Antenna arrangement and energy-transfer pathways of PSI–LHCI from the moss Physcomitrella patens. Cell Discov., 7: 10.
Zhang, H., Zhang, F., Yu, Y., et al., 2020. A comprehensive online database for exploring ~20,000 public arabidopsis RNA-seq libraries. Mol. Plant, 13: 1231-1233. DOI:10.1016/j.molp.2020.08.001
Zou, Z., 2018. Mining gene families in the Castor bean genome. The Castor Bean Genome: 135-173. DOI:10.1007/978-3-319-97280-0_8
Zou, Z., Yang, J., 2019. Genomics analysis of the light-harvesting chlorophyll a/b-binding (Lhc) superfamily in cassava (Manihot esculenta Crantz). Gene, 702: 171-181.