Using iterative identification and clustering methods, the researchers found that the West Indian variety of avocado contained 40,629 protein-coding genes and 4,879 non-coding RNAs. They predicted 12 chromosome-specific centromere sequences that had complex structures containing multiple types of transposable elements with low coverage depth of HiFi and ONT reads, particularly the long centromeric regions of chromosomes Pa03 and Pa07.
Using fluorescence in situ hybridization (FISH), they further verified the accuracy of all predicted centromeres, telomeres, and nucleolus organizer regions in the avocado genome sequence.
They identified 376 nucleotide-binding leucine-rich repeat receptor genes related to disease resistance and 128 genes related to fatty acid biosynthesis, among which Pa07g1095, Pa07g1091, and Pa12g0002 (FAD2 gene) were specifically expressed in fruit during the triacylglycerol formation stage.
Pa02g0113, which encodes one of 11 stearoyl-acyl carrier protein desaturases that mediate C18 unsaturated fatty acid synthesis, was more highly expressed in the leaves than in the stems and fruit.
"Our telomere-to-telomere genome assembly can provide the basis for future research on disease resistance and fatty acid biosynthesis in avocado," said YANG Yunqiang of XTBG.
86-10-68597521 (day)
86-10-68597289 (night)
86-10-68511095 (day)
86-10-68512458 (night)
cas_en@cas.cn
52 Sanlihe Rd., Xicheng District,
Beijing, China (100864)