Supporting data for "Haplogenome assembly reveals interspecific structural variation in Eucalyptus hybrids"
Using Nanopore sequencing and a trio-binning approach, we assembled the separate haplogenomes (567 Mb and 545 Mb) to 97.9% BUSCO completion. High-density SNP genetic linkage maps of both parents allowed scaffolding of 88% of the haplogenome contigs into 11 pseudo-chromosomes (scaffold N50 of 43.82Mb and 42.45 Mb for the E. grandis and E. urophylla haplogenomes, respectively). We identify 48,729 SVs between the two haplogenomes providing the first detailed insight into genome structural rearrangement in these species. The two haplogenomes have similar gene content, 35,572 and 33,915 functionally annotated genes, of which 34% are contained in genome rearrangements.
Knowledge of SV and haplotype diversity in the two species will form the basis for understanding the genetic basis of hybrid superiority in these trees.