Near Chromosome-Level Genome Assembly and Annotation of Rhodotorula babjevae Strains Reveals High Intraspecific Divergence

ORCID
0000-0003-1497-7755
Affiliation
Department of Molecular Sciences, Swedish University of Agricultural Sciences, 75007 Uppsala, Sweden; giselle.martin@slu.se (G.C.M.-H.); bettina.muller@slu.se (B.M.)
Martín-Hernández, Giselle C.;
ORCID
0000-0002-0030-7710
Affiliation
Department of Molecular Sciences, Swedish University of Agricultural Sciences, 75007 Uppsala, Sweden; giselle.martin@slu.se (G.C.M.-H.); bettina.muller@slu.se (B.M.)
Müller, Bettina;
GND
1164030272
ORCID
0000-0002-7199-3957
Affiliation
Institute for Infectious Diseases and Infection Control, Jena University Hospital, 07743 Jena, Germany; christian.brandt@med.uni-jena.de
Brandt, Christian;
ORCID
0000-0001-7090-8717
Affiliation
Method Development and Research Infrastructure, MF1 Bioinformatics, Robert Koch Institute, 13353 Berlin, Germany; hoelzerm@rki.de
Hölzer, Martin;
Affiliation
Institute of Medical Microbiology and Virology, University Hospital Leipzig, 04103 Leipzig, Germany; adrian.viehweger@medizin.uni-leipzig.de
Viehweger, Adrian;
ORCID
0000-0002-2059-9044
Affiliation
Department of Molecular Sciences, Swedish University of Agricultural Sciences, 75007 Uppsala, Sweden; giselle.martin@slu.se (G.C.M.-H.); bettina.muller@slu.se (B.M.)
Passoth, Volkmar

The genus Rhodotorula includes basidiomycetous oleaginous yeast species. Rhodotorula babjevae can produce compounds of biotechnological interest, such as lipids, carotenoids, and biosurfactants, from low-value substrates such as lignocellulose hydrolysate. High-quality genome assemblies are needed to develop genetic tools and to understand fungal evolution and genetics. Here, we combined short- and long-read sequencing to resolve the genomes of two R. babjevae strains, CBS 7808 (type strain) and DBVPG 8058, at the chromosomal level. Both genomes are 21 Mbp in size and have a GC content of 68.2%. Allele frequency analysis indicates that both strains are tetraploid. The genomes consist of a maximum of 21 chromosomes, ranging in size from 0.4 to 2.4 Mbp. In both assemblies, the mitochondrial genome was recovered in a single contig, and the two mitochondrial contigs shared 97% pairwise identity. Pairwise identity between most chromosomes ranges from 82 to 87%. We also found indications of strain-specific extrachromosomal endogenous DNA. A total of 7591 and 7481 protein-coding genes were annotated in CBS 7808 and DBVPG 8058, respectively. CBS 7808 accumulated a higher number of tandem duplications than DBVPG 8058. We identified large translocation events between putative chromosomes. Genome divergence values between the two strains indicate that they may belong to different species.

Rights

Rights holder: © 2022 by the authors.

Use and reproduction: