Authors: Christian Morabito, Riccardo Aiese Cigliano, Eric Maréchal, Fabrice Rébeillé, Alberto Amato

Institutions:

  • Laboratoire de Physiologie Cellulaire Végétale, Université Grenoble Alpes, CEA, CNRS, INRAE, IRIG-LPCV, 38054 Grenoble Cedex 9, France
  • Sequentia Biotech, Barcelona, Spain

Publication: Data in Brief

Date: August, 2020

Full paper: Illumina and PacBio DNA sequencing data, de novo assembly and annotation of the genome of Aurantiochytrium limacinum strain CCAP_4062/1

Abstract:

The complete genome of the thraustochytrid Aurantiochytrium limacinum strain CCAP_4062/1 was sequenced using both Illumina Novaseq 6000 and third generation sequencing technology PacBio RSII in order to obtain trustworthy assembly and annotation. The reads from both platforms were combined at multiple levels in order to obtain a reliable assembly, then compared to the A. limacinum ATCC MYA1381™ reference genome. The final assembly was annotated with the help of strain CCAP_4062/1 RNAseq data. A. limacinum strain CCAP_4062/1 is an industrial strain used for the production of very long chain polyunsaturated fatty acids, like the docosahexaenoic acid that is an essential fatty acid synthesised only at very low pace in humans and vertebrates . Thraustochytrids in general and Aurantiochytrium more specifically, are used for carotenoid and squalene production as well. Beside their biotechnological interest, thraustochytrids play a crucial role in both inshore and oceanic basins ecosystems. Genome sequences will foster biotechnological as well as ecological studies.