LOCUS CM000365 949497 bp DNA linear CON 14-JUL-2016 DEFINITION Drosophila simulans chromosome 4, whole genome shotgun sequence. ACCESSION CM000365 VERSION CM000365.1 DBLINK BioProject: PRJNA18237 BioSample: SAMN02953618 KEYWORDS WGS. SOURCE Drosophila simulans ORGANISM Drosophila simulans Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. REFERENCE 1 (bases 1 to 949497) AUTHORS Clark,A.G., Eisen,M.B., et al. CONSRTM Drosophila 12 Genomes Consortium TITLE Evolution of genes and genomes on the Drosophila phylogeny JOURNAL Nature 450 (7167), 203-218 (2007) PUBMED 17994087 REFERENCE 2 (bases 1 to 949497) AUTHORS Wilson,R.K. TITLE Direct Submission JOURNAL Submitted (16-AUG-2006) Genome Sequencing Center, Washington University School of Medicine, 4444 Forest Park Parkway, St. Louis, MO 63108, USA REFERENCE 3 (bases 1 to 949497) CONSRTM FlyBase TITLE Direct Submission JOURNAL Submitted (09-JUN-2008) FlyBase, Harvard University, Biological Laboratories, 16 Divinity Ave., Cambridge, MA 02138, USA COMMENT This is the CAF1 assembly of the Drosophila simulans genome. It represents a mosaic of several different D. simulans lines. The assembly process began with a 4x WGS assembly of the the D. simulans white501 (w501) line, AAGH00000000. The w501 contigs were initially anchored, ordered and oriented by alignment with the D. melanogaster genome. The assembly was then examined for places where the w501 assembly suggested inversions with respect to the D. melanogaster assembly. One major inversion was found, confirming the already documented inversion found by Lemeunier and Ashburner (1976). Six other D. simulans lines (C167.4, MD106TS, MD199S, New Caledonia 48S, SIM4, and SIM6) were assembled with approximately 1x coverage (WGS projects AASR00000000-AASW00000000, respectively). The 4x WGS assembly of the D. simulans w501 genome was used as a scaffold, and the contigs and unplaced reads from the 1x assemblies of the other individual D. simulans lines were used to cover gaps in the w501 assembly where possible. Thus the resulting assembly is a mosaic containing the w501 contigs as the primary scaffolding, with contigs and unplaced reads from the other lines filling gaps in the w501 assembly. Total size is 142,405,747 bp including gaps and 127,241,461 bp excluding gaps. For more information about the D. simulans assembly and statistics, see the WUSTL Genome Sequencing Center Drosophila simulans web page from the home page, http://genome.wustl.edu/home.cgi. The gene annotation is based on FlyBase Release 1.3, which contains some corrections of the original annotation published by the Drosophila 12 Genomes Consortium.