Citrus sinensis genome v1.0 (JGI)
Overview
Note: The following text comes from phytozome.org: Genome Size / Loci Sequencing Method Assembly Method Identification of Repeats EST Alignments Assembly metrics
Downloads
All assembly and annotation files are available for download by selecting the desired data type in the right-hand "Resources" side bar. Each data type page will provide a description of the available files and links do download. Alternatively, you can browse all available files on the CGD data repository. Assembly
The following text comes from phytozome.org: Genomic sequence was generated using a whole genome shotgun approach with 2Gb sequence coming from GS FLX Titanium; 2.4 Gb from FLX Standard; 440 Mb from Sanger paired-end libraries; 2.0 Gb from 454 paired-end libraries. The 25.5 million 454 reads and 623k Sanger sequence reads were generated by a collaborative effort by 454 Life Sciences, University of Florida and JGI. The assembly was generated by Brian Desany at 454 Life Sciences using the Newbler assembler. Please note: if you download and use the JGI whole genome assembly and annotation please abide by the requirements for this data as specified on phytozome.org's Citrus sinensis download page. Downloads
Gene Predictions
The following text comes from phytozome.org: The current gene set (orange1.1) integrates 3.8 million ESTs with homology and ab initio-based gene predictions (see below). 25,376 protein-coding loci have been predicted, each with a primary transcript. An additional 20,771 alternative transcripts have been predicted, generating a total of 46,147 transcripts. 16,318 primary transcripts have EST support over at least 50% of their length. Two-fifths of the primary transcripts (10,813) have EST support over 100% of their length. Please note: if you download and use the JGI whole genome assembly and annotation please abide by the requirements for this data as specified on phytozome.org's Citrus sinensis download page. Downloads
Protein Homology
Protein homology found here was performed by the Main Bioinformatics Lab at WSU. Proteins from the C. clementina v1.0 assembly were mapped against proteins from other genomes and databases using blastp with an e-value cutoff of 1e-6. Only the best 10 matches were kept. The available files are in Excel 2007 format. Downloads
Repeats
The following text comes from phytozome.org: A de novo repeat library was made by running RepeatModeler (Arian Smit, Robert Hubley) on the genome to produce a library of repeat sequences. Sequences with Pfam domains associated with non-TE functions were removed from the library of repeat sequences and the library was then used to mask 31% of the genome with RepeatMasker. Please note: if you download and use the JGI whole genome assembly and annotation please abide by the requirements for this data as specified on phytozome.org's Citrus clementina download page. Downloads
Links
|