Citrus sinensis genome v1.0 (JGI)

Gene Predictions

The following text comes from phytozome.org:

The current gene set (orange1.1) integrates 3.8 million ESTs with homology and ab initio-based gene predictions (see below). 25,376 protein-coding loci have been predicted, each with a primary transcript. An additional 20,771 alternative transcripts have been predicted, generating a total of 46,147 transcripts. 16,318 primary transcripts have EST support over at least 50% of their length. Two-fifths of the primary transcripts (10,813) have EST support over 100% of their length.

Please note: if you download and use the JGI whole genome assembly and annotation please abide by the requirements for this data as specified on phytozome.org's Citrus sinensis download page.  

Downloads

Coding sequences--CDS (FASTA file, 11Mb compressed) Csinensis_v1.0_cds.fa.gz
Transcript sequences--mRNA (FASTA file,  15Mb compressed) Csinensis_v1.0_transcript.fa.gz
Protein sequences (FASTA file, 7Mb compressed) Csinensis_v1.0_peptide.fa.gz
Gene models (GFF3 file, 4Mb compressed) Csinensis_v1.0_gene.gff3.gz
Alternate genes (GFF3 file, 3.5 Mb compressed) Csinensis_v1.0_alt_gene.gff3.gz
RepeatsMasker repeats (GFF3 file, 6.3 Mb compressed) Csinensis_v1.0_repeats.gff3.gz