EST assembly
EST assembly and clustering were performed with the EGassembler
pipeline after sequence cleaning, repeat masking, vector trimming
and organelle masking. The CAP3 parameters were -p 95 -o 40
(overlap percent identity cutoff of 95 and overlap length cutoff of 40 bases).
A unigene set of 9 598 contigs and 26 161 singletons was obtained.
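As a rough illustration of the assembly settings above, the sketch below builds the equivalent command line for a standalone CAP3 run. It is a minimal sketch, not the EGassembler pipeline itself: the input file name `tobacco_ests.fasta` is hypothetical, the `cap3` executable is assumed to be on the PATH, and the cleaning and masking steps are not shown. The last line simply adds the contig and singleton counts reported above.

```python
import shlex


def cap3_command(fasta_path, identity=95, overlap=40):
    """Build the argument list for a CAP3 run.

    -p sets the overlap percent identity cutoff,
    -o sets the overlap length cutoff (in bases).
    """
    return ["cap3", fasta_path, "-p", str(identity), "-o", str(overlap)]


# Hypothetical input file name, for illustration only.
cmd = cap3_command("tobacco_ests.fasta")
print(shlex.join(cmd))

# Size of the unigene set: contigs plus singletons.
unigene_total = 9598 + 26161
print(unigene_total)
```

The argument list can be passed to `subprocess.run(cmd, check=True)` once CAP3 is installed locally.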
Download:
- Nicotiana tabacum ESTs available in GenBank (05/02/06) in FASTA format (13.4 MB),
- Details of the contigs obtained with these ESTs in txt format (564 KB),
- Unigene set in FASTA format (7.30 MB).
BlastX and GO annotations
BlastX hits (expectation value threshold of 1e-25) and GO
annotations were obtained with Blast2GO.
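To make the expectation value threshold concrete, here is a minimal sketch of filtering BlastX hits at 1e-25. The page does not describe how Blast2GO was configured, so this assumes hits exported in the standard 12-column BLAST tabular format, where the e-value is the 11th column. The sequence and hit identifiers in the example are invented.

```python
EVALUE_CUTOFF = 1e-25


def filter_hits(lines, cutoff=EVALUE_CUTOFF):
    """Keep (query, subject, evalue) for hits with evalue <= cutoff.

    Assumes tab-separated BLAST tabular lines with the e-value
    in the 11th column (index 10). Blank and comment lines are skipped.
    """
    kept = []
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        evalue = float(fields[10])
        if evalue <= cutoff:
            kept.append((fields[0], fields[1], evalue))
    return kept


# Invented example records, for illustration only.
example = [
    "Contig1\tP12345\t88.0\t200\t24\t0\t1\t600\t1\t200\t3e-60\t230",
    "Contig2\tQ67890\t45.0\t90\t49\t1\t10\t279\t5\t94\t2e-08\t55.1",
]

for query, subject, evalue in filter_hits(example):
    print(query, subject, evalue)
```

Only the first record passes, since 2e-08 is larger than the 1e-25 threshold.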
Download:
- Annotations in MS Excel format (1.12 MB).
Pfam domains
The unigenes were translated using ORFpredictor,
and the resulting amino acid sequences were then submitted as queries
to the Pfam database. The default settings of Pfam 20.0 were used.
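The translation step can be pictured with the sketch below, which returns the longest methionine-initiated open reading frame across the six reading frames of a nucleotide sequence. This is a simplified stand-in and not ORFpredictor's own algorithm, which is not described here; the function names and the test sequence are hypothetical, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an uppercase DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons; unknown codons (e.g. with N) become X."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def longest_orf(dna):
    """Longest stop-free peptide starting with M over all six frames."""
    dna = dna.upper()
    best = ""
    for strand in (dna, reverse_complement(dna)):
        for frame in range(3):
            for segment in translate(strand[frame:]).split("*"):
                start = segment.find("M")
                if start != -1 and len(segment) - start > len(best):
                    best = segment[start:]
    return best


# Invented test sequence, for illustration only.
print(longest_orf("CCATGGCTAAATGACC"))
```

Because ESTs are often partial, a real predictor also has to handle ORFs that lack a start or stop codon; this sketch ignores that case.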
Download:
- ORFs in FASTA format (2.38 MB),
- Pfam domains in MS Excel format (488 KB).
Archives must be decompressed with WinRAR.
Additional analyses of the whole set
of Nicotiana tabacum sequences released in GenBank are
available on TIGR, SGN and PlantGDB,
along with the annotations for the latter unigene
set.
A transcription factor analysis is available on TOBFAC.