Steps for Clustering an Organism's Sequences

  1. Selection of the organism's sequences (ESTs and cDNAs) from EMBL
  2. Annotation of vector sequences, repeats, and low-quality regions
  3. Sequence clipping
  4. Clustering of mRNAs and related ESTs
  5. All-against-all similarity search among the remaining ESTs using QUASAR
  6. Clustering of remaining ESTs
  7. Assembly of all clusters, generation of Staden projects
  8. Extraction of consensus sequences for each cluster (contig)
  9. Selection of a representative clone for each cluster
  10. Picking of the non-redundant clone-set
  11. Annotation of the resulting clusters (contigs)
  12. Generation of the presentation for the Web
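
Steps 5 and 6 can be sketched as follows. This is only a minimal illustration, assuming single-linkage (transitive) clustering, which the outline itself does not specify. The function name `cluster_by_similarity` and the input format (pairs of sequence IDs that passed a similarity threshold) are hypothetical; QUASAR's actual output format is not modelled here.

```python
from collections import defaultdict


def cluster_by_similarity(seq_ids, hits):
    """Group sequence IDs into clusters using union-find.

    seq_ids: iterable of sequence identifiers (hypothetical EST IDs).
    hits:    iterable of (id_a, id_b) pairs judged similar by an
             all-against-all search; both IDs must occur in seq_ids.
    Assumption: two sequences share a cluster if they are connected
    through any chain of hits (single linkage).
    """
    parent = {s: s for s in seq_ids}

    def find(x):
        # Follow parent links to the root, halving the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in hits:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two clusters

    clusters = defaultdict(list)
    for s in seq_ids:
        clusters[find(s)].append(s)

    # Sort members and clusters so the result is deterministic.
    return sorted(sorted(members) for members in clusters.values())
```

Sequences that receive no hit remain as singleton clusters, so every input sequence still appears in the result passed on to the assembly step.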

Consensus of human sequences based on UniGene


Criteria for selection of 'optimal' clones