http://sourceforge.net/projects/tgicl/ A sample LSF script file jobfile to cluster a EST dataset using TGICL on lewis may include the following lines: tgicl fastdb where fastdb the multi-fasta file containing all the sequences to be clustered. For more ...
RNA-seq的测序数据要向NCBI提交,这里简单总结一下。原始的测序数据 (reads) 数据要提交到 SRA . RNA-seq的拼接结果应该提交到TSA库, TSA 全称Transcriptome Shotgun Assembly Sequence Database, TSA is an archive of computationally assembled sequences from primary data such as ESTs, traces and Ne ...
http://oldspace.biovip.com/500/spacelist-blog-itemtypeid-650.html identity和similarity有什么区别,发现自己对这几个概念也不甚了了,于是做了点功课,如下。 第一反应 去查了 BLAST的glossary Identity The extent to which two (nucleotide or amino acid) sequences are invariant. Similarity The extent ...