科学网—xbinbzy的留言板

IP: 210.72.153.* [12]李晓明 2017-8-4 10:00

肖老师您好！
看到您的博文，对我现在的数据处理非常有帮助，先感谢您这么耐心详细的分享。
我的问题是，我想对高通量测序的数据进行lefse分析，在自己的mac电脑上，怎么才能安装lefse呢？
您能指点一下mac下安装lefse的大概流程步骤吗？
pip 和mvtnorm 已经在终端安装好。
非常感谢！

我的回复(2017-8-6 21:30)：lefse主要是用python写的，mac上自带python，所以下载下来基本上可以使用，如果缺少一些包，用pip装上即可。下载路径https://bitbucket.org/nsegata/lefse/downloads/，相关的使用说明可参考https://bitbucket.org/biobakery/biobakery/wiki/lefse，

IP: 223.167.134.* [11]yanan2015 2017-4-8 09:37

多谢肖老师的指教！！！

IP: 223.166.135.* [10]yanan2015 2017-2-28 10:05

肖老师您好！
向您请教一个问题，如果想在哺乳动物细胞中检测是否有细菌菌体成分，比如，有无细菌的细胞膜成分？采取什么检测方法更好一些呢？我可以想到的有：肽聚糖检测，细菌16s检测，不知道是否可以？还有，如果检测有无细菌质粒和基因片段，应用什么方法呢？也是用PCR吗？我是临床医生，这方面是外行，望您在百忙中指教，多谢您！

我的回复(2017-3-14 09:35)：首先不好意思哈，您说的这个我研究不多。按照个人理解，要实现动物细胞中细菌的检测，可以利用16s DNA这个片段，它在人体中不存在，为此利用PCR或者sequencing可以去鉴定此信号，看看是否呈现阳性（这个过程在血液的细菌检测中尝试较多）。对于肽聚糖的检测，查了一下有些试剂盒，但是不确定是否适合您的样本，这个建议咨询一下服务商。

IP: 116.6.99.* [9]qinyouwen 2016-5-19 15:25

肖老师，近几年好多微生物大型研究计划，期待您做一个整理，漫谈这些年的微生物研究大计划。

我的回复(2016-5-24 13:58)：谢谢鼓励，我尽量！鉴于同行业内有很多人在做这方面的工作，个人就没太关注，以后多写写

IP: 219.134.32.* [8]qq1021573448 2016-4-15 15:54

您好，我在看您的metagenomeSeq的原理介绍，其中前面建立的MRexperiment矩阵(obj)为什么不能进行normalization呢

我的回复(2016-4-16 09:01)：可以进行，我写的这篇是将这个工具的说明文档进行整理的，它用到的数据在不同步骤不一样，个人也没来得及对数据做统一。

IP: 59.77.43.* [7]flowerflower 2016-4-15 14:20

好的，谢谢您哦。

我的回复(2016-4-15 14:37)：不客气，欢迎随时讨论

IP: 59.77.43.* [6]flowerflower 2016-4-13 10:33

谢谢您。In de novo mode, the database is constructed on the fly using a strategy similar to Perseus: sequences are considered in the order of decreasing abundance, and candidate parents must have abundance at least 2× that of the query sequence, assuming chimeras are less abundant than their parents because they undergo fewer rounds of amplification. Sequences not classified as chimeric are added to the reference database.您好，再请教您一下，denovo模式是基于序列的丰富度进行考虑，因为在PCR扩增过程中，嵌合体是后面才开始形成的，所以嵌合体序列的丰富度比母体序列（非嵌合体序列）要少。那么，uchime的这个denovo模式是说对序列的丰富度进行分析，非嵌合体序列的丰度至少要是嵌合体序列的2倍才可以，是这样的理解吗

我的回复(2016-4-15 09:44)：是的，原理是这样，具体工具的代码没仔细看，哈哈～

IP: 59.77.43.* [5]flowerflower 2016-4-11 16:24

谢谢您。我最近也在看uchime和slayer的文献，有些不懂的地方也想向您请教一下。那么我想问一下unchime的denovo模式和ref模式，用ref模式是不是更精准一些（不过耗时长）？我想问问您对denovo模式的理解。

我的回复(2016-4-12 22:28)：是的，ref模式会借鉴之前的数据建立模型。denovo相当于没有参考集进行分析。

IP: 59.77.43.* [4]flowerflower 2016-4-9 09:07

谢谢您哦，我在mothur里用chimera.unchime和chimera.slayer检测chimera情况，就是我想再问您一下，同样的数据量用chimera.unchime的denovo模式只要十几分钟，用ref.大概是一小时左右，而用slayer却要一个晚上到第二天早上还没有运行完。请问这是怎么回事呀？而且chimera.unchime的denovo模式和ref.模式检测到的chimera数量相差很大，一个是几百，一个是几十。chimera.uchime(fasta=,count=,dereplicate=t,minh=0.1)
chimera.uchime(fasta=,reference=silva.gold.align,dereplicate=t)
chimera.slayer(fasta=,count=,reference=self,dereplicate=t)这是我在mothur里运行的命令，给您看看

我的回复(2016-4-9 22:41)：chimera.slayer中的那个工具原理，我还没解读，解读完了再跟您讨论。chimera.unchime中的denovo和ref两种模式的原理不一样，结果的差别肯定存在。

IP: 59.77.43.* [3]flowerflower 2016-4-7 11:21

minh
The minh parameter - mininum score to report chimera. Default 0.3. Values from 0.1 to 5 might be reasonable. Lower values increase sensitivity but may report more false positives. If you decrease xn you may need to increase minh, and vice versa.
xn
The xn parameter - weight of a no vote. Default 8.0. Decreasing this weight to around 3 or 4 may give better performance on denoised data.
您好，这是mothur里chimera.uchime的两个参数，这两个参数的涵义可以给我讲一讲吗？谢谢您。

我的回复(2016-4-8 17:58)：个人理解如下，在uchime中，minh是设定的一个阈值，用于判断一段序列是否是chimera，这是因为uchime会针对每段序列计算一个score值。minh太低的时候，很多非chimeric sequence判断为chimera，造成假阳性。xn是uchime计算模型中的一个参数，它较大的时候，uchime计算的score会较小，如果minh不对应设置小一点，会影响uchime的效果。

IP: 59.77.43.* [2]flowerflower 2016-4-7 10:14

dereplicate
The dereplicate parameter can be used when checking for chimeras by group. If the dereplicate parameter is false, then if one group finds the sequence to be chimeric, then all groups find it to be chimeric, default=f. If you set dereplicate=t, and then run remove.seqs with dups=f you can remove only the redundant chimeric sequences.
我想请问您，mothur.uchime里的这个参数要怎么理解，默认值是F的意思是说：如果序列a在样品A是chimera，然后就认为所有样品（A/BC/D）里序列a都认为是chimera并把序列a移除吗？那么设dereplicate=t，又要怎么理解？我真是弄不清楚了，谢谢您了

我的回复(2016-4-8 17:40)：个人理解是这样，此参数为f时，当1条序列在某个group是chimeric时，则此序列在所有group中是chimeric，这样就直接过滤掉了。如果设置为t，则此序列在哪个group中是chimeric则在此group标记，其他的group不做标注。过滤时用remover.seqs，dups的参数为f是则过滤掉对应group中的chimeric。此序列在其他group中不进行处理。

IP: 59.77.43.* [1]flowerflower 2016-4-7 10:06

您好，我目前正在处理高通量的一批数据，分析chimera的情况，用不同的工具，在mothur里运行chimera.slayer和chimera.uchime，因为是第一次接触，在mothur网站上看那些命令与参数，有些地方不是很理解，可以请教一下您吗？您说的不同方法chimera的重叠情况，这个是要怎么做？

我的回复(2016-4-7 10:15)：哈哈～可以，欢迎交流。

xbinbzy的个人博客分享 http://blog.sciencenet.cn/u/xbinbzy

留言板

肖斌