Functional similarity of gene products can be estimated by controlled biological vocabularies, such as Gene Ontology (GO).
Four methods have been presented to determine the semantic similarity of two GO terms based on the annotation statistics of their common ancestor terms (Resnik (Philip, 1999), Jiang (Jiang and Conrath, 1997), Lin (Lin, 1998) and Schlicker (Schlicker et al., 2006)). Wang (Wang et al., 2007) proposed a new method to measure the similarity based on the graph structure of GO. Each of these methods has its own advantages and weaknesses.
Recently, Yu and colleagues developed a R package named GOSemSim, which can be used to compute semantic similarity among GO terms, sets of GO terms, gene products, and gene clusters, In addition, GOSemSim provides all five methods mentioned above.
[1] Guangchuang Yu, Fei Li, Yide Qin, Xiaochen Bo, Yibo Wu, and Shengqi Wang. Gosemsim: an r package for measuring semantic similarity among go terms and gene products. Bioinformatics, 26:976–978, 2010.
[2] Guangchuang Yu, Chuan-Le Xiao, Xiaochen Bo, Chun-Hua Lu, Yide Qin, Sheng Zhan, and Qing-Yu He. A new
method for measuring functional similarity of micrornas. Journal of Integrated OMICS, 1(1):49–54, 2011.
[3] Guangchuang Yu, Le-Gen Wang, Yanyan Han, and Qing-Yu He. clusterprofiler: an r package for comparing
biological themes among gene clusters. OMICS: A Journal of Integrative Biology, 16, 2012.