ABSTRACT
Biomedical ontologies are commonly used to structure and organize formal knowledge about biological and biomedical concepts. Terms structured within ontologies are usually associated to biomedical entities in a process referred to as annotation. The Human Phenotype Ontology (HPO) is a standardized, controlled vocabulary that contains phenotypic information about genes or product genes. Due to the recent introduction of the HPO, problem to check annotation consistency of HPO annotations, has not been formally investigated differently from other ontologies, such as Gene Ontology (GO). In a previous work we introduced a framework to learn association rules from Gene Ontology demonstrating its usefulness to improve annotation consistency. Here we extend those results in HPO and we present a novel framework to learn association rules from HPO. The framework is based on a multithreaded tool able to learn rules in an efficient way. Results demonstrate its usefulness, by extracting rules that connect two or more terms of HPO, currently under investigation.
- G. Agapito, M. Cannataro, P. H. Guzzi, and M. Milano. Using go-war for mining cross-ontology weighted association rules. Computer methods and programs in biomedicine, 120(2):113--122, 2015. Google ScholarDigital Library
- G. Agapito, M. Cannataro, P. H. Guzzi, and M. Milano. Using GO-WAR for mining cross-ontology weighted association rules. Computer Methods and Programs in Biomedicine, 120(2):113--122, 2015. Google ScholarDigital Library
- G. Agapito, M. Milano, P. H. Guzzi, and M. Cannataro. Improving annotation quality in gene ontology by mining cross-ontology weighted association rules. In Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on, pages 1--8. IEEE, 2014.Google ScholarCross Ref
- R. Agrawal, T. Imieliński, and A. Swami. Mining association rules between sets of items in large databases. SIGMOD Rec., 22(2):207--216, June 1993. Google ScholarDigital Library
- G. Alterovitz, M. Xiang, D. P. Hill, J. Lomax, J. Liu, M. Cherkassky, J. Dreyfuss, C. Mungall, M. A. Harris, M. E. Dolan, et al. Ontology engineering. Nature Biotechnology, 28(2):128--130, 2010.Google ScholarCross Ref
- Y.-R. Cho, M. Mina, Y. Lu, N. Kwon, and P. H. Guzzi. M-finder: Uncovering functionally associated proteins from interactome data integrated with go annotations. Proteome Sci, 11(Suppl 1):S3, 2013.Google ScholarCross Ref
- D. Faria, A. Schlicker, C. Pesquita, H. Bastos, A. E. N. Ferreira, M. Albrecht, and A. O. FalcÃčo. Mining go annotations for improving annotation consistency. PLoS ONE, 7(7):e40519, 07 2012.Google ScholarCross Ref
- T. Groza, S. Kohler, D. Moldenhauer, N. Vasilevsky, G. Baynam, T. Zemojtel, L. M. Schriml, W. A. Kibbe, P. N. Schofield, T. Beck, D. Vasant, A. J. Brookes, A. Zankl, N. L. Washington, C. J. Mungall, S. E. Lewis, M. A. Haendel, H. Parkinson, and P. N. Robinson. The human phenotype ontology: Semantic unification of common and rare disease. The American Journal of Human Genetics, 97(1):111--124, 2015.Google ScholarCross Ref
- P. Guzzi, M. Mina, C. Guerra, and M. Cannataro. Semantic similarity analysis of protein data: assessment with biological features and issues. Briefings in bioinformatics, 13(5):569--585, 2012.Google ScholarCross Ref
- P. H. Guzzi, M. Milano, and M. Cannataro. Mining association rules from gene ontology and protein networks: Promises and challenges. In D. Abramson, M. Lees, V. V. Krzhizhanovskaya, J. Dongarra, and P. M. A. Sloot, editors, Proceedings of the International Conference on Computational Science, ICCS 2014, Cairns, Queensland, Australia, 10--12 June, 2014, volume 29 of Procedia Computer Science, pages 1970--1980. Elsevier, 2014.Google ScholarCross Ref
- J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In W. Chen, J. Naughton, and P. A. Bernstein, editors, 2000 ACM SIGMOD Intl. Conference on Management of Data, pages 1--12. ACM Press, May 2000. Google ScholarDigital Library
- J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. SIGMOD Rec., 29(2):1--12, May 2000. Google ScholarDigital Library
- S. Harispe, D. Sánchez, S. Ranwez, S. Janaqi, and J. Montmain. A framework for unifying ontology-based semantic similarity measures: A study in the biomedical domain. Journal of biomedical informatics, 2013. Google ScholarDigital Library
- P. Manda, F. McCarthy, and S. M. Bridges. Interestingness measures and strategies for mining multi-ontology multi-level association rules from gene ontology annotations for the discovery of new go relationships. Journal of biomedical informatics, 46(5):849--856, 2013. Google ScholarDigital Library
- M. Milano, G. Agapito, P. H. Guzzi, and M. Cannataro. Biases in information content measurement of gene ontology terms. In H. J. Zheng, W. Dubitzky, X. Hu, J. Hao, D. P. Berrar, K. Cho, Y. Wang, and D. Gilbert, editors, 2014 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2014, Belfast, United Kingdom, November 2--5, 2014, pages 9--16. IEEE, 2014.Google ScholarCross Ref
- S. Naulaerts, P. Meysman, W. Bittremieux, T. N. Vu, W. Vanden Berghe, B. Goethals, and K. Laukens. A primer to frequent itemset mining for bioinformatics. Briefings in Bioinformatics, 2013.Google Scholar
- P. N. Robinson, S. Kohler, S. Bauer, D. Seelow, D. Horn, and S. Mundlos. The human phenotype ontology: a tool for annotating and analyzing human hereditary disease. The American Journal of Human Genetics, 83(5):610--615, 2008.Google ScholarCross Ref
- D. Sánchez, M. Batet, and D. Isern. Ontology-based information content computation. Know.-Based Syst., 24:297--303, Mar. 2011. Google ScholarDigital Library
- W. Wang, J. Yang, and P. S. Yu. Efficient mining of weighted association rules (war). In Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 270--274. ACM, 2000. Google ScholarDigital Library
Index Terms
- Efficient learning of association rules from human phenotype ontology
Recommendations
Learning Weighted Association Rules in Human Phenotype Ontology
Computational Intelligence Methods for Bioinformatics and BiostatisticsAbstractHuman Phenotype Ontology (HPO) provides information about medically relevant phenotypes and the association of disease and phenotype concepts to HPO terms through annotations. The specificity of each HPO terms is estimated by its Information ...
Evaluation of cross-ontology association rules weighted by term specificity
The use of an ontology is a prevailing trend for management and analysis of biological big data. Consequently, we have encountered strong demands on developing algorithms for accurate analysis of ontology structures and annotated data. We can discover the ...
Parallel Learning of Weighted Association Rules in Human Phenotype Ontology
Euro-Par 2019: Parallel Processing WorkshopsAbstractThe Human Phenotype Ontology (HPO) is a standardized vocabulary of terms related to diseases. The importance and the specificity of HPO terms are estimated employing the Information Content (IC). Thus, the analysis of annotated data is a critical ...
Comments