Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Filters








411 Hits in 6.2 sec

A gene–phenotype relationship extraction pipeline from the biomedical literature using a representation learning approach

Wenhui Xing, Junsheng Qi, Xiaohui Yuan, Lin Li, Xiaoyu Zhang, Yuhua Fu, Shengwu Xiong, Lun Hu, Jing Peng
2018 Bioinformatics  
Combined with abbreviation revision and sentence template extraction, we improved the unsupervised word-embedding-to-sentence-embedding cascaded approach as representation learning to recognize the various  ...  Results: We have proposed a pipeline for extracting phenotype, gene and their relations from biomedical literature.  ...  Due to the lack of training of positive and negative samples, we use the Negative Class Label Enhanced (NCLE) algorithm (Xing et al., 2017) to label negative samples and train the sentence-embedding  ... 
doi:10.1093/bioinformatics/bty263 pmid:29950017 pmcid:PMC6022650 fatcat:43lehrpjpfdlblvcbsuvrxebty

Learning for Biomedical Information Extraction: Methodological Review of Recent Advances [article]

Feifan Liu, Jinying Chen, Abhyuday Jagannatha, Hong Yu
2016 arXiv   pre-print
Unlike existing reviews covering a holistic view on BioIE, this review focuses on mainly recent advances in learning based approaches, by systematically summarizing them into different aspects of methodological  ...  Biomedical information extraction (BioIE) is important to many applications, including clinical decision support, integrative biology, and pharmacovigilance, and therefore it has been an active research  ...  To overcome cascading errors in a multi-step pipeline framework, joint models (e.g. a Markov Logic Network(MLN) based approach [111] ) have shown improved performance.  ... 
arXiv:1606.07993v1 fatcat:7d5om7zxxzhoviiriasrfwg3xi

The extraction of complex relationships and their conversion to biological expression language (BEL) overview of the BioCreative VI (2017) BEL track

2019 Database: The Journal of Biological Databases and Curation  
provided sentences could be seen.  ...  Unfortunately, this knowledge is mostly embedded in the literature in such a way that it is unavailable for automated data analysis procedures.  ...  Also our special thanks to Jens Dörpinghaus who kindly helped to create the curation interface. Furthermore, we would like to thank all participants of the BioCreative BEL track.  ... 
doi:10.1093/database/baz084 pmid:31603193 pmcid:PMC6787548 fatcat:zdc2oacdhfgrdn5gjvkq4jyyzm

Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning [article]

Xiaoxin He, Xavier Bresson, Thomas Laurent, Adam Perold, Yann LeCun, Bryan Hooi
2024 arXiv   pre-print
Lastly, we believe the versatility of the proposed method extends beyond TAGs and holds the potential to enhance other tasks involving graph-text data.  ...  With the advent of powerful large language models (LLMs) such as GPT or Llama2, which demonstrate an ability to reason and to utilize general knowledge, there is a growing need for techniques which combine  ...  One approach is to use a cascaded architecture, where the node features are first encoded independently by the LMs, and then fed into GNN models.  ... 
arXiv:2305.19523v5 fatcat:7vl5s4udabhrbkfictsp5lxtaa

CBAG: Conditional biomedical abstract generation

Justin Sybrandt, Ilya Safro, Friedhelm Schwenker
2021 PLoS ONE  
We propose a conditional language model following the transformer architecture. This model uses the "encoder stack" to encode concepts that a user wishes to discuss in the generated text.  ...  from the capacity to select the specific set of concepts that underpin a generated biomedical text.  ...  tags, and entity class labels associated with each textual element.  ... 
doi:10.1371/journal.pone.0253905 pmid:34228754 pmcid:PMC8259990 fatcat:xig3mofcaza6tpnda2xircvrxu

De-identification of clinical free text using natural language processing: A systematic review of current approaches [article]

Aleksandar Kovačević, Bojana Bašaragin, Nikola Milošević, Goran Nenadić
2023 arXiv   pre-print
Results: A total of 2125 publications were identified for the title and abstract screening. 69 studies were found to be relevant.  ...  Machine learning (37 studies) and hybrid (26 studies) approaches are predominant, while six studies relied only on rules. Majority of the approaches were trained and evaluated on public corpora.  ...  While some approaches have been trying to extract sentences that do not contain identifiable information by measuring frequencies of sentences and terms [6], [9], or by creating representations of clinical  ... 
arXiv:2312.03736v1 fatcat:gd5oci3z7nbd3bpmvmxt6unbry

PhenoTagger: A Hybrid Method for Phenotype Concept Recognition using Human Phenotype Ontology

Ling Luo, Shankai Yan, Po-Ting Lai, Daniel Veltri, Andrew Oler, Sandhya Xirasagar, Rajarshi Ghosh, Morgan Similuk, Peter N Robinson, Zhiyong Lu
2021 Bioinformatics  
Next, a cutting-edge deep learning model is trained to classify each candidate phrase (n-gram from input sentence) into a corresponding concept label.  ...  Automatic phenotype concept recognition from unstructured text remains a challenging task in biomedical text mining research.  ...  Thanks to Chih-Hsuan Wei for his help with Web APIs. Conflict of Interest: none declared.  ... 
doi:10.1093/bioinformatics/btab019 pmid:33471061 pmcid:PMC11025364 fatcat:sxba74g5azgfno2lz2yrwnm5eu

Vision Transformers in Medical Computer Vision – A Contemplative Retrospection [article]

Arshi Parvaiz, Muhammad Anwaar Khalid, Rukhsana Zafar, Huma Ameer, Muhammad Ali, Muhammad Moazam Fraz
2022 arXiv   pre-print
These are immensely utilized by a plenty of researchers to perform new as well as former experiments.  ...  Recent escalation in the field of computer vision underpins a huddle of algorithms with the magnificent potential to unravel the information contained within images.  ...  Finally, the learnable class embedding is fed to a softmax layer for Emphysema classification.  ... 
arXiv:2203.15269v1 fatcat:wecjpoikbvfz5cygytqpktoxdq

Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey [article]

Chen Ling, Xujiang Zhao, Jiaying Lu, Chengyuan Deng, Can Zheng, Junxiang Wang, Tanmoy Chowdhury, Yun Li, Hejie Cui, Xuchao Zhang, Tianjiao Zhao, Amit Panalkar (+12 others)
2024 arXiv   pre-print
Specifically, to solve these hurdles, there has been a notable increase in research and practices conducted in recent years on the domain specialization of LLMs.  ...  This emerging field of study, with its substantial potential for impact, necessitates a comprehensive and systematic review to better summarize and guide ongoing work in this area.  ...  Verbalizer are only used for classification task where a mapping from class label to label words is required, which can be one-one mapping, trainable tokens [43] , or enhanced with extra knowledge [53  ... 
arXiv:2305.18703v7 fatcat:6vnz3xnvdfb7pkburxb3i6js5y

BioCreative VI Precision Medicine Track system performance is constrained by entity recognition and variations in corpus characteristics

Qingyu Chen, Nagesh C Panyam, Aparna Elangovan, Karin Verspoor
2018 Database: The Journal of Biological Databases and Curation  
context for capturing individual genotype variation related to disease.We present the READ-BioMed team's approach to identifying PPIm-related publications and to extracting specific PPIm information from  ...  of representative training data and the cascading impact of tool limitations in a modular system.  ...  In the context of biocuration, entity embeddings (embeddings over genes and mutations, for example) could be more effective than raw word embeddings and may have the potential to improve the performance  ... 
doi:10.1093/database/bay122 pmid:30576491 pmcid:PMC6301335 fatcat:gysdyhqrmjdevhzxcomcbu5slu

Syntactic Scope Resolution in Uncertainty Analysis

Lilja Øvrelid, Erik Velldal, Stephan Oepen
2010 International Conference on Computational Linguistics  
We show how the use of syntactic structure enables the resolution of hedge scope in a hybrid, two-stage approach to uncertainty analysis.  ...  In the first stage, a Maximum Entropy classifier, combining surface-oriented and syntactic features, identifies cue words.  ...  Acknowledgements We are grateful to the organizers of the 2010 CoNLL Shared Task and creators of the BioScope resource; first, for engaging in these kinds of community service, and second for many in-depth  ... 
dblp:conf/coling/OvrelidVO10 fatcat:4qcdfujfgrcend6e2ruzz2tg7q

Text Mining for Drug–Drug Interaction [chapter]

Heng-Yi Wu, Chien-Wei Chiang, Lang Li
2014 Msphere  
A novel hierarchical three-level annotation scheme was proposed and implemented to tag key terms, drug interaction sentences, and drug interaction pairs.  ...  Using our pharmacokinetics ontology, a PK corpus was constructed to present four classes of pharmacokinetics abstracts: in vivo pharmacokinetics studies, in vivo pharmacogenetic studies, in vivo drug interaction  ...  Vague DDI Sentence Problem In most DDI extraction approaches, CDDIS are considered to be candidates for the analysis of DDI extraction.  ... 
doi:10.1007/978-1-4939-0709-0_4 pmid:24788261 pmcid:PMC4636907 fatcat:jdxhh37g2zer3n4gikt34ewkry

Neuro-Symbolic Learning: Principles and Applications in Ophthalmology [article]

Muhammad Hassan, Haifei Guan, Aikaterini Melliou, Yuqi Wang, Qianhui Sun, Sen Zeng, Wen Liang, Yiwei Zhang, Ziheng Zhang, Qiuyue Hu, Yang Liu, Shunkai Shi (+15 others)
2022 arXiv   pre-print
Attempts have been made to overcome the challenges in neural network computing by representing and embedding domain knowledge in terms of symbolic representations.  ...  This review presents a comprehensive survey on the state-of-the-art NeSyL approaches, their principles, advances in machine and deep learning algorithms, applications such as opthalmology, and most importantly  ...  For the natural language processing, the embedding can be carried out using word, sentence, and structural levels. The AM can be employed both at global and local levels.  ... 
arXiv:2208.00374v1 fatcat:pktmnomj3bbwpjyj7lmu37rl7i

A Comprehensive Review on Synergy of Multi-Modal Data and AI Technologies in Medical Diagnosis

Xi Xu, Jianqiang Li, Zhichao Zhu, Linna Zhao, Huina Wang, Changwei Song, Yining Chen, Qing Zhao, Jijiang Yang, Yan Pei
2024 Bioengineering  
Traditionally, the amalgamation of diverse medical data modalities (e.g., image, text, speech, genetic data, physiological signals) is imperative to facilitate a comprehensive disease analysis, a topic  ...  Hence, there exists a pressing need to synthesize the latest strides in multi-modal data and AI technologies in the realm of medical diagnosis.  ...  [105] explored the benefits of pre-training BERT on a cancer-specific dataset, which aimed to enhance the model's ability to extract breast cancer phenotypes from pathology reports and clinical records  ... 
doi:10.3390/bioengineering11030219 pmid:38534493 pmcid:PMC10967767 fatcat:tdrqch5tinhhbmnk7bkf4ilsv4

Information Extraction from the Text Data on Traditional Chinese Medicine: A Review on Tasks, Challenges, and Methods from 2010 to 2021

Tingting Zhang, Zonghai Huang, Yaqiang Wang, Chuanbiao Wen, Yangzhi Peng, Ying Ye, Xuezhong Zhou
2022 Evidence-Based Complementary and Alternative Medicine  
Developing a method of information extraction (IE) from these sources to generate a cohesive data set would be a great contribution to the medical field.  ...  In the future, IE work should be promoted by extracting more existing entities and relations, constructing gold standard data sets, and exploring IE methods based on a small amount of labeled data.  ...  [49] proposed a novel cascade-type Chinese medication entity recognition approach, which integrated the sentence category classifier from an SVM and the CRF-based medication entity recognition.  ... 
doi:10.1155/2022/1679589 pmid:35600940 pmcid:PMC9122692 fatcat:r7sj7sdoubhwfhcscj227neoiy
« Previous Showing results 1 — 15 out of 411 results