Annotation of specialized corpora using a comprehensive entity and relation scheme.

Specifically, the annotation scheme is comprehensive, and compatible between tasks, especially for the medical relations. ... This study presents an engineering framework of medical entity recognition, relation extraction and attribute extraction, which are unified in annotation, modeling and evaluation. ... We highly appreciate the constructive comments on the annotation scheme by Xiaofen Zhao. We sincerely thank Xin Wei for his efforts in EMR data cleaning and parsing. ...

arXiv:2203.03823v1 fatcat:gwc3634ahbev7be76azgc5vbna

On the one hand, different datasets have different annotation schemes, which hinders the comparison between competitors across different corpora. ... While most research works adopt traditional metrics such as precision, recall, and F_1, a few others prefer temporal awareness, a metric tailored to be more comprehensive on the evaluation of temporal ... ACKNOWLEDGMENTS We would like to acknowledge the financial support received during the development of this project. ...

doi:10.1145/3539618.3591892 fatcat:5heg6iemsvadhomu6nm3pmxwbe

On the one hand, different datasets have different annotation schemes, thus hindering the comparison between competitors across different corpora. ... While most research works adopt traditional metrics such as precision, recall, and F_1, a few others prefer temporal awareness – a metric tailored to be more comprehensive on the evaluation of temporal ... Some used the TimeML annotation scheme to create new corpora, such as AQUAINT Graff [2002] and the Platinum corpus UzZaman et al. ...

arXiv:2301.04643v1 fatcat:5s47vuzxifapzb2ptmss5etl5u

In this Chapter we review the currently available corpora to study anaphoric interpretation, and the tools that can be used to create new ones. ... Acknowledgements This work was supported in part by a PhD studentship offered by Cogito / Expert Systems (Kepa Rodriguez), in part by the LIVEMEMORIES project (Poesio), and in part by the SENSEI project ... In some of these schemes (including ACE, GNOME, and ARRAU), special attributes are used to mark the semantic function of the markable. ...

doi:10.1007/978-3-662-47909-4_4 dblp:series/tanlp/PoesioPRRV16 fatcat:4dnly5kz6zar7bfboduulhu66a

of an entity, and the inclusion of epistemic markers into NPs. ... The study summarizes the comprehensive annotation schema developed for this task and the preliminary research of the referential choice features based on the corpus. ... We took the ready-made scheme used for corpora annotation in NLP tasks. ...

doi:10.2139/ssrn.2885814 fatcat:g3mfagwcnzfp5hrxhciacg7fai

Six annotators contributed to the corpus annotation, using a comprehensive annotation scheme covering 21 entities, 11 attributes and 37 relations. ... All annotators trained on a small, common portion of the corpus before proceeding independently. An automatic tool was used to produce entity and attribute pre-annotations. ... The authors specially wish to acknowledge Dr. Griffon for his critical appraisal of the annotation scheme and its application to the corpus. ...

doi:10.1007/s10579-017-9382-y fatcat:gujff6rkqveklm3snqigp55364

This paper offers a survey of how they are currently used in different fields of the discipline, with particular emphasis on anaphora and coreference resolution, automatic summarisation and term extraction ... The annotation process requires the markup in text of items that have a special meaning for some purpose, using an annotation scheme. ...

doaj:13213d8008b948a5959e49ccf3d045f8 fatcat:iyq6iauxevf4bpx3lalo575f3q

The corpus is annotated with named entities, event relationship and syntactic dependencies, and freely available at http://www.biominingbu.org/hPPcorpus/hPP_corpus.xml. ... Text mining researchers apply a variety of algorithms to extract such information. A standard annotated corpus is necessary to evaluate the performance of the text mining algorithms. ... (iv) to confirm the event relationship between the related entities using a set of annotation rules. ...

doi:10.23880/jes-16000140 fatcat:nkz7treybnaqpmkgfqn2666nqi

We describe an annotation scheme capturing named entities and their relationships along with a dependency analysis of sentence syntax. ... The development and evaluation of such methods requires annotated domain corpora. ... We are also grateful to Meelis Kolmer and Mauno Vihinen for consultation in creating the bioentity and relationship annotation schemes. ...

doi:10.1186/1471-2105-8-50 pmid:17291334 pmcid:PMC1808065 fatcat:hbjyjjsu4zhu7j67jn2l3obhya

KLUE is a collection of 8 Korean natural language understanding (NLU) tasks, including Topic Classification, Semantic Textual Similarity, Natural Language Inference, Named Entity Recognition, Relation Extraction ... We make a few interesting observations from the preliminary experiments using the proposed KLUE benchmark suite, already demonstrating the usefulness of this new benchmark suite. ... annotators in MRC dataset, and Sangah Park for careful consideration of data construction in DP, NER, and RE. ...

arXiv:2105.09680v4 fatcat:ejkvhzfldzhcbkhw4eakm7s634

Citation

Sungjoon Park, Jihyung Moon, Sungdong Kim, Won Ik Cho, Jiyoon Han, Jangwon Park, Chisung Song, Junseong Kim, Yongsook Song, Taehwan Oh, Joohong Lee, Juhyun Oh, Sungwon Lyu, Younghoon Jeong, Inkwon Lee, Sangwoo Seo, Dongjun Lee, Hyunwoo Kim, Myeonghwa Lee, Seongbo Jang, Seungwon Do, Sunkyoung Kim, Kyungtae Lim, Jongwon Lee, Kyumin Park, Jamin Shin, Seonghyun Kim, Lucy Park, Alice Oh, Jung-Woo Ha, Kyunghyun Cho. "KLUE: Korean Language Understanding Evaluation." arXiv (2021)

In this paper, we present PKDE4J, a comprehensive text-mining system that integrates dictionary-based entity extraction and rule-based relation extraction in a highly flexible and extensible framework. ... We demonstrate its competitive performance by evaluating it on many corpora and found that it surpasses existing systems with average F-measures of 85% for entity extraction and 81% for relation extraction ... Acknowledgments This work was supported by the Bio-Synergy Research Project (NRF-2013M3A9C4078138) of the Ministry of Science, ICT, and Future Planning through the National Research Foundation. ...

doi:10.1016/j.jbi.2015.08.008 pmid:26277115 fatcat:sfgmji7zu5efbbo6xyarry7h4e

Thus, NEREL-BIO comprises the following specific features: annotation of nested named entities, it can be used as a benchmark for cross-domain (NEREL -> NEREL-BIO) and cross-language (English -> Russian ... This paper describes NEREL-BIO -- an annotation scheme and corpus of PubMed abstracts in Russian and a smaller number of abstracts in English. ... The authors thank all annotators for their contribution and Zulfat Miftahutdinov for his work on models trained on the MedMentions corpus. ...

arXiv:2210.11913v1 fatcat:n5qlhr5cknfu7gqkln54lso6ju

The final fine-grained annotated entity corpus consists of 1104 entities and 67,799 tokens. ... We iteratively updated the guidelines until the inter-annotator agreement (IAA) exceeded a Cohen's kappa value of 0.9. Comprehensive annotations were performed while keeping the IAA value above 0.9. ... The authors are grateful to the editor's and the reviewers' comments that help us to improve the quality and merit of this paper. ...

doi:10.1186/s12911-020-1079-2 pmid:32252745 fatcat:dcktu2lgejcjnliwrivzxlpgd4

In this study, we evaluated FL on 2 biomedical NLP tasks encompassing 8 corpora using 6 LMs. ... models using zero-/one-shot learning and offered lightning inference speed. ... Three experts were used to annotate a set of 100 abstracts for each of the drug-disorder, drug-target, and target-disorder relations. ...

arXiv:2307.11254v2 fatcat:nm6ljllrcjhszn4dyxx5jsfrri

Previous biomedical knowledge extraction methods simply considered limited entity types and relations by using a task-specific training set, which is insufficient for large-scale BKGs development and downstream ... Specifically, it allows us to adopt entity augmented inputs to establish the interaction between named entity recognition and relation extraction. ... All authors read and approved the final manuscript. ...

doi:10.1186/s12859-022-05096-w pmid:36536280 pmcid:PMC9761970 fatcat:jakzw44pujgx7himt5zmyeqlui

A Unified Framework of Medical Information Annotation and Extraction for Chinese Clinical Text [article]

tieval: An Evaluation Framework for Temporal Information Extraction Systems

tieval: An Evaluation Framework for Temporal Information Extraction Systems [article]

Annotated Corpora and Annotation Tools [chapter]

Coreference Annotation in the Russian Clinical Pear Stories Corpus: Annotation Features and Preliminary Results

A French clinical corpus with comprehensive semantic annotations: development of the Medical Entity and Relation LIMSI annOtated Text corpus (MERLOT)

Corpora for computational linguistics

hPP Corpus: A Tagged Biomedical Corpus for Automatic Extraction of Human Protein Phosphorylation for Understanding Cellular Functions

BioInfer: a corpus for information extraction in the biomedical domain

KLUE: Korean Language Understanding Evaluation [article]

PKDE4J: Entity and relation extraction for public knowledge discovery

NEREL-BIO: A Dataset of Biomedical Abstracts Annotated with Nested Named Entities [article]

Constructing fine-grained entity recognition corpora based on clinical records of traditional Chinese medicine

An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing [article]

JCBIE: a joint continual learning neural network for biomedical information extraction
