Edit disfluency detection and correction using a cleanup language model and an alignment model.

Dowding and E. Shriberg (1992). Integrating multiple knowledge sources for detection and correction of repairs in human-computer dialog. Proc. of ACL. ... Word-based linguistic module consists of a cleanup language model and an alignment model is used for verifying the position of the IP and therefore correcting the edit disfluency. ... For handling language-related phenomena in edit disfluency, a cleanup language model characterizing the structure of the cleanup sentences and an alignment model for aligning words between deletable region ...

doi:10.5772/47843 fatcat:iwk4m3v44vcp7aujvnhkdzw4yi

We create detection models using logistic regression and conditional random field models, exploring a variety of text-based features that consider the structure of clinical notes and exploit the medical ... Experimental results on a large corpus of practitioner-edited clinical notes show that 67% of sentence-level edits and 45% of word-level edits can be detected with a false detection rate of 15%. ... The data used was provided through the support of grant number R21HS023631 from the Agency for Healthcare Research and Quality (AHRQ). ...

pmid:29854187 pmcid:PMC5977669 fatcat:6cemlu6pu5f77c4qbxlwcqsdoe

This paper presents an approach to detecting and correcting edit disfluency based on conditional random fields with variable-length features. ... The experimental results show that the proposed model outperforms other methods and efficiently detects and corrects edit disfluency in spontaneous speech. ... Yeh and Wu integrated the alignment model and language model to build an edit disfluency cleanup model [14]. ...

doi:10.21437/interspeech.2007-582 fatcat:4xcirpcdhfauhcxealn44kh3ou

We find that combining lexical, syntactic, and language model-related features with the output of a state-of-the-art disfluency identification system improves overall word-level identification of these ... This paper presents a conditional random field-based approach for identifying speaker-produced disfluencies (i.e. if and where they occur) in spontaneous speech transcripts. ... Any opinions, findings, conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the supporting agency. ...

doi:10.3115/1609067.1609095 fatcat:rsw6nysquzgtvoyp4nglgm3u7q

This work investigates a number of knowledge sources for disfluency detection, including acoustic-prosodic features, a language model (LM) to account for repetition patterns, a part-of-speech (POS) based ... Detection and correction of disfluencies can make automatic speech recognition transcripts more readable for human readers, and can aid downstream processing by machine. ... Acknowledgments The authors gratefully acknowledge Luciana Ferrer for her help with the prosodic feature extraction, and Mary Harper and Barbara Peskin for helpful comments and discussion. ...

doi:10.21437/eurospeech.2003-332 fatcat:jalknwbwxfca5oyjree46vhp2e

This paper provides an introduction to recent work in statistical parsing and its applications for conversational speech, with particular emphasis on the relationship between parsing and detecting speech ... While historically parsing and repair detection have been studied independently, we present a line of research which has spanned the boundary between the two and demonstrated the efficacy of this synergistic ... In Section 4, we describe repairs in more detail and present an approach for detecting them using our parser-based language model. ...

doi:10.1109/icassp.2005.1416465 dblp:conf/icassp/LeaseCJ05 fatcat:yfmudgilnjchjlrxnunxeg2vj4

There is active work on reconstructing this errorful data into a clean and fluent transcript by identifying and removing these simple errors. ... While speaking spontaneously, speakers often make errors such as self-correction or false starts which interfere with the successful application of natural language processing techniques like summarization ... Any opinions, findings, conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the supporting agency. ...

doi:10.3115/1699571.1699612 fatcat:dhemjmdo2fd7fohhg73we22fla

between spoken and reconstructed language. ... Our investigation of naturally-occurring spontaneous speaker errors aligned to corrected text with manual semanticosyntactic analysis yields new insight into the syntactic and structural semantic differences ... Any opinions, findings, conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the supporting agency. ...

doi:10.3115/1690219.1690251 fatcat:rsgxiwcugva7rjsv27gaw67pdm

While it takes a great deal of time and resources to collect a traditional text-to-speech corpus for a given language, we may instead be able to make use of various sources of "found" data which may be ... Typically, building a high-quality voice requires collecting dozens of hours of speech from a single professional speaker in an anechoic chamber with a high-quality microphone. ... Acknowledgments This thesis would not have been possible without the mentorship and guidance of my advisor, ...

doi:10.7916/d8-vdzp-j870 fatcat:bw4g7ng5azbzxixayxlqyxtnky

I start by looking at a Robot Assisted Language Learning scenario, in which the robot attempts to encourage the use of spatial language in a quiz-based game. ... I present the following thesis: A robot that is able to use dynamic description methods – using vague initial language with the ability to further repair for generating spatial referring expressions as ... Acknowledgements This work was supported by the EU Horizon 2020 L2TOR project (grant 688014) and the EU H2020 Marie Sklodowska-Curie Actions project DoRoThy (grant 657227). ...

doi:10.24382/1232 fatcat:bwumaknjsbe6jdfpw7niuv4kri

A couple of thousand languages and dialects, at present divided into 17 large families and 38 small ones, with several hundred unclassified single languages, are on record. ... In one small portion of the area, in Mexico just north of the Isthmus of Tehuantepec, one finds a diversity of linguistic type hard to match on an entire continent in the Old World. ... This article is an output of a research project implemented as part of the Basic Research Programme at the National Research University Higher School of Economics (HSE University). ...

doi:10.5167/uzh-203436 fatcat:6vbdg7jlkzhrhof66v2zvtyejm

I am particularly thankful for the interesting discussions and for the valuable insights you provided. ... Web Ontology Language-OWL The Web Ontology Language (OWL) is a markup language designed to enable the exchange of data using an ontology. ... As a consequence, a language model based speech recognizer is only able to recognize exactly the sentences available in the language model. ...

doi:10.22028/d291-25919 fatcat:w276wugztnbvhe742q44swlvnq

Robust Speech Recognition for Adverse Environments [chapter]

Automatically Detecting Likely Edits in Clinical Notes Created Using Automatic Speech Recognition

Disfluency correction of spontaneous speech using conditional random fields with variable-length features

Reconstructing false start errors in spontaneous speech text

Automatic disfluency identification in conversational speech using multiple knowledge sources

Parsing and Its Applications for Conversational Speech

Integrating sentence- and word-level error identification for disfluency correction

What lies beneath

Text-to-Speech Synthesis Using Found Data for Low-Resource Languages

Dynamic Generation of Spatial Referring Expressions for Social Robots [article]

Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas

Context-based multimodal interpretation : an integrated approach to multimodal fusion and discourse processing [article]