12 Hits in 6.1 sec

Robust Speech Recognition for Adverse Environments [chapter]

Chung-Hsien Wu, Chao-Hong Liu
2012 Modern Speech Recognition Approaches with Case Studies  
Dowding and E. Shriberg (1992). Integrating multiple knowledge sources for detection and correction of repairs in human-computer dialog. Proc. of ACL.  ...  The word-based linguistic module consists of a cleanup language model and an alignment model, and is used for verifying the position of the IP and therefore correcting the edit disfluency.  ...  For handling language-related phenomena in edit disfluency, a cleanup language model characterizing the structure of the cleanup sentences and an alignment model for aligning words between deletable region  ... 
doi:10.5772/47843 fatcat:iwk4m3v44vcp7aujvnhkdzw4yi

Automatically Detecting Likely Edits in Clinical Notes Created Using Automatic Speech Recognition

Kevin Lybarger, Mari Ostendorf, Meliha Yetisgen
2018 AMIA Annual Symposium Proceedings  
We create detection models using logistic regression and conditional random field models, exploring a variety of text-based features that consider the structure of clinical notes and exploit the medical  ...  Experimental results on a large corpus of practitioner-edited clinical notes show that 67% of sentence-level edits and 45% of word-level edits can be detected with a false detection rate of 15%.  ...  The data used was provided through the support of grant number R21HS023631 from the Agency for Healthcare Research and Quality (AHRQ).  ... 
pmid:29854187 pmcid:PMC5977669 fatcat:6cemlu6pu5f77c4qbxlwcqsdoe

Disfluency correction of spontaneous speech using conditional random fields with variable-length features

Jui-Feng Yeh, Chung-Hsien Wu, Wei-Yen Wu
2007 Interspeech 2007   unpublished
This paper presents an approach to detecting and correcting edit disfluency based on conditional random fields with variable-length features.  ...  The experimental results show that the proposed model outperforms other methods and efficiently detects and corrects edit disfluency in spontaneous speech.  ...  Yeh and Wu integrated the alignment model and language model to build an edit disfluency cleanup model [14].  ... 
doi:10.21437/interspeech.2007-582 fatcat:4xcirpcdhfauhcxealn44kh3ou

Reconstructing false start errors in spontaneous speech text

Erin Fitzgerald, Keith Hall, Frederick Jelinek
2009 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics on - EACL '09   unpublished
We find that combining lexical, syntactic, and language model-related features with the output of a state-of-the-art disfluency identification system improves overall word-level identification of these  ...  This paper presents a conditional random field-based approach for identifying speaker-produced disfluencies (i.e. if and where they occur) in spontaneous speech transcripts.  ...  Any opinions, findings, conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the supporting agency.  ... 
doi:10.3115/1609067.1609095 fatcat:rsw6nysquzgtvoyp4nglgm3u7q

Automatic disfluency identification in conversational speech using multiple knowledge sources

Yang Liu, Elizabeth Shriberg, Andreas Stolcke
2003 8th European Conference on Speech Communication and Technology (Eurospeech 2003)   unpublished
This work investigates a number of knowledge sources for disfluency detection, including acoustic-prosodic features, a language model (LM) to account for repetition patterns, a part-of-speech (POS) based  ...  Detection and correction of disfluencies can make automatic speech recognition transcripts more readable for human readers, and can aid downstream processing by machine.  ...  Acknowledgments The authors gratefully acknowledge Luciana Ferrer for her help with the prosodic feature extraction, and Mary Harper and Barbara Peskin for helpful comments and discussion.  ... 
doi:10.21437/eurospeech.2003-332 fatcat:jalknwbwxfca5oyjree46vhp2e

Parsing and Its Applications for Conversational Speech

M. Lease, E. Charniak, M. Johnson
Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.  
This paper provides an introduction to recent work in statistical parsing and its applications for conversational speech, with particular emphasis on the relationship between parsing and detecting speech  ...  While historically parsing and repair detection have been studied independently, we present a line of research which has spanned the boundary between the two and demonstrated the efficacy of this synergistic  ...  In Section 4, we describe repairs in more detail and present an approach for detecting them using our parser-based language model.  ... 
doi:10.1109/icassp.2005.1416465 dblp:conf/icassp/LeaseCJ05 fatcat:yfmudgilnjchjlrxnunxeg2vj4

Integrating sentence- and word-level error identification for disfluency correction

Erin Fitzgerald, Frederick Jelinek, Keith Hall
2009 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing Volume 2 - EMNLP '09   unpublished
There is active work on reconstructing this errorful data into a clean and fluent transcript by identifying and removing these simple errors.  ...  While speaking spontaneously, speakers often make errors such as self-correction or false starts which interfere with the successful application of natural language processing techniques like summarization  ...  Any opinions, findings, conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the supporting agency.  ... 
doi:10.3115/1699571.1699612 fatcat:dhemjmdo2fd7fohhg73we22fla

What lies beneath

Erin Fitzgerald, Frederick Jelinek, Robert Frank
2009 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - ACL-IJCNLP '09   unpublished
between spoken and reconstructed language.  ...  Our investigation of naturally-occurring spontaneous speaker errors aligned to corrected text with manual semanticosyntactic analysis yields new insight into the syntactic and structural semantic differences  ...  Any opinions, findings, conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the supporting agency.  ... 
doi:10.3115/1690219.1690251 fatcat:rsgxiwcugva7rjsv27gaw67pdm

Text-to-Speech Synthesis Using Found Data for Low-Resource Languages

Erica Lindsay Cooper
2019
While it takes a great deal of time and resources to collect a traditional text-to-speech corpus for a given language, we may instead be able to make use of various sources of "found" data which may be  ...  Typically, building a high-quality voice requires collecting dozens of hours of speech from a single professional speaker in an anechoic chamber with a high-quality microphone.  ...  Acknowledgments This thesis would not have been possible without the mentorship and guidance of my advisor,  ... 
doi:10.7916/d8-vdzp-j870 fatcat:bw4g7ng5azbzxixayxlqyxtnky

Dynamic Generation of Spatial Referring Expressions for Social Robots [article]

Christopher Wallbridge, Plymouth University, Faculty of Science and Engineering
2021
I start by looking at a Robot Assisted Language Learning scenario, in which the robot attempts to encourage the use of spatial language in a quiz-based game.  ...  I present the following thesis: A robot that is able to use dynamic description methods – using vague initial language with the ability to further repair for generating spatial referring expressions as  ...  Acknowledgements This work was supported by the EU Horizon 2020 L2TOR project (grant 688014) and the EU H2020 Marie Sklodowska-Curie Actions project DoRoThy (grant 657227).  ... 
doi:10.24382/1232 fatcat:bwumaknjsbe6jdfpw7niuv4kri

Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas

S.N.
2021
A couple of thousand languages and dialects, at present divided into 17 large families and 38 small ones, with several hundred unclassified single languages, are on record.  ...  In one small portion of the area, in Mexico just north of the Isthmus of Tehuantepec, one finds a diversity of linguistic type hard to match on an entire continent in the Old World.  ...  This article is an output of a research project implemented as part of the Basic Research Programme at the National Research University Higher School of Economics (HSE University).  ... 
doi:10.5167/uzh-203436 fatcat:6vbdg7jlkzhrhof66v2zvtyejm

Context-based multimodal interpretation : an integrated approach to multimodal fusion and discourse processing [article]

Norbert Pfleger, Universität des Saarlandes
2008
I am particularly thankful for the interesting discussions and for the valuable insights you provided.  ...  Web Ontology Language-OWL The Web Ontology Language (OWL) is a markup language designed to enable the exchange of data using an ontology.  ...  As a consequence, a language model based speech recognizer is only able to recognize exactly the sentences available in the language model.  ... 
doi:10.22028/d291-25919 fatcat:w276wugztnbvhe742q44swlvnq