A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Filters
Multimodal redundancy across handwriting and speech during computer mediated human-human interactions
2007
Proceedings of the SIGCHI conference on Human factors in computing systems - CHI '07
speech to dynamically learn new vocabulary. ...
We show that redundantly presented words, compared to other words used during a presentation or meeting, tend to be topic specific and thus are likely to be out-of-vocabulary. ...
Photo Annotation (PA) using Digital Paper and Pen In [6] we reported on some aspects of a pilot study in which photos printed on digital paper were discussed and simultaneously annotated with a digital ...
doi:10.1145/1240624.1240778
dblp:conf/chi/KaiserBEC07
fatcat:slcjyespnzgmbjyvmzunr5fage
An interface for mutual disambiguation of recognition errors in a multimodal navigational assistant
2006
Multimedia Systems
Potentially, these multimedia capabilities can be used for multimediaintensive tasks, but two things stand in the way. ...
Today's mobile devices have multimedia capabilities; cell phones have cameras, displays, sound output, and (soon) speech recognition. ...
The ARIA system is a software agent that proactively looks for opportunities for image annotation and retrieval in everyday work, like email messaging [12] . ...
doi:10.1007/s00530-006-0052-y
fatcat:tegnigy6gnck3kpx5rwxttyaiy
How are gestures used by politicians? A multimodal co-gesture analysis
2021
Italian Journal of Computational Linguistics
ii) Since the corpus used presents an annotation of "speech constants" (Voghera 2001), is the Lexical Retrieval hypothesis confirmed or are gestures used in correlation with other and different constants ...
Finally, we analyse semantic patterns of gesture-speech relationship. ...
What are the semantic patterns of gesture-speech relationship? A summary of the hand movement annotations in the corpus is reported in Table 4 and 3 . ...
doi:10.4000/ijcol.827
fatcat:7dl6fe6bjjcqzhzd4p47c6roxi
Automatic Summarization
2012
Foundations and Trends in Information Retrieval
Spoken content retrieval (SCR) requires the combination of audio and speech processing technologies with methods from information retrieval (IR). ...
This growth has motivated extensive research on techniques and technologies that facilitate reliable indexing and retrieval. ...
The approach is evaluated on Mandarin Chinese speech annotations of a photo collection. Finally, morphemes have also been used as indexing units. ...
doi:10.1561/1500000020
fatcat:o424mjxnp5abbexhjsobtom2ry
Issues on Retrieval of Sound Effects in Large Collaborative Databases
2008
Zenodo
Specifically, aspects concerning the annotation of such large databases by means of collaborative tagging, and others dealing with the study of alternative ways to retrieve audio content, such as sound ...
and the scarcity of the existing annotations. ...
Figure 5 . 1 : 51 Automatic speech recognition system applied to extract information from non-speech sounds at the phoneme level.consequetive phonemes or syllables when building the Finite State Network ...
doi:10.5281/zenodo.3744728
fatcat:taqgakq5z5dp5n6d4rpjj7nrte
Automatic Assessment Of Singing Voice Pronunciation: A Case Study With Jingju Music
2018
Zenodo
Chinese tonal languages and the strict conventions in oral transmission adopted by jingju singing training pose unique challenges that have not been addressed by the current MIR research, which motivates us ...
music retrieval. ...
Their note and expression segmentation are achieved using a photo resistor and four rings of surfacemounted light-emitting diodes (SMD LEDs). ...
doi:10.5281/zenodo.1490343
fatcat:f3mrhstkdff6ppmdadeasfuo7m
Co-verbal gestures among speakers with aphasia: Influence of aphasia severity, linguistic and semantic skills, and hemiplegia on gesture employment in oral discourse
2015
Journal of Communication Disorders
As for the non-content carrying gestures, beats were used primarily for reinforcing speech prosody or guiding speech flow, while non-identifiable gestures were associated with assisting lexical retrieval ...
The use of co-verbal gestures is common in human communication and has been reported to assist word retrieval and to facilitate verbal interactions. ...
scholars, who conducted studies that compared gestural profiles between individuals with and without aphasia, and concluded that speakers with impaired oral ability secondary to language deficits tended to use ...
doi:10.1016/j.jcomdis.2015.06.007
pmid:26186256
pmcid:PMC4530578
fatcat:3u26fh4mprf2vgmxdrn25trjxm
Retrieving ancient Maya glyphs with Shape Context
2009
2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops
Our first task is the automatic retrieval of Maya syllabic glyphs using the Shape Context descriptor. ...
We propose an improvement in the cost function used to compute similarity between shapes making it more restrictive and precise. ...
One of the most representative works in this domain is [9] , in which a system for retrieval of paintings and photos of art objects, using content and metadata, was developed by a multidisciplinary team ...
doi:10.1109/iccvw.2009.5457594
dblp:conf/iccvw/Roman-RangelPOG09
fatcat:r63b7shpqjb3lkhztr6zrugnpq
Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing
[article]
2019
arXiv
pre-print
Furthermore, speech detection experiments reporting AUCs around 69% encourage us for further exploration about the feasibility of PPG for speech processing tasks. ...
In addition, we evaluate whether speech/non-speech events may be inferred from PPG signal, where speech might translate in fluctuations into the pulse signal. ...
They performed feature extraction using the Fast Fourier Transform (FFT), and various machine learning algorithms such as SVM or Naïve Bayes. ...
arXiv:1911.04808v1
fatcat:v6i3rp7ngnhofd4ez5ln7ezfme
NTU corpus of Formosan languages: A state-of-the-art report
2008
Corpus Linguistics and Linguistic Theory
emphasized") prominent syllable which sometimes does not coincide with the stressed pattern of a word. ...
as a focused (or "emphasized") prominent syllable which sometimes does not coincide with the stressed pattern of a word. ...
doi:10.1515/cllt.2008.012
fatcat:duz5ebcp5jantcuplkqiwkkamm
Morpho-syntactic variations in English and Spanish clipped words: a contrastive study
2018
Zeitschrift für Romanische Philologie
This suggests that clipped or elliptical units are not only necessarily dependent on intrinsic morpho-grammatical and phonological traits, but they are also characterized by regularities and universal patterns ...
The other type of variation concerns syllable stress, and the corresponding use of a 'tilde'. ...
This is particularly useful in marginal speech and slang language: s'up? < what's up?, gotcha < I got you, kinda < kind of, outta < out of (Dalzell/Victor 2013). ...
doi:10.1515/zrp-2018-0051
fatcat:hlwlx54dwre25lzxe5bkkaru6i
Analyzing Ancient Maya Glyph Collections with Contextual Shape Descriptors
2010
International Journal of Computer Vision
Second, we present an objective evaluation of the performance of two state-ofthe-art shape-based contextual descriptors (Shape Context and Generalized Shape Context) in retrieval tasks, using two datasets ...
Our work is guided by realistic needs of archaeologists and scholars who critically need support for search and retrieval tasks in large Maya imagery collections. ...
One of the most representative works in this domain is Lewis et al. (2004) , in which a system for retrieval of paintings and photos of art objects, using content and metadata, was developed by a multidisciplinary ...
doi:10.1007/s11263-010-0387-x
fatcat:okcausjtu5bofpaj3jrymnqhnu
Early Language Experience in a Tseltal Mayan Village
2019
Child Development
Most directed speech came in the mornings, and interactional peaks contained nearly four times the baseline rate of directed speech. ...
Children were infrequently directly spoken to, with most directed speech coming from adults, and no increase with age. ...
Data Selection and Annotation Although the Tseltal corpus contains more than 500 hr of raw photo-linked audio, very little of it is useful without adding manual annotation. ...
doi:10.1111/cdev.13349
pmid:31891183
fatcat:ggove6l55za4lfbruulpax3crm
Error Variability in Apraxia of Speech: A Matter of Controversy
2012
Journal of Speech, Language and Hearing Research
Purpose: Error variability has traditionally been considered a hallmark of apraxia of speech (AOS). ...
However, in some of the current AOS literature, relatively invariable error patterns are claimed as a mandatory criterion for a diagnosis of AOS. ...
Acknowledgments This article originated from a presentation given at the "Apraxia of Speech: Mechanisms and Symptoms" satellite workshop during the 6th International Conference on Speech Motor Control ...
doi:10.1044/1092-4388(2012/11-0319)
pmid:23033448
fatcat:cdw3co4at5bnrovo6fs72lw2re
Searching the past
2011
Proceedings of the 19th ACM international conference on Multimedia - MM '11
The first one is the introduction and analysis of a new dataset of 3400+ Maya hieroglyphs, whose compilation involved manual search, annotation and segmentation by experts. ...
This dataset presents several challenges for visual description and automatic retrieval as it is rich in complex visual details. ...
In [13] , a system to retrieve paintings and photos of art objects using content and metadata is presented. Description and retrieval of Chinese characters has been broadly studied. ...
doi:10.1145/2072298.2072321
dblp:conf/mm/Roman-RangelGOG11
fatcat:2j5yw5mstndwtb2jqt2t5wgo7q
« Previous
Showing results 1 — 15 out of 422 results