Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Filters








3,283 Hits in 5.7 sec

Using Mel-Frequency Cepstral Coefficients in Missing Data Technique

Zhang Jun, Sam Kwong, Wei Gang, Qingyang Hong
2004 EURASIP Journal on Advances in Signal Processing  
In this paper, we propose a hybrid approach based on the marginalisation and the soft decision techniques that make use of the Mel-frequency cepstral coefficients (MFCCs) instead of filter bank coefficients  ...  Filter bank is the most common feature being employed in the research of the marginalisation approaches for robust speech recognition due to its simplicity in detecting the unreliable data in the frequency  ...  Applying missing data techniques to cepstral features is obviously attractive and natural.  ... 
doi:10.1155/s1110865704309030 fatcat:leuy5kdswjapjaqjrc7tbg5mne

Nondestructive Determination of Maturity of the Monthong Durian by Mel-Frequency Cepstral Coefficients (MFCCs) and Neural Network

Peerapol Khunarsa, Julalug Mahawan, Pisit Nakjai, Nerissa Onkhum
2016 Applied Mechanics and Materials  
After that the knocked-sound was analyzed and generated to Mel-frequency cepstral coefficients (MFCCs) that is used to train data for the classifier.  ...  The ability to select only good quality Durian without cutting or cleaving is useful because buyers will not waste money ordering undesirable Durian.This paper proposes a nondestructive technique to determine  ...  After that the knocked-sound was analyzed and generated to Mel-frequency cepstral coefficients (MFCCs) that is used to train data for the classifier.  ... 
doi:10.4028/www.scientific.net/amm.855.75 fatcat:jlfsjpbuobhrlim4lnkov2nxri

Wavelet Packet based Mel Frequency Cepstral Features for Text Independent Speaker Identification

Smriti Srivastava, JRP Gupta, Saurabh Bhardwaj, Krit Gupta, Abhishek Bhandari, Hitesh Bahl
2017 Figshare  
The present research proposes a paradigm which combines the Wavelet Packet Transform (WPT) with the distinguished Mel Frequency Cepstral Coefficients (MFCC) for extraction of speech feature vectors in  ...  The identification results of the MFCC fea- tures and the Wavelet Packet based Mel Frequency Cepstral (WP-MFC) Features are compared to validate the efficiency of the proposed paradigm.  ...  Mel Frequency Cepstral Coefficients: The advent of Mel Frequency Cepstral Coefficient (MFCC) technique for the task of feature extraction has over shadowed the existence of majority of its predecessor  ... 
doi:10.6084/m9.figshare.5327575.v1 fatcat:ifs575n4anhufhu3p26yusmxuq

Detection of AI-Synthesized Speech Using Cepstral Bispectral Statistics [article]

Arun Kumar Singh
2021 arXiv   pre-print
Also, Cepstral analysis revealed a durable power component in human speech that is missing for a synthesized speech.  ...  Higher-order statistics have less correlation for human speech in comparison to a synthesized speech.  ...  Analysis of Mel Frequency Cepstral Coefficient (MFCC) and visualization using the Mel spectrogram is described. Delta and Delta Square related to Mel Cepstrum is briefed. A.  ... 
arXiv:2009.01934v2 fatcat:ylp4wlyddng3xiefuwibzom3k4

Speech Enhancement Using Pitch Detection Approach For Noisy Environment [article]

Rashmi Makhijani, Urmila Shrawankar, V M Thakare
2013 arXiv   pre-print
Therefore, the proposed system can be used in an application designed for mathematical symbol recognition (especially symbols not available on the keyboard) in schools.  ...  As in some cases speaking may be more convenient than typing, even for rapid typists: many mathematical symbols are missing from the keyboard but can be easily spoken and recognized.  ...  Mel-Frequency Cepstral Coefficients (MFCC) Mel-frequency Cepstral coefficient is one of the most prevalent and popular method used in the field of voice feature extraction.  ... 
arXiv:1305.2352v1 fatcat:6k57i7f43fbylejiq72h2s2nx4

GMM based Language Identification using MFCC and SDC Features

Kshirod Sarmah, Utpal Bhattacharjee
2014 International Journal of Computer Applications  
In this paper, a baseline system for the LID system in multilingual environments has been developed using GMM as a classifier and MFCC combined with Shifted-Delta-Cepstral (SDC) as front end processing  ...  the combining features of MFCC with SDC over the baseline systems that using MFCC and SDC features in individual respectively.  ...  Mel-Frequency Cepstral Coefficients (MFCC) Feature Mel-Frequency Cepstral Coefficients (MFCC) is one of the most popular approaches of feature extraction technique in both speaker recognition and LID system  ... 
doi:10.5120/14840-3103 fatcat:pffq2mfqhrduzj2hfsuld3nlvy

Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition

Michael L. Seltzer, Alex Acero
2007 IEEE Transactions on Audio, Speech, and Language Processing  
One serious difficulty in the deployment of wideband speech recognition systems for new tasks is the expense in both time and cost of obtaining sufficient training data.  ...  In this paper, we propose a novel expectation-maximization (EM) algorithm in which wideband acoustic models are trained using a small amount of wideband speech and a larger amount of narrowband speech.  ...  FEATURE EXTRACTION FOR ASR In this paper, we assume that mel-frequency cepstral coefficients (MFCCs) are the features used for recognition.  ... 
doi:10.1109/tasl.2006.876774 fatcat:qjzkvcdc7bayxgzbpkkzytysqu

Throat to Acoustic Speech Mapping for Spectral Parameter Correction using Artificial Neural Network Approach

Subrata Kumer Paul, Rakhi Rani Paul, Masafumi Nishimura, Md. Ekramul Hamid
2020 Proceedings of International Exchange and Innovation Conference on Engineering & Sciences (IEICES)  
The proposed algorithm is tested using ATR503 Dataset. The simulation results show a noticeable performance in the field of speech communication in adverse environments.  ...  This paper exploits the nonlinear mapping property of Multi-Layered Feed Forward Neural Network (MLFFNN) for estimation of high-frequency components (4-8kHz) from the low-frequency band (0-4kHz) of TM  ...  The Mel Frequency Cepstral Coefficients (MFCCs) feature extraction method is performed on the speech signals to extract the cepstral coefficients are derived [6].  ... 
doi:10.5109/4102497 fatcat:gooawqhoeje7hhgyjf4c7zbg3e

Selective Gammatone Envelope Feature for Robust Sound Event Recognition

Yi Ren LENG, Huy Dat TRAN, Norihide KITAOKA, Haizhou LI
2012 IEICE transactions on information and systems  
Conventional features for Automatic Speech Recognition and Sound Event Recognition such as Mel-Frequency Cepstral Coefficients (MFCCs) have been shown to perform poorly in noisy conditions.  ...  In the experiments with Hidden Markov Model (HMM) recognizers, we shall show that our feature outperforms MFCCs significantly in four different noisy environments at various signal-to-noise ratios.  ...  Gammatone Cepstral Coefficients The gammatone filterbank output replaces the Mel filterbank output used in MFCCs.  ... 
doi:10.1587/transinf.e95.d.1229 fatcat:z5rexvl4djgt5ny2a3qrc4nrqq

合成單元與問題集之定義於隱藏式馬可夫模型中文歌聲合成系統之建立 (Synthesis Unit and Question Set Definition for Mandarin HMM-based Singing Voice Synthesis)

Ju-Yun Cheng, Yi-Chin Huang, Chung-Hsien Wu
2013 Taiwan Conference on Computational Linguistics and Speech Processing  
The spectrum parameter vectors consist of 49th-order STRAIGHT mel-cepstral coefficients including the zero-th coefficient, their delta, and delta-delta coefficients.  ...  In the baseline system, the most frequently used questions of F0 and mel-cepstral clustering trees are sub-syllables types, position of note and phrase level.  ...  Furthermore, we used three methods to refine our system, i.e. question set definition, pitch-shift pseudo data extension and vibrato post-processing.  ... 
dblp:conf/rocling/ChengHW13 fatcat:ggkusq4fbfhg5d7ly5a54zjilu

Optimization of Telephone Handset Distortion in Speaker Recognition by Discriminative Feature Design

Kadiri Kamoru Oluwatoyin
2014 IOSR Journal of Electronics and Communication Engineering  
This technique used mel-cepstral features, log spectrum and prosody based features with a nonlinear artificial neural network in designing speaker recognition features that minimize telephone handset distortion  ...  Effect of handset distortion was done to maximize speaker recognition performance specifically in the setting of telephone handset mismatch between training and testing as results on the 1998 NIST Speaker  ...  The mel-cepstral coefficients were computed by applying a sliding 25 ms window to the speech, resulting in a frame of speech every 10 ms.  ... 
doi:10.9790/2834-09523136 fatcat:4xymka7fzfhjhb5xg7vqz4fv5i

Distinctive features for normal and crackles respiratory sounds using cepstral coefficients

N. H. Mohd Johari, Noreha Abdul Malik, K. A. Sidek
2019 Bulletin of Electrical Engineering and Informatics  
In this paper, Linear Predictive Cepstral Coefficient (LPCC) and Mel-frequency Cepstral Coefficient (MFCC) are used to extract features from normal and crackles respiratory sounds.  ...  This is because in MFCC analysis, mel scale is used to wrap the frequency and it is approximately close to the human auditory perception [7] .  ...  The formula to convert the signal from frequency to Mel-Scale frequency is shown in (3) : ( ) ( ) (3) The formula to convert Mel-Scale frequency to frequency in Hz is shown in (4): ( ) ( ) (4) Discrete  ... 
doi:10.11591/eei.v8i3.1517 fatcat:h6ooads6sfcdfpvrraeic7dm7i

Advances in Missing Feature Techniques for Robust Large-Vocabulary Continuous Speech Recognition

Maarten Van Segbroeck, Hugo Van Hamme
2011 IEEE Transactions on Audio, Speech, and Language Processing  
The proposed system operates on either binary masks where hard decisions are made about the reliability of the data or on fuzzy masks which use a soft decision criterion.  ...  Channel compensation in speech recognition typically involves estimating an additive shift in the log-spectral or cepstral domain.  ...  , spectral peaks [8] and Mel-frequency cepstral coefficients (MFCCs) [9] , which later became the standard feature set in ASR.  ... 
doi:10.1109/tasl.2010.2045235 fatcat:yrhim5bjxneajablpdoy4h3bza

High Security and Capacity of Image Steganography for Hiding Human Speech Based on Spatial and Cepstral Domains

Yazen A. Khaleel
2020 ARO. The Scientific Journal of Koya University  
The suggested technique of image steganography is achieved using both spatial and cepstral domains, where the Mel-frequency cepstral coefficients (MFCCs) are adopted, as very efficient features of the  ...  Second is to improve the data security by hiding the secret data (MFCCs features) anywhere in the host image rather than directly using the least significant bits substitution of the cover image.  ...  cepstral coefficients TABLE II Nbits II Used in Both Direct and MFCCs Techniques MFCCs: Mel-frequency cepstral coefficients, Nbits: Number of bits Fig. 13.  ... 
doi:10.14500/aro.10670 fatcat:yin2i5wiqbax5a6ajsnwq6b6fi

Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems

Md. Jahangir Alam, Patrick Kenny, Douglas O'Shaughnessy
2012 Cognitive Computation  
In this paper we investigate low-variance multi-taper spectrum estimation methods to compute the mel-frequency cepstral coefficient (MFCC) features for robust speech and speaker recognition systems.  ...  In a speaker verification task, compared to the Hamming window technique, the sinusoidal weighted cepstrum estimator (SWCE), Multi-peak, and Thomson multi-taper techniques provide a relative improvement  ...  Currently, the most widely used speech features both in speaker and speech recognition systems are the Mel-frequency cepstral coefficient (MFCC) [1] and perceptual linear predictive (PLP) features [  ... 
doi:10.1007/s12559-012-9197-5 fatcat:6fg6w4dmbbhj7dnk6l2kect5ny
« Previous Showing results 1 — 15 out of 3,283 results