Authors: Wenfa Li, Gongming Wang, Ke Li
Source: [J]. EURASIP Journal on Audio, Speech, and Music Processing (IF 0.63), 2017, Vol. 2017 (1)
Abstract: Audio signals are a type of high-dimensional data, and their clustering is critical. However, distance calculation failures, inefficient index trees, and cluster overlaps, derived from the equidistance, redundant attribute, and sparsity, respectively, seriously affect the cluster...
Authors: Gökay Dişken, Zekeriya Tüfekci, Ulus Çevik
Source: [J]. EURASIP Journal on Audio, Speech, and Music Processing (IF 0.63), 2017, Vol. 2017 (1)
Abstract: Robustness against background noise is a major research area for speech-related applications such as speech recognition and speaker recognition. One of the many solutions for this problem is to detect speech-dominant regions by using a voice activity detector (VAD). In this paper...
Authors: Javier Tejedor, Doroteo T. Toledano, Paula Lopez-Otero ...
Source: [J]. EURASIP Journal on Audio, Speech, and Music Processing (IF 0.63), 2017, Vol. 2017 (1)
Abstract: Within search-on-speech, Spoken Term Detection (STD) aims to retrieve data from a speech repository given a textual representation of a search term. This paper presents an international open evaluation for search-on-speech based on STD in Spanish and an analysis of the results. T...
Authors: Karim Dabbabi, Salah Hajji, Adnen Cherif
Source: [J]. EURASIP Journal on Audio, Speech, and Music Processing (IF 0.63), 2017, Vol. 2017 (1)
Abstract: The task of speaker diarization is to answer the question "who spoke when?" In this paper, we present different clustering approaches which consist of Evolutionary Computation Algorithms (ECAs) such as Genetic Algorithm (GA), Particle Swarm Optimization (PSO) algorithm, and Diffe...
Authors: Vataya Chunwijitra, Chai Wutiwiwatchai
Source: [J]. EURASIP Journal on Audio, Speech, and Music Processing (IF 0.63), 2017, Vol. 2017 (1)
Abstract: Large vocabulary continuous speech recognition (LVCSR) has naturally been demanded for transcribing daily conversations, while developing spoken text data to train LVCSR is costly and time-consuming. In this paper, we propose a classification-based method to automatically se...
Authors: Chung-Hsien Wu, Ming-Hsiang Su, Wei-Bin Liang
Source: [J]. EURASIP Journal on Audio, Speech, and Music Processing (IF 0.63), 2017, Vol. 2017 (1)
Abstract: With the exponential growth in computing power and progress in speech recognition technology, spoken dialog systems (SDSs), with which a user interacts through natural speech, have been widely used in human-computer interaction. However, error-prone automatic speech recognition (ASR...
Authors: Youna Ji, Yonghyun Baek, Young-cheol Park
Source: [J]. EURASIP Journal on Audio, Speech, and Music Processing (IF 0.63), 2017, Vol. 2017 (1)
Abstract: In speech enhancement, noise power spectral density (PSD) estimation plays a key role in determining appropriate de-noising gains. In this paper, we propose a robust noise PSD estimator for binaural speech enhancement in time-varying noise environments. First, it is shown that the...
Authors: Pablo Gimeno, Ignacio Viñals, Alfonso Ortega ...
Source: [J]. EURASIP Journal on Audio, Speech, and Music Processing (IF 0.63), 2020, Vol. 2020 (4), pp. 1-9
Abstract: This paper presents a new approach based on recurrent neural networks (RNN) to the multiclass audio segmentation task whose goal is to classify an audio signal as speech, music, noise or a combination of these. The proposed system is based on the use of bidirect...
Authors: Jing Wang, Jin Wang, Kai Qian ...
Source: [J]. EURASIP Journal on Audio, Speech, and Music Processing (IF 0.63), 2020, Vol. 2020 (3), pp. 1618-32
Abstract: Binaural sound source localization is an important and widely used perceptually based method, and it has been applied to machine learning studies by many researchers based on head-related transfer function (HRTF). Because the HRTF is closely related to human physiolog...
Authors: Luis M. T. Jesus, Maria Conceição Costa
Source: [J]. EURASIP Journal on Audio, Speech, and Music Processing (IF 0.63), 2020, Vol. 2020 (2), pp. 201-219
Abstract: Experimental data combining complementary measures based on the oral airflow signal is presented in this paper, exploring the view that European Portuguese voiced stops are produced in a similar fashion to Germanic languages. Four Portuguese speakers were record...
