


default search action
IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 27
Volume 27, Number 1, January 2019
- Dilek Hakkani-Tür:
Inaugural Editorial Innovations in an Era of Ubiquitous Audio, Speech, and Language Processing. 5-6 - Feng Bao
, Waleed H. Abdulla
:
A New Ratio Mask Representation for CASA-Based Speech Enhancement. 7-19 - Paul Magron
, Tuomas Virtanen
:
Complex ISNMF: A Phase-Aware Model for Monaural Audio Source Separation. 20-31 - Thanh Thi Hien Duong
, Ngoc Q. K. Duong
, Cong-Phuong Nguyen, Quoc-Cuong Nguyen
:
Gaussian Modeling-Based Multichannel Audio Source Separation Exploiting Generic Source Spectral Model. 32-43 - Guoqiang Zhang
, Jiancheng Tao, Xiaojun Qiu
, Ian S. Burnett
:
Decentralized Two-Channel Active Noise Control for Single Frequency by Shaping Matrix Eigenvalues. 44-52 - Yan Zhao
, Zhong-Qiu Wang
, DeLiang Wang
:
Two-Stage Deep Learning for Noisy-Reverberant Speech Enhancement. 53-62 - Naijun Zheng
, Xiao-Lei Zhang
:
Phase-Aware Speech Enhancement Based on Deep Neural Networks. 63-76 - Takafumi Moriya
, Tomohiro Tanaka, Takahiro Shinozaki, Shinji Watanabe
, Kevin Duh:
Evolution-Strategy-Based Automation of System Development for High-Performance Speech Recognition. 77-88 - Herman Kamper
, Gregory Shakhnarovich, Karen Livescu
:
Semantic Speech Retrieval With a Visually Grounded Model of Untranscribed Speech. 89-98 - Mathew Shaji Kavalekalam
, Jesper Kjær Nielsen
, Jesper Bünsow Boldt, Mads Græsbøll Christensen
:
Model-Based Speech Enhancement for Intelligibility Improvement in Binaural Hearing Aids. 99-113 - M. V. Achuth Rao
, Prasanta Kumar Ghosh
:
Glottal Inverse Filtering Using Probabilistic Weighted Linear Prediction. 114-124 - Yang Sun
, Wenwu Wang
, Jonathon A. Chambers, Syed Mohsen Naqvi
:
Two-Stage Monaural Source Separation in Reverberant Room Environments Using Deep Neural Networks. 125-139 - Luciana Ferrer
, Mahesh Kumar Nandwana, Mitchell McLaren, Diego Castán, Aaron Lawson:
Toward Fail-Safe Speaker Recognition: Trial-Based Calibration With a Reject Option. 140-153 - Jamal Amini
, Richard C. Hendriks
, Richard Heusdens
, Meng Guo
, Jesper Jensen
:
Asymmetric Coding for Rate-Constrained Noise Reduction in Binaural Hearing Aids. 154-167 - Jianfei Yu
, Jing Jiang, Rui Xia:
Global Inference for Aspect and Opinion Terms Co-Extraction Based on Multi-Task Neural Networks. 168-177 - Zhong-Qiu Wang
, Xueliang Zhang
, DeLiang Wang
:
Robust Speaker Localization Guided by Deep Learning-Based Time-Frequency Masking. 178-188 - Ke Tan
, Jitong Chen
, DeLiang Wang
:
Gated Residual Networks With Dilated Convolutions for Monaural Speech Enhancement. 189-198 - Hoang Gia Ngo
, Minh Nguyen
, Nancy F. Chen
:
Phonology-Augmented Statistical Framework for Machine Transliteration Using Limited Linguistic Resources. 199-211 - Yuma Koizumi
, Shoichiro Saito, Hisashi Uematsu, Yuta Kawachi, Noboru Harada
:
Unsupervised Detection of Anomalous Sound Based on Deep Learning and the Neyman-Pearson Lemma. 212-224 - Yaron Laufer
, Sharon Gannot
:
A Bayesian Hierarchical Model for Speech Enhancement With Time-Varying Audio Channel. 225-239
Volume 27, Number 2, February 2019
- Toru Nakashika
, Shinji Takaki, Junichi Yamagishi
:
Complex-Valued Restricted Boltzmann Machine for Speaker-Dependent Speech Parameterization From Complex Spectra. 244-254 - Feifei Xiong
, Stefan Goetze
, Birger Kollmeier, Bernd T. Meyer
:
Joint Estimation of Reverberation Time and Early-To-Late Reverberation Ratio From Single-Channel Speech Signals. 255-267 - Fabian-Robert Stöter
, Soumitro Chakrabarty
, Bernd Edler, Emanuël A. P. Habets
:
CountNet: Estimating the Number of Concurrent Speakers Using Supervised Learning. 268-282 - Morten Kolbaek
, Zheng-Hua Tan
, Jesper Jensen:
On the Relationship Between Short-Time Objective Intelligibility and Short-Time Spectral-Amplitude Mean-Square Error for Speech Enhancement. 283-295 - Martin Weiss Hansen
, Jesper Rindom Jensen
, Mads Græsbøll Christensen
:
Estimation of Fundamental Frequencies in Stereophonic Music Mixtures. 296-310 - Junwei Bao
, Duyu Tang, Nan Duan
, Zhao Yan
, Ming Zhou, Tiejun Zhao:
Text Generation From Tables. 311-320 - Andreas I. Koutrouvelis
, Richard C. Hendriks
, Richard Heusdens
, Jesper Jensen
:
A Convex Approximation of the Relaxed Binaural Beamforming Optimization Problem. 321-331 - Tetsuya Hashimoto
, Daisuke Saito, Nobuaki Minematsu:
Many-to-Many and Completely Parallel-Data-Free Voice Conversion Based on Eigenspace DNN. 332-341 - Fatemeh Pishdadian
, Bryan Pardo:
Multi-Resolution Common Fate Transform. 342-354 - Yiming Wu
, Wei Li
:
Automatic Audio Chord Recognition With MIDI-Trained Deep Feature and BLSTM-CRF Sequence Decoding Model. 355-366 - Keisuke Imoto
, Nobutaka Ono
:
Acoustic Topic Model for Scene Analysis With Intermittently Missing Observations. 367-382 - Ke Xiao
, Supin Wang, Mingxi Wan
, Liang Wu
:
Reconstruction of Mandarin Electrolaryngeal Fricatives With Hybrid Noise Source. 383-391 - Lakshmi Krishnan
, Terence Betlehem, Paul D. Teal
:
Fast Algorithms for Acoustic Impulse Response Shaping. 392-403 - Vahid Zakeri
, Antony J. Hodgson:
Automatic Identification of Hard and Soft Bone Tissues by Analyzing Drilling Sounds. 404-414 - Stefan Bilbao
, Brian Hamilton
:
Directional Sources in Wave-Based Acoustic Simulation. 415-428 - Yichi Zhang
, Bryan Pardo, Zhiyao Duan
:
Siamese Style Convolutional Neural Networks for Sound Search by Vocal Imitation. 429-441 - Fangchen Feng
, Matthieu Kowalski
:
Underdetermined Reverberant Blind Source Separation: Sparse Approaches for Multiplicative and Convolutive Narrowband Approximation. 442-456 - Zhong-Qiu Wang
, DeLiang Wang
:
Combining Spectral and Spatial Features for Deep Learning Based Blind Speaker Separation. 457-468
Volume 27, Number 3, March 2019
- Mohsen Zareian Jahromi
, Adel Zahedi
, Jesper Jensen
, Jan Østergaard
:
Information Loss in the Human Auditory System. 472-481 - Yaakov Buchris
, Alon Amar
, Jacob Benesty
, Israel Cohen
:
Incoherent Synthesis of Sparse Arrays for Frequency-Invariant Beamforming. 482-495 - Yogachandran Rahulamathavan
, Kunaraj R. Sutharsini, Indranil Ghosh Ray
, Rongxing Lu
, Muttukrishnan Rajarajan:
Privacy-Preserving iVector-Based Speaker Verification. 496-506 - Jiajun Zhang
, Yang Zhao, Haoran Li
, Chengqing Zong
:
Attention With Sparsity Regularization for Neural Machine Translation and Summarization. 507-518 - Alastair H. Moore
, Wei Xue
, Patrick A. Naylor
, Mike Brookes
:
Noise Covariance Matrix Estimation for Rotating Microphone Arrays. 519-530 - Guang Yang
, Haibo He
, Qian Chen
:
Emotion-Semantic-Enhanced Neural Network. 531-543 - Thomas Dietzen
, Ann Spriet
, Wouter Tirry, Simon Doclo
, Marc Moonen
, Toon van Waterschoot:
Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction. 544-558 - Jianqing Gao
, Jun Du
, Enhong Chen
:
Mixed-Bandwidth Cross-Channel Speech Recognition via Joint Optimization of DNN-Based Bandwidth Expansion and Acoustic Modeling. 559-571 - Salil Deena
, Madina Hasan, Mortaza Doulaty
, Oscar Saz, Thomas Hain
:
Recurrent Neural Network Language Model Adaptation for Multi-Genre Broadcast Speech Recognition and Alignment. 572-582 - Femke B. Gelderblom
, Tron V. Tronstad
, Erlend Magnus Viggen
:
Subjective Evaluation of a Noise-Reduced Training Target for Deep Neural Network-Based Speech Enhancement. 583-594 - Maria Luis Valero
, Emanuël A. P. Habets
:
Low-Complexity Multi-Microphone Acoustic Echo Control in the Short-Time Fourier Transform Domain. 595-609 - Qiaoxi Zhu
, Philip Coleman
, Xiaojun Qiu
, Ming Wu, Jun Yang
, Ian S. Burnett
:
Robust Personal Audio Geometry Optimization in the SVD-Based Modal Domain. 610-620 - Jiangyan Yi
, Jianhua Tao
, Zhengqi Wen, Ye Bai:
Language-Adversarial Transfer Learning for Low-Resource Speech Recognition. 621-630 - Jing-Xuan Zhang
, Zhen-Hua Ling
, Li-Juan Liu, Yuan Jiang, Li-Rong Dai:
Sequence-to-Sequence Acoustic Modeling for Voice Conversion. 631-644 - Xiaofei Li
, Laurent Girin
, Sharon Gannot
, Radu Horaud
:
Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function. 645-659
Volume 27, Number 4, April 2019
- Ziyue Zhao
, Huijun Liu, Tim Fingscheidt
:
Convolutional Neural Networks to Enhance Coded Speech. 663-678 - Henning F. Schepker
, Sven Erik Nordholm
, Linh Thi Thuc Tran, Simon Doclo
:
Null-Steering Beamformer-Based Feedback Cancellation for Multi-Microphone Hearing Aids With Incoming Signal Preservation. 679-691 - Zengxi Li
, Yan Song
, Li-Rong Dai, Ian McLoughlin
:
Listening and Grouping: An Online Autoregressive Approach for Monaural Speech Separation. 692-703 - Dong Deng
, Liping Jing, Jian Yu, Shaolong Sun
, Michael K. Ng
:
Sentiment Lexicon Construction With Hierarchical Supervision Topic Model. 704-718 - Mantong Zhou
, Minlie Huang
, Xiaoyan Zhu:
Story Ending Selection by Finding Hints From Pairwise Candidate Endings. 719-729 - Jan-Gerrit Richter
, Janina Fels
:
On the Influence of Continuous Subject Rotation During High-Resolution Head-Related Transfer Function Measurements. 730-741 - Jianguo Yu
, Konstantin Markov
, Tomoko Matsui:
Articulatory and Spectrum Information Fusion Based on Deep Recurrent Neural Networks. 742-752 - Fábio P. Itturriet
, Márcio Holsbach Costa
:
Perceptually Relevant Preservation of Interaural Time Differences in Binaural Hearing Aids. 753-764 - Johannes Abel
, Tim Fingscheidt
:
Sinusoidal-Based Lowband Synthesis for Artificial Speech Bandwidth Extension. 765-776 - Qiuqiang Kong
, Yong Xu
, Iwona Sobieraj, Wenwu Wang
, Mark D. Plumbley
:
Sound Event Detection and Time-Frequency Segmentation from Weakly Labelled Data. 777-787 - Yi-Lin Tuan, Hung-yi Lee
:
Improving Conditional Sequence Generative Adversarial Networks by Stepwise Evaluation. 788-798 - Nikolaos Dionelis
, Mike Brookes
:
Modulation-Domain Kalman Filtering for Monaural Blind Speech Denoising and Dereverberation. 799-814 - Reza Lotfian
, Carlos Busso
:
Curriculum Learning for Speech Emotion Recognition From Crowdsourced Labels. 815-826 - Shoufeng Lin
:
Robust Pitch Estimation and Tracking For Speakers Based on Subband Encoding and The Generalized Labeled Multi-Bernoulli Filter. 827-841 - Xianghui Wang
, Israel Cohen
, Jingdong Chen
, Jacob Benesty
:
On Robust and High Directive Beamforming With Small-Spacing Microphone Arrays for Scattered Sources. 842-852 - Zhe Quan, Zhi-Jie Wang
, Yuquan Le, Bin Yao, Kenli Li, Jian Yin:
An Efficient Framework for Sentence Similarity Modeling. 853-865 - Nurul Lubis
, Sakriani Sakti, Koichiro Yoshino, Satoshi Nakamura
:
Positive Emotion Elicitation in Chat-Based Dialogue Systems. 866-877
Volume 27, Number 5, May 2019
- Francisco Javier Ibarrola
, Ruben Daniel Spies
, Leandro Ezequiel Di Persia
:
Switching Divergences for Spectral Learning in Blind Speech Dereverberation. 881-891 - Israel Cohen
, Jacob Benesty
, Jingdong Chen
:
Differential Kronecker Product Beamforming. 892-902 - Camelia Elisei-Iliescu, Constantin Paleologu
, Jacob Benesty
, Cristian Lucian Stanciu, Cristian Anghel
, Silviu Ciochina:
Recursive Least-Squares Algorithms for the Identification of Low-Rank Systems. 903-918 - Anurendra Kumar
, Tanaya Guha, Prasanta Kumar Ghosh:
Dirichlet Latent Variable Model: A Dynamic Model Based on Dirichlet Prior for Audio Processing. 919-931 - Peter Jancovic
, Münevver Köküer:
Bird Species Recognition Using Unsupervised Modeling of Individual Vocalization Elements. 932-947 - Tomoki Koriyama
, Takao Kobayashi
:
Statistical Parametric Speech Synthesis Using Deep Gaussian Processes. 948-959 - Kazuki Shimada
, Yoshiaki Bando
, Masato Mimura, Katsutoshi Itoyama
, Kazuyoshi Yoshii
, Tatsuya Kawahara
:
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition. 960-971 - Simon Widmark
:
Causal MSE-Optimal Filters for Personal Audio Subject to Constrained Contrast. 972-987
Volume 27, Number 6, June 2019
- Annamaria Mesaros
, Aleksandr Diment, Benjamin Elizalde
, Toni Heittola
, Emmanuel Vincent
, Bhiksha Raj, Tuomas Virtanen
:
Sound Event Detection in the DCASE 2017 Challenge. 992-1006 - Srikanth Raj Chetupalli
, Thippur V. Sreenivas
:
Late Reverberation Cancellation Using Bayesian Estimation of Multi-Channel Linear Predictors and Student's t-Source Prior. 1007-1018 - Lauri Juvela
, Bajibabu Bollepalli
, Vassilis Tsiaras, Paavo Alku
:
GlotNet - A Raw Waveform Model for the Glottal Excitation in Statistical Parametric Speech Synthesis. 1019-1030 - Fiete Winter
, Frank Schultz
, Gergely Firtha
, Sascha Spors
:
A Geometric Model for Prediction of Spatial Aliasing in 2.5D Sound Field Synthesis. 1031-1046 - Yuanyuan Liu
, Tan Lee
, Thomas K. T. Law, Kathy Yuet-Sheung Lee
:
Acoustical Assessment of Voice Disorder With Continuous Speech Using ASR Posterior Features. 1047-1059 - Christoph Pörschmann
, Johannes M. Arend
, Fabian Brinkmann
:
Directional Equalization of Sparse Head-Related Transfer Function Sets for Spatial Upsampling. 1060-1071 - Shreyas Srikanth Payal
, V. John Mathews
, Douglas J. Button, Ajay Iyer, Russell H. Lambert, Jeffrey Hutchings, Luis Antonio Azpicueta-Ruiz
:
Equalization of Nonlinear Propagation Distortion in Cylindrical Waveguides. 1072-1084 - Berrak Sisman
, Mingyang Zhang, Haizhou Li
:
Group Sparse Representation With WaveNet Vocoder Adaptation for Spectrum and Prosody Conversion. 1085-1097 - Jinkyu Lee
, Hong-Goo Kang
:
A Joint Learning Algorithm for Complex-Valued T-F Masks in Deep Learning-Based Single-Channel Speech Enhancement Systems. 1098-1109
Volume 27, Number 7, July 2019
- Jan-Hendrik Flesner
, Thomas Biberger
, Stephan Dieter Ewert
:
Subjective and Objective Assessment of Monaural and Binaural Aspects of Audio Quality. 1112-1125 - Bolaji Yusuf
, Batuhan Gündogdu
, Murat Saraclar
:
Low Resource Keyword Search With Synthesized Crosslingual Exemplars. 1126-1135 - Andreas I. Koutrouvelis
, Richard C. Hendriks
, Richard Heusdens
, Jesper Jensen
:
Robust Joint Estimation of Multimicrophone Signal Model Parameters. 1136-1150 - Benjamin Cauchi
, Kai Siedenburg
, João Felipe Santos
, Tiago H. Falk
, Simon Doclo
, Stefan Goetze
:
Non-Intrusive Speech Quality Prediction Using Modulation Energies and LSTM-Network. 1151-1163 - Yike Zhang
, Pengyuan Zhang
, Yonghong Yan
:
Tailoring an Interpretable Neural Language Model. 1164-1178 - Ashutosh Pandey
, DeLiang Wang
:
A New Framework for CNN-Based Speech Enhancement in the Time Domain. 1179-1188 - Vikram C. M.
, Nagaraj Adiga
, S. R. Mahadeva Prasanna:
Detection of Nasalized Voiced Stops in Cleft Palate Speech Using Epoch-Synchronous Features. 1189-1200 - Huaishao Luo
, Tianrui Li
, Bing Liu, Bin Wang, Herwig Unger:
Improving Aspect Term Extraction With Bidirectional Dependency Tree Representation. 1201-1212
Volume 27, Number 8, August 2019
- Teng Zhang
, Ji Wu
:
Constrained Learned Feature Extraction for Acoustic Scene Classification. 1216-1228 - Leonardo Gabrielli
, Stefano Tomassetti, Stefano Squartini, Carlo Zinato, Stefano Guaiana:
A Multi-Stage Algorithm for Acoustic Physical Model Parameters Estimation. 1229-1240 - Bing Yang
, Hong Liu
, Cheng Pang
, Xiaofei Li
:
Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering. 1241-1255 - Yi Luo
, Nima Mesgarani
:
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation. 1256-1266 - Achintya Kumar Sarkar
, Zheng-Hua Tan
, Hao Tang, Suwon Shon, James R. Glass:
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification. 1267-1279 - Jiawen Chua
, W. Bastiaan Kleijn
:
A Low Latency Approach for Blind Source Separation. 1280-1294 - Chao Pan
, Jingdong Chen
, Jacob Benesty
, Guangming Shi
:
On the Design of Target Beampatterns for Differential Microphone Arrays. 1295-1307 - Aqil M. Azmi
, Manal N. Almutery, Hatim A. Aboalsamh
:
Real-Word Errors in Arabic Texts: A Better Algorithm for Detection and Correction. 1308-1320 - Mandy Korpusik
, James R. Glass:
Deep Learning for Database Mapping and Asking Clarification Questions in Dialogue Systems. 1321-1334 - Junhyeong Pak
, Jong Won Shin
:
Sound Localization Based on Phase Difference Enhancement Using Deep Neural Networks. 1335-1345
Volume 27, Number 9, September 2019
- Randall Ali
, Giuliano Bernardi
, Toon van Waterschoot
, Marc Moonen
:
Methods of Extending a Generalized Sidelobe Canceller With External Microphones. 1349-1364 - Xiaofei Li
, Laurent Girin
, Sharon Gannot
, Radu Horaud
:
Multichannel Online Dereverberation Based on Spectral Magnitude Inverse Filtering. 1365-1377 - Lu Chen
, Zhi Chen
, Bowen Tan, Sishan Long, Milica Gasic
, Kai Yu
:
AgentGraph: Toward Universal Dialogue Management With Structured Deep Reinforcement Learning. 1378-1391 - Luoqin Li
, Jiabing Wang
, Jichang Li, Qianli Ma
, Jia Wei:
Relation Classification via Keyword-Attentive Sentence Mechanism and Synthetic Stimulation Loss. 1392-1404 - Martin Bo Møller
, Jesper Kjær Nielsen
, Efren Fernandez-Grande
, Søren Krarup Olesen:
On the Influence of Transfer Function Noise on Sound Zone Control in a Room. 1405-1418 - Zhen Xu
, Chengjie Sun
, Yinong Long, Bingquan Liu
, Baoxun Wang
, Mingjiang Wang
, Min Zhang, Xiaolong Wang:
Dynamic Working Memory for Context-Aware Response Generation. 1419-1431 - Hirokazu Kameoka
, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
ACVAE-VC: Non-Parallel Voice Conversion With Auxiliary Classifier Variational Autoencoder. 1432-1443 - Xie Chen
, Xunying Liu
, Yu Wang
, Anton Ragni, Jeremy Heng Meng Wong
, Mark J. F. Gales:
Exploiting Future Word Contexts in Neural Network Language Models for Speech Recognition. 1444-1454 - Rui Wang
, Zhe Chen
, Fuliang Yin
:
DOA-Based Three-Dimensional Node Geometry Calibration in Acoustic Sensor Networks and Its Cramér-Rao Bound and Sensitivity Analysis. 1455-1468 - Chia-Hsuan Lee
, Hung-yi Lee
, Szu-Lin Wu, Chi-Liang Liu, Wei Fang
, Juei-Yang Hsu, Bo-Hsiang Tseng:
Machine Comprehension of Spoken Content: TOEFL Listening Test and Spoken SQuAD. 1469-1480 - Yi-Chen Chen, Sung-Feng Huang
, Hung-yi Lee
, Yu-Hsuan Wang, Chia-Hao Shen
:
Audio Word2vec: Sequence-to-Sequence Autoencoding for Unsupervised Learning of Audio Segmentation and Representation. 1481-1493
Volume 27, Number 10, October 2019
- Pairui Li
, Chuan Chen
, Wujie Zheng, Yuetang Deng, Fanghua Ye
, Zibin Zheng
:
STD: An Automatic Evaluation Metric for Machine Translation Based on Word Embeddings. 1497-1506 - Jie Zhang
, Richard Heusdens
, Richard Christian Hendriks
:
Relative Acoustic Transfer Function Estimation in Wireless Acoustic Sensor Networks. 1507-1519 - Jihwan Park
, Joon-Hyuk Chang
:
State-Space Microphone Array Nonlinear Acoustic Echo Cancellation Using Multi-Microphone Near-End Speech Covariance. 1520-1534 - Zhaojie Luo
, Jinhui Chen
, Tetsuya Takiguchi
, Yasuo Ariki
:
Emotional Voice Conversion Using Dual Supervised Adversarial Networks With Continuous Wavelet Transform F0 Features. 1535-1548 - Hala As'ad
, Martin Bouchard
, Homayoun Kamkar-Parsi
:
A Robust Target Linearly Constrained Minimum Variance Beamformer With Spatial Cues Preservation for Binaural Hearing Aids. 1549-1563 - Yijun Wang
, Yingce Xia, Li Zhao, Jiang Bian, Tao Qin
, Enhong Chen
, Tie-Yan Liu:
Semi-Supervised Neural Machine Translation via Marginal Distribution Estimation. 1564-1576 - Arindam Jati
, Panayiotis G. Georgiou
:
Neural Predictive Coding Using Convolutional Neural Networks Toward Unsupervised Learning of Speaker Characteristics. 1577-1589 - Federico Fontana
, Enrico Bozzo
:
Newton-Raphson Solution of Nonlinear Delay-Free Loop Filter Networks. 1590-1600 - Naoki Makishima
, Shinichi Mogami
, Norihiro Takamune, Daichi Kitamura
, Hayato Sumino, Shinnosuke Takamichi
, Hiroshi Saruwatari
, Nobutaka Ono
:
Independent Deeply Learned Matrix Analysis for Determined Audio Source Separation. 1601-1615 - Jeena J. Prakash
, Hema A. Murthy:
Analysis of Inter-Pausal Units in Indian Languages and Its Application to Text-to-Speech Synthesis. 1616-1628 - Yunshi Lan
, Shuohang Wang, Jing Jiang:
Knowledge Base Question Answering With a Matching-Aggregation Model and Question-Specific Contextual Relations. 1629-1638 - Xuefeng Bai
, Hailong Cao
, Kehai Chen
, Tiejun Zhao:
A Bilingual Adversarial Autoencoder for Unsupervised Bilingual Lexicon Induction. 1639-1648 - Guanlong Zhao
, Ricardo Gutierrez-Osuna:
Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion. 1649-1660
Volume 27, Number 11, November 2019
- Zhuosheng Zhang
, Hai Zhao
, Kangwei Ling, Jiangtong Li
, Zuchao Li
, Shexia He, Guohong Fu:
Effective Subword Segmentation for Text Comprehension. 1664-1674 - Yue Xie
, Ruiyu Liang
, Zhenlin Liang, Chengwei Huang
, Cairong Zou, Björn W. Schuller
:
Speech Emotion Classification Using Attention-Based LSTM. 1675-1685 - Shuai Wang
, Zili Huang, Yanmin Qian
, Kai Yu
:
Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification. 1686-1696 - Rui Lu
, Zhiyao Duan
, Changshui Zhang
:
Audio-Visual Deep Clustering for Speech Separation. 1697-1712 - Tetiana Parshakova
, François Rameau
, Andriy Serdega
, In So Kweon, Dae-Shik Kim
:
Latent Question Interpretation Through Variational Adaptation. 1713-1724 - Jeremy Heng Meng Wong
, Mark John Francis Gales
, Yu Wang
:
General Sequence Teacher-Student Learning. 1725-1736 - Liming Shi
, Jesper Kjær Nielsen
, Jesper Rindom Jensen
, Max A. Little
, Mads Græsbøll Christensen
:
Robust Bayesian Pitch Tracking Based on the Harmonic Model. 1737-1751 - Yan Yang, Changchun Bao
:
RS-CAE-Based AR-Wiener Filtering and Harmonic Recovery for Speech Enhancement. 1752-1762 - Alberto Bernardini
, Paolo Maffezzoni
, Augusto Sarti
:
Linear Multistep Discretization Methods With Variable Step-Size in Nonlinear Wave Digital Structures for Virtual Analog Modeling. 1763-1776 - Dong Deng
, Liping Jing
, Jian Yu, Shaolong Sun
:
Sparse Self-Attention LSTM for Sentiment Lexicon Construction. 1777-1790 - Qiuqiang Kong
, Changsong Yu, Yong Xu, Turab Iqbal
, Wenwu Wang
, Mark D. Plumbley
:
Weakly Labelled AudioSet Tagging With Attention Neural Networks. 1791-1802 - Samy Elshamy
, Tim Fingscheidt
:
DNN-Based Cepstral Excitation Manipulation for Speech Enhancement. 1803-1814 - Nooshin Maghsoodi
, Hossein Sameti, Hossein Zeinali
, Themos Stafylakis:
Speaker Recognition With Random Digit Strings Using Uncertainty Normalized HMM-Based i-Vectors. 1815-1825 - Sining Sun
, Pengcheng Guo, Lei Xie
, Mei-Yuh Hwang:
Adversarial Regularization for Attention Based End-to-End Robust Speech Recognition. 1826-1838 - Masood Delfarah
, DeLiang Wang
:
Deep Learning for Talker-Dependent Reverberant Speaker Separation: An Empirical Study. 1839-1848
Volume 27, Number 12, December 2019
- Natsuki Ueno
, Shoichi Koyama
, Hiroshi Saruwatari
:
Three-Dimensional Sound Field Reproduction Based on Weighted Mode-Matching Method. 1852-1867 - Lijun Wu
, Xu Tan
, Tao Qin
, Jianhuang Lai
, Tie-Yan Liu:
Beyond Error Propagation: Language Branching Also Affects the Accuracy of Sequence Generation. 1868-1879 - Amit Das
, Jinyu Li
, Guoli Ye, Rui Zhao, Yifan Gong:
Advancing Acoustic-to-Word CTC Model With Attention and Mixed-Units. 1880-1892 - Niccolò Antonello
, Enzo De Sena
, Marc Moonen
, Patrick A. Naylor
, Toon van Waterschoot
:
Joint Acoustic Localization and Dereverberation Through Plane Wave Decomposition and Sparse Regularization. 1893-1905 - Federico Borra
, Alberto Bernardini
, Fabio Antonacci, Augusto Sarti
:
Uniform Linear Arrays of First-Order Steerable Differential Microphones. 1906-1918 - Li Chai
, Jun Du
, Qing-Feng Liu, Chin-Hui Lee
:
Using Generalized Gaussian Distributions to Improve Regression Error Modeling for Deep Learning-Based Speech Enhancement. 1919-1931 - Jun Qi
, Jun Du
, Sabato Marco Siniscalchi
, Chin-Hui Lee
:
A Theory on Deep Neural Network Based Vector-to-Vector Regression With an Illustration of Its Expressive Power in Speech Enhancement. 1932-1943 - Xudong Dang
, Qi Cheng
, Hongyan Zhu
:
Indoor Multiple Sound Source Localization via Multi-Dimensional Assignment Data Association. 1944-1956 - Martin Schneider
, Emanuël A. P. Habets
:
Iterative DFT-Domain Inverse Filter Optimization Using a Weighted Least-Squares Criterion. 1957-1969 - Kehai Chen
, Rui Wang
, Masao Utiyama, Eiichiro Sumita, Tiejun Zhao:
Neural Machine Translation With Sentence-Level Topic Context. 1970-1984 - Alejandro Gómez Alanís
, Antonio M. Peinado
, José A. González
, Angel M. Gomez
:
A Gated Recurrent Convolutional Neural Network for Robust Spoofing Detection. 1985-1999 - Siyuan Feng
, Tan Lee
:
Exploiting Cross-Lingual Speaker and Phonetic Diversity for Unsupervised Subword Modeling. 2000-2011 - Wei Li
, Nancy F. Chen
, Sabato Marco Siniscalchi
, Chin-Hui Lee:
Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models. 2012-2024 - Quansheng Tu, Huawei Chen
:
On Mainlobe Orientation of the First- and Second-Order Differential Microphone Arrays. 2025-2040 - Jan Chorowski
, Ron J. Weiss
, Samy Bengio, Aäron van den Oord:
Unsupervised Speech Representation Learning Using WaveNet Autoencoders. 2041-2053 - Vishnuvardhan Varanasi
, Ayushya Agarwal, Rajesh M. Hegde
:
Near-Field Acoustic Source Localization Using Spherical Harmonic Features. 2054-2066 - Yibin Zheng
, Jianhua Tao
, Zhengqi Wen, Jiangyan Yi
:
Forward-Backward Decoding Sequence for Regularizing End-to-End TTS. 2067-2079 - Yanhui Tu
, Jun Du
, Chin-Hui Lee
:
Speech Enhancement Based on Teacher-Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition. 2080-2091 - Yuzhou Liu
, DeLiang Wang
:
Divide and Conquer: A Deep CASA Approach to Talker-Independent Monaural Speaker Separation. 2092-2102 - Xuebo Liu
, Derek F. Wong
, Lidia S. Chao, Yang Liu
:
Latent Attribute Based Hierarchical Decoder for Neural Machine Translation. 2103-2112 - Jingyi Hu, Ning Chen
:
Enhanced Feature Summarizing for Effective Cover Song Identification. 2113-2126 - Qianli Ma
, Liuhong Yu, Shuai Tian, Enhuan Chen, Wing W. Y. Ng
:
Global-Local Mutual Attention Model for Text Classification. 2127-2139 - Vesa Välimäki
, Jussi Rämö
:
Neurally Controlled Graphic Equalizer. 2140-2149 - Sean U. N. Wood
, Johannes Stahl
, Pejman Mowlaee
:
Binaural Codebook-Based Speech Enhancement With Atomic Speech Presence Probability. 2150-2161 - Lukas Pfeifenberger
, Matthias Zöhrer
, Franz Pernkopf
:
Eigenvector-Based Speech Mask Estimation for Multi-Channel Speech Enhancement. 2162-2172 - Marc Arnela
, Saeed Dabbaghchian
, Oriol Guasch
, Olov Engwall
:
MRI-Based Vocal Tract Representations for the Three-Dimensional Finite Element Synthesis of Diphthongs. 2173-2182 - Varun Srivastava
, Mayank Mishra
:
Adversarial Approximate Inference for Speech to Electroglottograph Conversion. 2183-2196 - Kouhei Sekiguchi
, Yoshiaki Bando
, Aditya Arie Nugraha
, Kazuyoshi Yoshii
, Tatsuya Kawahara
:
Semi-Supervised Multichannel Speech Enhancement With a Deep Speech Prior. 2197-2212 - Qipeng Guo
, Xipeng Qiu
, Xiangyang Xue
, Zheng Zhang:
Low-Rank and Locality Constrained Self-Attention for Sequence Modeling. 2213-2222 - Jun Yu
, Qiang Ling
, Changwei Luo, Chang Wen Chen
:
Synthesizing 3D Trump: Predicting and Visualizing the Relationship Between Text, Speech, and Articulatory Movements. 2223-2233 - Ryosuke Sugiura
, Yutaka Kamamoto
, Takehiro Moriya:
Shape Control of Discrete Generalized Gaussian Distributions for Frequency-Domain Audio Coding. 2234-2248 - Zamir Ben-Hur
, David Lou Alon, Ravish Mehra, Boaz Rafaely
:
Efficient Representation and Sparse Sampling of Head-Related Transfer Functions Using Phase-Correction Based on Ear Alignment. 2249-2262 - Luca Remaggi
, Philip J. B. Jackson
, Wenwu Wang
:
Modeling the Comb Filter Effect and Interaural Coherence for Binaural Source Separation. 2263-2277 - Biao Zhang
, Deyi Xiong
, Jinsong Su
, Jiebo Luo
:
Future-Aware Knowledge Distillation for Neural Machine Translation. 2278-2287 - Randall Ali
, Toon van Waterschoot, Marc Moonen
:
Integration of a Priori and Estimated Constraints Into an MVDR Beamformer for Speech Enhancement. 2288-2300 - Nitya Tiwari, Prem C. Pandey
:
Speech Enhancement Using Noise Estimation With Dynamic Quantile Tracking. 2301-2312 - Junwen Duan
, Xiao Ding
, Yue Zhang, Ting Liu:
TEND: A Target-Dependent Representation Learning Framework for News Document. 2313-2325 - Lujun Zhao
, Xipeng Qiu
, Qi Zhang
, Xuanjing Huang
:
Sequence Labeling With Deep Gated Dual Path CNN. 2326-2335 - Akihiro Kato
, Tomi H. Kinnunen
:
Statistical Regression Models for Noise Robust F0 Estimation Using Recurrent Deep Neural Networks. 2336-2349 - Dayiheng Liu
, Jie Fu
, Qian Qu, Jiancheng Lv
:
BFGAN: Backward and Forward Generative Adversarial Networks for Lexically Constrained Sentence Generation. 2350-2361 - Andrés Marafioti
, Nathanaël Perraudin
, Nicki Holighaus
, Piotr Majdak
:
A Context Encoder For Audio Inpainting. 2362-2372 - Jichen Yang
, Rohan Kumar Das
, Nina Zhou
:
Extraction of Octave Spectra Information for Spoofing Attack Detection. 2373-2384 - Oren Barkan
, David Tsiris
, Ori Katz, Noam Koenigstein
:
InverSynth: Deep Estimation of Synthesizer Parameter Configurations From Audio Signals. 2385-2396

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.