


default search action
Yi-Chiao Wu
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j17]Wen-Chin Huang
, Yi-Chiao Wu
, Tomoki Toda
:
Multi-Speaker Text-to-Speech Training With Speaker Anonymized Data. IEEE Signal Process. Lett. 31: 2995-2999 (2024) - [j16]Li-Wen Chiu
, Yang-Ren Chou
, Yi-Chiao Wu, Meng-Liang Chung
, Bing-Fei Wu
, Kun-Ta Chou
:
Video-Based Contactless Detection of Sleep Apnea With Deep-Learning Model. IEEE Trans. Instrum. Meas. 73: 1-13 (2024) - [j15]Bing-Fei Wu
, Li-Wen Chiu
, Yi-Chiao Wu
, Chun-Chih Lai, Hao-Min Cheng, Pao-Hsien Chu
:
Contactless Blood Pressure Measurement Via Remote Photoplethysmography With Synthetic Data Generation Using Generative Adversarial Networks. IEEE J. Biomed. Health Informatics 28(2): 621-632 (2024) - [j14]Yi-Chiao Wu
, Chun-Hsien Lin
, Li-Wen Chiu
, Bing-Fei Wu
, Meng-Liang Chung
, Sung-Chun Tang
, Yu Sun
:
Contact-Free Atrial Fibrillation Screening With Attention Network. IEEE J. Biomed. Health Informatics 28(9): 5124-5135 (2024) - [c39]Yi-Chiao Wu, Dejan Markovic, Steven Krenn, Israel D. Gebru, Alexander Richard:
ScoreDec: A Phase-Preserving High-Fidelity Audio Codec with a Generalized Score-Based Diffusion Post-Filter. ICASSP 2024: 361-365 - [i35]Wen-Chin Huang, Yi-Chiao Wu, Tomoki Toda:
Multi-speaker Text-to-speech Training with Speaker Anonymized Data. CoRR abs/2405.11767 (2024) - [i34]Julius Richter, Yi-Chiao Wu, Steven Krenn, Simon Welker, Bunlong Lay, Shinji Watanabe, Alexander Richard, Timo Gerkmann:
EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation. CoRR abs/2406.06185 (2024) - [i33]Wenze Ren, Yi-Cheng Lin, Huang-Cheng Chou, Haibin Wu, Yi-Chiao Wu, Chi-Chun Lee, Hung-yi Lee, Yu Tsao:
EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations. CoRR abs/2407.15458 (2024) - [i32]Haibin Wu, Xuanjun Chen, Yi-Cheng Lin, Kai-Wei Chang, Jiawei Du, Ke-Han Lu, Alexander H. Liu, Ho-Lam Chung, Yuan-Kuei Wu, Dongchao Yang, Songxiang Liu, Yi-Chiao Wu, Xu Tan, James R. Glass, Shinji Watanabe, Hung-yi Lee:
Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models. CoRR abs/2409.14085 (2024) - [i31]Adam Polyak, Amit Zohar, Andrew Brown, Andros Tjandra, Animesh Sinha, Ann Lee, Apoorv Vyas, Bowen Shi, Chih-Yao Ma, Ching-Yao Chuang, David Yan, Dhruv Choudhary, Dingkang Wang, Geet Sethi, Guan Pang, Haoyu Ma, Ishan Misra, Ji Hou, Jialiang Wang, Kiran Jagadeesh, Kunpeng Li, Luxin Zhang, Mannat Singh, Mary Williamson, Matt Le, Matthew Yu, Mitesh Kumar Singh, Peizhao Zhang, Peter Vajda, Quentin Duval, Rohit Girdhar, Roshan Sumbaly, Sai Saketh Rambhatla, Sam S. Tsai, Samaneh Azadi, Samyak Datta, Sanyuan Chen, Sean Bell, Sharadh Ramaswamy, Shelly Sheynin, Siddharth Bhattacharya, Simran Motwani, Tao Xu, Tianhe Li, Tingbo Hou, Wei-Ning Hsu, Xi Yin, Xiaoliang Dai, Yaniv Taigman, Yaqiao Luo, Yen-Cheng Liu, Yi-Chiao Wu, Yue Zhao, Yuval Kirstain, Zecheng He, Zijian He, Albert Pumarola, Ali K. Thabet, Artsiom Sanakoyeu, Arun Mallya, Baishan Guo, Boris Araya, Breena Kerr, Carleigh Wood, Ce Liu, Cen Peng, Dmitry Vengertsev, Edgar Schönfeld, Elliot Blanchard, Felix Juefei-Xu, Fraylie Nord, Jeff Liang, John Hoffman, Jonas Kohler, Kaolin Fire, Karthik Sivakumar, Lawrence Chen, Licheng Yu, Luya Gao, Markos Georgopoulos, Rashel Moritz, Sara K. Sampson, Shikai Li, Simone Parmeggiani, Steve Fine, Tara Fowler, Vladan Petrovic, Yuming Du:
Movie Gen: A Cast of Media Foundation Models. CoRR abs/2410.13720 (2024) - 2023
- [j13]Yi-Chiao Wu
, Li-Wen Chiu
, Chun-Chih Lai, Bing-Fei Wu
, Sunny S. J. Lin
:
Recognizing, Fast and Slow: Complex Emotion Recognition With Facial Expression Detection and Remote Physiological Measurement. IEEE Trans. Affect. Comput. 14(4): 3177-3190 (2023) - [j12]Reo Yoneyama
, Yi-Chiao Wu
, Tomoki Toda
:
High-Fidelity and Pitch-Controllable Neural Vocoder Based on Unified Source-Filter Networks. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3717-3729 (2023) - [j11]Li-Wen Chiu
, Yang-Ren Chou
, Yi-Chiao Wu
, Bing-Fei Wu
:
Deep-Learning-Based Remote Photoplethysmography Measurement in Driving Scenarios With Color and Near-Infrared Images. IEEE Trans. Instrum. Meas. 72: 1-12 (2023) - [j10]Yi-Chiao Wu
, Li-Wen Chiu
, Bing-Fei Wu
, Linda Li-Chuan Lin
, Tsai-Hsuan Ho
, Meng-Liang Chung
, Shou-Fang Wu
:
Motion Robust Remote Photoplethysmography Measurement During Exercise for Contactless Physical Activity Intensity Detection. IEEE Trans. Instrum. Meas. 72: 1-14 (2023) - [c38]Yi-Chiao Wu, Israel D. Gebru, Dejan Markovic, Alexander Richard:
Audiodec: An Open-Source Streaming High-Fidelity Neural Audio Codec. ICASSP 2023: 1-5 - [c37]Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda:
Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder. ICASSP 2023: 1-5 - [i30]Apoorv Vyas, Bowen Shi, Matthew Le, Andros Tjandra, Yi-Chiao Wu, Baishan Guo, Jiemin Zhang, Xinyue Zhang, Robert Adkins, William Ngan, Jeff Wang, Ivan Cruz, Bapi Akula, Akinniyi Akinyemi, Brian Ellis, Rashel Moritz, Yael Yungster, Alice Rakotoarison, Liang Tan, Chris Summers, Carleigh Wood, Joshua Lane, Mary Williamson, Wei-Ning Hsu:
Audiobox: Unified Audio Generation with Natural Language Prompts. CoRR abs/2312.15821 (2023) - 2022
- [j9]Bing-Fei Wu
, Yi-Chiao Wu
, Li-Wen Chiu
, Hsuan-Po Liu:
Soft Label With Channel Encoding for Dependent Facial Image Classification. IEEE Access 10: 10661-10672 (2022) - [j8]Bing-Fei Wu
, Yi-Chiao Wu
, Yi-Wei Chou
:
A Compensation Network With Error Mapping for Robust Remote Photoplethysmography in Noise-Heavy Conditions. IEEE Trans. Instrum. Meas. 71: 1-11 (2022) - [c36]Bing-Fei Wu, Li-Wen Chiu, Yi-Chiao Wu, Chun-Chih Lai, Pao-Hsien Chu:
Contactless Blood Pressure Measurement via Remote Photoplethysmography with Synthetic Data Generation Using Generative Adversarial Network. CVPR Workshops 2022: 2129-2137 - [c35]Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing
, Wen-Chin Huang, Tomoki Toda:
Direct Noisy Speech Modeling for Noisy-To-Noisy Voice Conversion. ICASSP 2022: 6787-6791 - [c34]Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda:
Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation. INTERSPEECH 2022: 848-852 - [i29]Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda:
Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation. CoRR abs/2205.06053 (2022) - [i28]Yi-Chiao Wu, Patrick Lumban Tobing, Kazuki Yasuhara, Noriyuki Matsunaga, Yamato Ohtani, Tomoki Toda:
A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System. CoRR abs/2207.05913 (2022) - [i27]Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda:
Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder. CoRR abs/2210.15533 (2022) - 2021
- [b1]Yi-Chiao Wu:
Incorporating Prior Knowledge on Speech Production Mechanism into Neural Speech Waveform Generation. Nagoya University, Japan, 2021 - [j7]Wen-Chin Huang
, Tomoki Hayashi
, Yi-Chiao Wu
, Hirokazu Kameoka
, Tomoki Toda:
Pretraining Techniques for Sequence-to-Sequence Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 29: 745-755 (2021) - [j6]Yi-Chiao Wu
, Tomoki Hayashi
, Takuma Okamoto, Hisashi Kawai, Tomoki Toda
:
Quasi-Periodic Parallel WaveGAN: A Non-Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network. IEEE ACM Trans. Audio Speech Lang. Process. 29: 792-806 (2021) - [j5]Yi-Chiao Wu
, Tomoki Hayashi
, Patrick Lumban Tobing
, Kazuhiro Kobayashi
, Tomoki Toda
:
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1134-1148 (2021) - [c33]Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda:
Noisy-to-Noisy Voice Conversion Framework with Denoising Model. APSIPA ASC 2021: 814-820 - [c32]Hsin-Tien Chiang, Yi-Chiao Wu, Cheng Yu, Tomoki Toda, Hsin-Min Wang, Yih-Chun Hu, Yu Tsao:
HASA-Net: A Non-Intrusive Hearing-Aid Speech Assessment Network. ASRU 2021: 907-913 - [c31]Kazuhiro Kobayashi, Wen-Chin Huang, Yi-Chiao Wu, Patrick Lumban Tobing
, Tomoki Hayashi, Tomoki Toda:
Crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder. ICASSP 2021: 5934-5938 - [c30]Wen-Chin Huang, Yi-Chiao Wu, Tomoki Hayashi:
Any-to-One Sequence-to-Sequence Voice Conversion Using Self-Supervised Discrete Speech Representations. ICASSP 2021: 5944-5948 - [c29]Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda:
Unified Source-Filter GAN: Unified Source-Filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN. Interspeech 2021: 2187-2191 - [c28]Yi-Chiao Wu, Cheng-Hung Hu, Hung-Shin Lee, Yu-Huai Peng, Wen-Chin Huang, Yu Tsao, Hsin-Min Wang
, Tomoki Toda:
Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder. Interspeech 2021: 3630-3634 - [i26]Kazuhiro Kobayashi, Wen-Chin Huang, Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Tomoki Toda:
crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder. CoRR abs/2103.02858 (2021) - [i25]Cheng-Hung Hu, Yi-Chiao Wu, Wen-Chin Huang, Yu-Huai Peng, Yu-Wen Chen, Pin-Jui Ku, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
The AS-NU System for the M2VoC Challenge. CoRR abs/2104.03009 (2021) - [i24]Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda:
Unified Source-Filter GAN: Unified Source-filter Network Based On Factorization of Quasi-Periodic Parallel WaveGAN. CoRR abs/2104.04668 (2021) - [i23]Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda:
Noisy-to-Noisy Voice Conversion Framework with Denoising Model. CoRR abs/2109.10608 (2021) - [i22]Hsin-Tien Chiang, Yi-Chiao Wu, Cheng Yu, Tomoki Toda, Hsin-Min Wang, Yih-Chun Hu, Yu Tsao:
HASA-net: A non-intrusive hearing-aid speech assessment network. CoRR abs/2111.05691 (2021) - [i21]Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda:
Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion. CoRR abs/2111.07116 (2021) - 2020
- [j4]Yi-Chiao Wu
, Patrick Lumban Tobing
, Kazuhiro Kobayashi, Tomoki Hayashi, Tomoki Toda
:
Non-Parallel Voice Conversion System With WaveNet Vocoder and Collapsed Speech Suppression. IEEE Access 8: 62094-62106 (2020) - [c27]Wen-Chin Huang, Patrick Lumban Tobing, Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Toda:
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders. Blizzard Challenge / Voice Conversion Challenge 2020 - [c26]Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Toda:
Baseline System of Voice Conversion Challenge 2020 with Cyclic Variational Autoencoder and Parallel WaveGAN. Blizzard Challenge / Voice Conversion Challenge 2020 - [c25]Patrick Lumban Tobing
, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda
:
Efficient Shallow Wavenet Vocoder Using Multiple Samples Output Based on Laplacian Distribution and Linear Prediction. ICASSP 2020: 7204-7208 - [c24]Yi-Chiao Wu, Tomoki Hayashi, Takuma Okamoto, Hisashi Kawai, Tomoki Toda
:
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-Autoregressive Pitch-Dependent Dilated Convolution Model for Parametric Speech Generation. INTERSPEECH 2020: 3535-3539 - [c23]Yi-Chiao Wu, Patrick Lumban Tobing
, Kazuki Yasuhara, Noriyuki Matsunaga, Yamato Ohtani, Tomoki Toda
:
A Cyclical Post-Filtering Approach to Mismatch Refinement of Neural Vocoder for Text-to-Speech Systems. INTERSPEECH 2020: 3540-3544 - [c22]Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda
:
Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining. INTERSPEECH 2020: 4676-4680 - [c21]Patrick Lumban Tobing
, Tomoki Hayashi, Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Toda
:
Cyclic Spectral Modeling for Unsupervised Unit Discovery into Voice Conversion with Excitation and Waveform Modeling. INTERSPEECH 2020: 4861-4865 - [c20]Bing-Fei Wu, Yi-Chiao Wu:
Masked Neural Sparse Encoder for Face Occlusion Detection. SMC 2020: 2471-2476 - [i20]Yi-Chiao Wu, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Hayashi, Tomoki Toda:
Non-parallel Voice Conversion System with WaveNet Vocoder and Collapsed Speech Suppression. CoRR abs/2003.11750 (2020) - [i19]Yi-Chiao Wu, Tomoki Hayashi, Takuma Okamoto, Hisashi Kawai, Tomoki Toda:
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation. CoRR abs/2005.08654 (2020) - [i18]Yi-Chiao Wu, Patrick Lumban Tobing, Kazuki Yasuhara, Noriyuki Matsunaga, Yamato Ohtani, Tomoki Toda:
A Cyclical Post-filtering Approach to Mismatch Refinement of Neural Vocoder for Text-to-speech Systems. CoRR abs/2005.08659 (2020) - [i17]Yi-Chiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda:
Quasi-Periodic WaveNet: An Autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network. CoRR abs/2007.05663 (2020) - [i16]Yi-Chiao Wu, Tomoki Hayashi, Takuma Okamoto, Hisashi Kawai, Tomoki Toda:
Quasi-Periodic Parallel WaveGAN: A Non-autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network. CoRR abs/2007.12955 (2020) - [i15]Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda:
Pretraining Techniques for Sequence-to-Sequence Voice Conversion. CoRR abs/2008.03088 (2020) - [i14]Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Toda:
Baseline System of Voice Conversion Challenge 2020 with Cyclic Variational Autoencoder and Parallel WaveGAN. CoRR abs/2010.04429 (2020) - [i13]Wen-Chin Huang, Patrick Lumban Tobing, Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Toda:
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders. CoRR abs/2010.04446 (2020) - [i12]Wen-Chin Huang, Yi-Chiao Wu, Tomoki Hayashi, Tomoki Toda:
Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations. CoRR abs/2010.12231 (2020)
2010 – 2019
- 2019
- [j3]Patrick Lumban Tobing
, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda
:
Voice Conversion With CycleRNN-Based Spectral Mapping and Finely Tuned WaveNet Vocoder. IEEE Access 7: 171114-171125 (2019) - [c19]Wen-Chin Huang, Yi-Chiao Wu, Hsin-Te Hwang, Patrick Lumban Tobing
, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda
, Yu Tsao
, Hsin-Min Wang
:
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion. EUSIPCO 2019: 1-5 - [c18]Patrick Lumban Tobing
, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda
:
Voice Conversion with Cyclic Recurrent Neural Network and Fine-tuned Wavenet Vocoder. ICASSP 2019: 6815-6819 - [c17]Yi-Chiao Wu, Tomoki Hayashi, Patrick Lumban Tobing
, Kazuhiro Kobayashi, Tomoki Toda
:
Quasi-Periodic WaveNet Vocoder: A Pitch Dependent Dilated Convolution Model for Parametric Speech Generation. INTERSPEECH 2019: 196-200 - [c16]Patrick Lumban Tobing
, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda
:
Non-Parallel Voice Conversion with Cyclic Variational Autoencoder. INTERSPEECH 2019: 674-678 - [c15]Wen-Chin Huang, Yi-Chiao Wu, Chen-Chou Lo, Patrick Lumban Tobing
, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda
, Yu Tsao
, Hsin-Min Wang
:
Investigation of F0 Conditioning and Fully Convolutional Networks in Variational Autoencoder Based Voice Conversion. INTERSPEECH 2019: 709-713 - [c14]Wen-Chin Huang, Yi-Chiao Wu, Kazuhiro Kobayashi, Yu-Huai Peng, Hsin-Te Hwang, Patrick Lumban Tobing, Yu Tsao, Hsin-Min Wang, Tomoki Toda:
Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion. SSW 2019: 57-62 - [c13]Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
Statistical Voice Conversion with Quasi-periodic WaveNet Vocoder. SSW 2019: 63-68 - [i11]Wen-Chin Huang, Yi-Chiao Wu, Chen-Chou Lo, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion. CoRR abs/1905.00615 (2019) - [i10]Yi-Chiao Wu, Tomoki Hayashi, Patrick Lumban Tobing, Kazuhiro Kobayashi, Tomoki Toda:
Quasi-Periodic WaveNet Vocoder: A Pitch Dependent Dilated Convolution Model for Parametric Speech Generation. CoRR abs/1907.00797 (2019) - [i9]Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
Statistical Voice Conversion with Quasi-Periodic WaveNet Vocoder. CoRR abs/1907.08940 (2019) - [i8]Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
Non-Parallel Voice Conversion with Cyclic Variational Autoencoder. CoRR abs/1907.10185 (2019) - [i7]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019) - [i6]Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda:
Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining. CoRR abs/1912.06813 (2019) - 2018
- [j2]Hsin-Te Hwang, Yi-Chiao Wu, Syu-Siang Wang, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
Locally Linear Embedding Based Post-Filtering for Speech Enhancement. J. Inf. Sci. Eng. 34(6): 1469-1491 (2018) - [j1]Hsin-Te Hwang, Yi-Chiao Wu, Yu-Huai Peng, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang, Yih-Ru Wang, Sin-Horng Chen:
Voice Conversion Based on Locally Linear Embedding. J. Inf. Sci. Eng. 34(6): 1493-1516 (2018) - [c12]Yu-Huai Peng, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao
, Hsin-Min Wang
:
Exemplar-Based Spectral Detail Compensation for Voice Conversion. INTERSPEECH 2018: 486-490 - [c11]Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Hayashi, Patrick Lumban Tobing
, Tomoki Toda
:
Collapsed Speech Segment Detection and Suppression for WaveNet Vocoder. INTERSPEECH 2018: 1988-1992 - [c10]Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
The NU Non-Parallel Voice Conversion System for the Voice Conversion Challenge 2018. Odyssey 2018: 211-218 - [c9]Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:
NU Voice Conversion System for the Voice Conversion Challenge 2018. Odyssey 2018: 219-226 - [c8]Patrick Lumban Tobing
, Tomoki Hayashi, Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Toda
:
An Evaluation of Deep Spectral Mappings and WaveNet Vocoder for Voice Conversion. SLT 2018: 297-303 - [i5]Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Hayashi, Patrick Lumban Tobing, Tomoki Toda:
Collapsed speech segment detection and suppression for WaveNet vocoder. CoRR abs/1804.11055 (2018) - [i4]Wen-Chin Huang, Yi-Chiao Wu, Hsin-Te Hwang, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang:
Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion. CoRR abs/1811.11078 (2018) - 2017
- [c7]Yu-Huai Peng, Chin-Cheng Hsu, Yi-Chiao Wu, Hsin-Te Hwang, Yi-Wen Liu, Yu Tsao
, Hsin-Min Wang
:
Fast locally linear embedding algorithm for exemplar-based voice conversion. APSIPA 2017: 591-595 - [c6]Yi-Chiao Wu, Hsin-Te Hwang, Syu-Siang Wang
, Chin-Cheng Hsu, Ying-Hui Lai, Yu Tsao
, Hsin-Min Wang
:
A locally linear embbeding based postfiltering approach for speech enhancement. ICASSP 2017: 5555-5559 - [c5]Yi-Chiao Wu, Hsin-Te Hwang, Syu-Siang Wang
, Chin-Cheng Hsu, Yu Tsao
, Hsin-Min Wang
:
A Post-Filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement. INTERSPEECH 2017: 1953-1957 - [c4]Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao
, Hsin-Min Wang
:
Voice Conversion from Unaligned Corpora Using Variational Autoencoding Wasserstein Generative Adversarial Networks. INTERSPEECH 2017: 3364-3368 - [i3]Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Voice Conversion from Unaligned Corpora using Variational Autoencoding Wasserstein Generative Adversarial Networks. CoRR abs/1704.00849 (2017) - 2016
- [c3]Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao
, Hsin-Min Wang
:
Voice conversion from non-parallel corpora using variational auto-encoder. APSIPA 2016: 1-6 - [c2]Yi-Chiao Wu, Hsin-Te Hwang, Chin-Cheng Hsu, Yu Tsao
, Hsin-Min Wang
:
Locally Linear Embedding for Exemplar-Based Spectral Conversion. INTERSPEECH 2016: 1652-1656 - [c1]Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao
, Hsin-Min Wang
:
Dictionary update for NMF-based voice conversion using an encoder-decoder network. ISCSLP 2016: 1-5 - [i2]Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network. CoRR abs/1610.03988 (2016) - [i1]Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder. CoRR abs/1610.04019 (2016)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-21 00:16 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint