Skip to main content

Sequential Modeling for Polyps Identification from the Vocal Data

  • Conference paper
  • First Online:
Communications, Signal Processing, and Systems (CSPS 2018)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 516))

Abstract

Given the revival of neural networks and its recent impact in other disciplines and record-breaking performances in a variety of applications, in this paper, we employed a deep sequential model for polyps detection from the vocal data. Previous research of acoustic signal recognition (ASR) has focused on hand-crafted machine learning fashion, such as Mel-frequency cepstral coefficients with hidden Markov model and Gaussian mixture model. The deep model demonstrates its flexibility and potential to outperform the traditional methods, and we expand its scope on medical symptom identification. The mapping between the raw vocal signal and the symptom recognition is established, and we show that we can achieve a good recognition accuracy, which may appear to clinical diagnosis in the near future.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
€32.70 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
EUR 29.95
Price includes VAT (France)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
EUR 160.49
Price includes VAT (France)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
EUR 210.99
Price includes VAT (France)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
EUR 210.99
Price includes VAT (France)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    We follow the implementation from https://github.com/jameslyons/python_speech_features.

  2. 2.

    The corresponding code will be uploaded to Github for open source purpose.

References

  1. Wang TJ, Massaro JM, Levy D, et al. A risk score for predicting stroke or death in individuals with new-onset atrial fibrillation in the community: the framingham heart study. J Am Med Assoc. 2003;290(8):1049–56.

    Article  Google Scholar 

  2. Rabiner LR. A tutorial on hidden Markov models and selected applications in speech recognition. Proc IEEE. 1989;77(2):257–86.

    Article  Google Scholar 

  3. Roebroeck A, Seth AK, Valdes-Sosa P. Causal time series analysis of functional magnetic resonance imaging data. In: NIPS mini-symposium on causality in time series, 2011. p. 65–94.

    Google Scholar 

  4. Zhu F, Liang J. Soil moisture retrieval from UWB sensor data by leveraging fuzzy logic. IEEE Access, 2018. https://doi.org/10.1109/ACCESS.2018.2840159.

    Article  Google Scholar 

  5. Alipanahi B, Delong A, Weirauch MT, et al. Predicting the sequence specificities of DNA-and RNA-binding proteins by deep learning. Nat Biotechnol. 2015;33(8):831–9.

    Article  Google Scholar 

  6. Hochreiter S, Schmidhuber J. Long short-term memory. Neural comput. 1997;9(8):1735–80.

    Article  Google Scholar 

  7. Hinton G, Deng L, Yu D, et al. Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups. IEEE Signal Process Mag. 2012;29(6):82–97.

    Article  Google Scholar 

  8. Sutskever I, Vinyals O, Le QV. Sequence to sequence learning with neural networks. Adv Neural Inf Process Syst, 2014, p. 3104–12.

    Google Scholar 

  9. van der Westhuizen J, Lasenby J. The unreasonable effectiveness of the forget gate. arXiv preprint arXiv:1804.04849, 2018.

  10. Tank A, Cover I, Foti NJ, et al. An interpretable and sparse neural network model for nonlinear granger causality discovery. In: Accepted by NIPs time series workshop, 2017.

    Google Scholar 

  11. Choi E, Schuetz A, Stewart WF, et al. Using recurrent neural network models for early detection of heart failure onset. J Am Med Inf Assoc. 2016;24(2):361–70.

    Google Scholar 

  12. Zhong Z, Jiang T, Zhang W, et al. Analyzing speech of patients with vocal polyps based on channel parameters and fuzzy logic systems. Comput Math Appl. 2011;62(7):2834–42.

    Article  Google Scholar 

  13. Mitra SK, Kuo Y. Digital signal processing: a computer-based approach. New York: McGraw-Hill Higher Education; 2006.

    Google Scholar 

Download references

Acknowledgments

This work was supported in part by NSFC under Grant 61771342, 61731006, 61711530132, and Tianjin Higher Education Creative Team Funds Program.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhen Zhong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhu, F., Liang, Q., Zhong, Z. (2020). Sequential Modeling for Polyps Identification from the Vocal Data. In: Liang, Q., Liu, X., Na, Z., Wang, W., Mu, J., Zhang, B. (eds) Communications, Signal Processing, and Systems. CSPS 2018. Lecture Notes in Electrical Engineering, vol 516. Springer, Singapore. https://doi.org/10.1007/978-981-13-6504-1_113

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-6504-1_113

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-6503-4

  • Online ISBN: 978-981-13-6504-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics