


default search action
Siddharth Sigtia
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c14]Dominik Wagner, Alexander W. Churchill, Siddharth Sigtia, Panayiotis G. Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi:
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models. ICASSP 2024: 10451-10455 - [i14]Dominik Wagner, Alexander W. Churchill, Siddharth Sigtia, Panayiotis G. Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi:
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models. CoRR abs/2403.14438 (2024) - 2023
- [i13]Dominik Wagner, Alexander W. Churchill, Siddharth Sigtia, Panayiotis G. Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi:
Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models. CoRR abs/2312.03632 (2023) - 2022
- [c13]Prateeth Nayak, Takuya Higuchi, Anmol Gupta, Shivesh Ranjan, Stephen Shum, Siddharth Sigtia, Erik Marchi, Varun Lakshminarasimhan, Minsik Cho, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik:
Improving Voice Trigger Detection with Metric Learning. INTERSPEECH 2022: 1896-1900 - [i12]Prateeth Nayak, Takuya Higuchi, Anmol Gupta, Shivesh Ranjan, Stephen Shum, Siddharth Sigtia, Erik Marchi, Varun Lakshminarasimhan, Minsik Cho, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik:
Improving Voice Trigger Detection with Metric Learning. CoRR abs/2204.02455 (2022) - 2021
- [c12]Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Erik Marchi, Vineet Garg:
Progressive Voice Trigger Detection: Accuracy vs Latency. ICASSP 2021: 6843-6847 - [c11]Vineet Garg, Wonil Chang, Siddharth Sigtia, Saurabh Adya, Pramod Simha, Pranay Dighe, Chandra Dhir:
Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation. Interspeech 2021: 4209-4213 - [i11]Vineet Garg, Wonil Chang, Siddharth Sigtia, Saurabh Adya, Pramod Simha, Pranay Dighe, Chandra Dhir:
Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation. CoRR abs/2105.06598 (2021) - 2020
- [c10]Siddharth Sigtia, Erik Marchi, Sachin Kajarekar, Devang Naik, John Bridle:
Multi-Task Learning for Speaker Verification and Voice Trigger Detection. ICASSP 2020: 6844-6848 - [c9]Siddharth Sigtia, Pascal Clark, Rob Haynes, Hywel Richards, John Bridle:
Multi-Task Learning for Voice Trigger Detection. ICASSP 2020: 7449-7453 - [c8]Saurabh Adya, Vineet Garg, Siddharth Sigtia, Pramod Simha, Chandra Dhir:
Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering. INTERSPEECH 2020: 3351-3355 - [i10]Siddharth Sigtia, Pascal Clark, Rob Haynes, Hywel Richards, John Bridle:
Multi-task Learning for Voice Trigger Detection. CoRR abs/2001.09519 (2020) - [i9]Siddharth Sigtia, Erik Marchi, Sachin Kajarekar, Devang Naik, John Bridle:
Multi-task Learning for Speaker Verification and Voice Trigger Detection. CoRR abs/2001.10816 (2020) - [i8]Saurabh Adya, Vineet Garg, Siddharth Sigtia, Pramod Simha, Chandra Dhir:
Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering. CoRR abs/2008.02323 (2020) - [i7]Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Erik Marchi, Vineet Garg:
Progressive Voice Trigger Detection: Accuracy vs Latency. CoRR abs/2010.15446 (2020)
2010 – 2019
- 2018
- [c7]Erik Marchi, Stephen Shum, Kvuveon Hwang, Sachin Kajarekar, Siddharth Sigtia, Hywel Richards, Rob Haynes, Yoon Kim, John Bridle:
Generalised Discriminative Transform via Curriculum Learning for Speaker Recognition. ICASSP 2018: 5324-5328 - [c6]Siddharth Sigtia, Rob Haynes, Hywel Richards, Erik Marchi, John Bridle:
Efficient Voice Trigger Detection for Low Resource Hardware. INTERSPEECH 2018: 2092-2096 - 2017
- [b1]Siddharth Sigtia:
Neural networks for analysing music and environmental audio. Queen Mary University of London, UK, 2017 - [j3]Yong Xu, Qiang Huang, Wenwu Wang, Peter Foster, Siddharth Sigtia, Philip J. B. Jackson
, Mark D. Plumbley
:
Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1230-1241 (2017) - 2016
- [j2]Siddharth Sigtia, Emmanouil Benetos
, Simon Dixon:
An End-to-End Neural Network for Polyphonic Piano Music Transcription. IEEE ACM Trans. Audio Speech Lang. Process. 24(5): 927-939 (2016) - [j1]Siddharth Sigtia, Adam M. Stark, Sacha Krstulovic
, Mark D. Plumbley
:
Automatic Environmental Sound Recognition: Performance Versus Computational Cost. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 2096-2107 (2016) - [i6]Alexander W. Churchill, Siddharth Sigtia, Chrisantha Fernando:
Learning to Generate Genotypes with Neural Networks. CoRR abs/1604.04153 (2016) - [i5]Yong Xu, Qiang Huang, Wenwu Wang, Peter Foster, Siddharth Sigtia, Philip J. B. Jackson, Mark D. Plumbley:
Fully Deep Neural Networks Incorporating Unsupervised Feature Learning for Audio Tagging. CoRR abs/1607.03681 (2016) - [i4]Siddharth Sigtia, Adam M. Stark, Sacha Krstulovic, Mark D. Plumbley:
Automatic Environmental Sound Recognition: Performance versus Computational Cost. CoRR abs/1607.04589 (2016) - 2015
- [c5]Siddharth Sigtia, Emmanouil Benetos
, Nicolas Boulanger-Lewandowski, Tillman Weyde
, Artur S. d'Avila Garcez, Simon Dixon:
A hybrid recurrent neural network for music transcription. ICASSP 2015: 2061-2065 - [c4]Siddharth Sigtia, Nicolas Boulanger-Lewandowski, Simon Dixon:
Audio Chord Recognition with a Hybrid Recurrent Neural Network. ISMIR 2015: 127-133 - [c3]Peter Foster, Siddharth Sigtia, Sacha Krstulovic
, Jon Barker, Mark D. Plumbley
:
Chime-home: A dataset for sound source recognition in a domestic environment. WASPAA 2015: 1-5 - [i3]Siddharth Sigtia, Emmanouil Benetos, Simon Dixon:
An End-to-End Neural Network for Polyphonic Music Transcription. CoRR abs/1508.01774 (2015) - 2014
- [c2]Siddharth Sigtia, Simon Dixon:
Improved music feature learning with deep neural networks. ICASSP 2014: 6959-6963 - [c1]Siddharth Sigtia, Emmanouil Benetos, Srikanth Cherla, Tillman Weyde, Artur S. d'Avila Garcez, Simon Dixon:
An RNN-based Music Language Model for Improving Automatic Music Transcription. ISMIR 2014: 53-58 - [i2]Alexander W. Churchill, Siddharth Sigtia, Chrisantha Fernando:
A Denoising Autoencoder that Guides Stochastic Search. CoRR abs/1404.1614 (2014) - [i1]Siddharth Sigtia, Emmanouil Benetos, Nicolas Boulanger-Lewandowski, Tillman Weyde, Artur S. d'Avila Garcez, Simon Dixon:
A Hybrid Recurrent Neural Network For Music Transcription. CoRR abs/1411.1623 (2014)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-29 20:51 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint