Siddharth Sigtia

2020 – today

2024
[c14]
  - https://dblp.org/rec/conf/icassp/WagnerCSGMMM24
Dominik Wagner, Alexander W. Churchill, Siddharth Sigtia, Panayiotis G. Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi:
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models. ICASSP 2024: 10451-10455
[i14]
  - https://dblp.org/rec/journals/corr/abs-2403-14438
Dominik Wagner, Alexander W. Churchill, Siddharth Sigtia, Panayiotis G. Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi:
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models. CoRR abs/2403.14438 (2024)
2023
[i13]
  - https://dblp.org/rec/journals/corr/abs-2312-03632
Dominik Wagner, Alexander W. Churchill, Siddharth Sigtia, Panayiotis G. Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi:
Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models. CoRR abs/2312.03632 (2023)
2022
[c13]
  - https://dblp.org/rec/conf/interspeech/NayakHGRSSMLCAD22
Prateeth Nayak, Takuya Higuchi, Anmol Gupta, Shivesh Ranjan, Stephen Shum, Siddharth Sigtia, Erik Marchi, Varun Lakshminarasimhan, Minsik Cho, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik:
Improving Voice Trigger Detection with Metric Learning. INTERSPEECH 2022: 1896-1900
[i12]
  - https://dblp.org/rec/journals/corr/abs-2204-02455
Prateeth Nayak, Takuya Higuchi, Anmol Gupta, Shivesh Ranjan, Stephen Shum, Siddharth Sigtia, Erik Marchi, Varun Lakshminarasimhan, Minsik Cho, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik:
Improving Voice Trigger Detection with Metric Learning. CoRR abs/2204.02455 (2022)
2021
[c12]
  - https://dblp.org/rec/conf/icassp/SigtiaBRCMG21
Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Erik Marchi, Vineet Garg:
Progressive Voice Trigger Detection: Accuracy vs Latency. ICASSP 2021: 6843-6847
[c11]
  - https://dblp.org/rec/conf/interspeech/GargCSASDD21
Vineet Garg, Wonil Chang, Siddharth Sigtia, Saurabh Adya, Pramod Simha, Pranay Dighe, Chandra Dhir:
Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation. Interspeech 2021: 4209-4213
[i11]
  - https://dblp.org/rec/journals/corr/abs-2105-06598
Vineet Garg, Wonil Chang, Siddharth Sigtia, Saurabh Adya, Pramod Simha, Pranay Dighe, Chandra Dhir:
Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation. CoRR abs/2105.06598 (2021)
2020
[c10]
  - https://dblp.org/rec/conf/icassp/SigtiaMKNB20
Siddharth Sigtia, Erik Marchi, Sachin Kajarekar, Devang Naik, John Bridle:
Multi-Task Learning for Speaker Verification and Voice Trigger Detection. ICASSP 2020: 6844-6848
[c9]
  - https://dblp.org/rec/conf/icassp/SigtiaCHRB20
Siddharth Sigtia, Pascal Clark, Rob Haynes, Hywel Richards, John Bridle:
Multi-Task Learning for Voice Trigger Detection. ICASSP 2020: 7449-7453
[c8]
  - https://dblp.org/rec/conf/interspeech/AdyaGSSD20
Saurabh Adya, Vineet Garg, Siddharth Sigtia, Pramod Simha, Chandra Dhir:
Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering. INTERSPEECH 2020: 3351-3355
[i10]
  - https://dblp.org/rec/journals/corr/abs-2001-09519
Siddharth Sigtia, Pascal Clark, Rob Haynes, Hywel Richards, John Bridle:
Multi-task Learning for Voice Trigger Detection. CoRR abs/2001.09519 (2020)
[i9]
  - https://dblp.org/rec/journals/corr/abs-2001-10816
Siddharth Sigtia, Erik Marchi, Sachin Kajarekar, Devang Naik, John Bridle:
Multi-task Learning for Speaker Verification and Voice Trigger Detection. CoRR abs/2001.10816 (2020)
[i8]
  - https://dblp.org/rec/journals/corr/abs-2008-02323
Saurabh Adya, Vineet Garg, Siddharth Sigtia, Pramod Simha, Chandra Dhir:
Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering. CoRR abs/2008.02323 (2020)
[i7]
  - https://dblp.org/rec/journals/corr/abs-2010-15446
Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Erik Marchi, Vineet Garg:
Progressive Voice Trigger Detection: Accuracy vs Latency. CoRR abs/2010.15446 (2020)

2010 – 2019

2018
[c7]
  - https://dblp.org/rec/conf/icassp/MarchiSHKSRHKB18
Erik Marchi, Stephen Shum, Kyuyeon Hwang, Sachin Kajarekar, Siddharth Sigtia, Hywel Richards, Rob Haynes, Yoon Kim, John Bridle:
Generalised Discriminative Transform via Curriculum Learning for Speaker Recognition. ICASSP 2018: 5324-5328
[c6]
  - https://dblp.org/rec/conf/interspeech/SigtiaHRMB18
Siddharth Sigtia, Rob Haynes, Hywel Richards, Erik Marchi, John Bridle:
Efficient Voice Trigger Detection for Low Resource Hardware. INTERSPEECH 2018: 2092-2096
2017
[b1]
  - https://dblp.org/rec/phd/ethos/Sigtia17
Siddharth Sigtia:
Neural networks for analysing music and environmental audio. Queen Mary University of London, UK, 2017
[j3]
  - https://dblp.org/rec/journals/taslp/XuHWFSJP17
Yong Xu, Qiang Huang, Wenwu Wang, Peter Foster, Siddharth Sigtia, Philip J. B. Jackson, Mark D. Plumbley:
Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1230-1241 (2017)
2016
[j2]
  - https://dblp.org/rec/journals/taslp/SigtiaBD16
Siddharth Sigtia, Emmanouil Benetos, Simon Dixon:
An End-to-End Neural Network for Polyphonic Piano Music Transcription. IEEE ACM Trans. Audio Speech Lang. Process. 24(5): 927-939 (2016)
[j1]
  - https://dblp.org/rec/journals/taslp/SigtiaSKP16
Siddharth Sigtia, Adam M. Stark, Sacha Krstulovic, Mark D. Plumbley:
Automatic Environmental Sound Recognition: Performance Versus Computational Cost. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 2096-2107 (2016)
[i6]
  - https://dblp.org/rec/journals/corr/ChurchillSF16
Alexander W. Churchill, Siddharth Sigtia, Chrisantha Fernando:
Learning to Generate Genotypes with Neural Networks. CoRR abs/1604.04153 (2016)
[i5]
  - https://dblp.org/rec/journals/corr/XuHWFSJP16
Yong Xu, Qiang Huang, Wenwu Wang, Peter Foster, Siddharth Sigtia, Philip J. B. Jackson, Mark D. Plumbley:
Fully Deep Neural Networks Incorporating Unsupervised Feature Learning for Audio Tagging. CoRR abs/1607.03681 (2016)
[i4]
  - https://dblp.org/rec/journals/corr/SigtiaSKP16
Siddharth Sigtia, Adam M. Stark, Sacha Krstulovic, Mark D. Plumbley:
Automatic Environmental Sound Recognition: Performance versus Computational Cost. CoRR abs/1607.04589 (2016)
2015
[c5]
  - https://dblp.org/rec/conf/icassp/SigtiaBBWGD15
Siddharth Sigtia, Emmanouil Benetos, Nicolas Boulanger-Lewandowski, Tillman Weyde, Artur S. d'Avila Garcez, Simon Dixon:
A hybrid recurrent neural network for music transcription. ICASSP 2015: 2061-2065
[c4]
  - https://dblp.org/rec/conf/ismir/SigtiaBD15
Siddharth Sigtia, Nicolas Boulanger-Lewandowski, Simon Dixon:
Audio Chord Recognition with a Hybrid Recurrent Neural Network. ISMIR 2015: 127-133
[c3]
  - https://dblp.org/rec/conf/waspaa/FosterSKBP15
Peter Foster, Siddharth Sigtia, Sacha Krstulovic, Jon Barker, Mark D. Plumbley:
Chime-home: A dataset for sound source recognition in a domestic environment. WASPAA 2015: 1-5
[i3]
  - https://dblp.org/rec/journals/corr/SigtiaBD15
Siddharth Sigtia, Emmanouil Benetos, Simon Dixon:
An End-to-End Neural Network for Polyphonic Music Transcription. CoRR abs/1508.01774 (2015)
2014
[c2]
  - https://dblp.org/rec/conf/icassp/SigtiaD14
Siddharth Sigtia, Simon Dixon:
Improved music feature learning with deep neural networks. ICASSP 2014: 6959-6963
[c1]
  - https://dblp.org/rec/conf/ismir/SigtiaBCWGD14
Siddharth Sigtia, Emmanouil Benetos, Srikanth Cherla, Tillman Weyde, Artur S. d'Avila Garcez, Simon Dixon:
An RNN-based Music Language Model for Improving Automatic Music Transcription. ISMIR 2014: 53-58
[i2]
  - https://dblp.org/rec/journals/corr/ChurchillSF14
Alexander W. Churchill, Siddharth Sigtia, Chrisantha Fernando:
A Denoising Autoencoder that Guides Stochastic Search. CoRR abs/1404.1614 (2014)
[i1]
  - https://dblp.org/rec/journals/corr/SigtiaBBWGD14
Siddharth Sigtia, Emmanouil Benetos, Nicolas Boulanger-Lewandowski, Tillman Weyde, Artur S. d'Avila Garcez, Simon Dixon:
A Hybrid Recurrent Neural Network For Music Transcription. CoRR abs/1411.1623 (2014)
