publications | Anup Singh

2025

Interspeech

Language-Agnostic Speech Tokenizer for Spoken Term Detection with Efficient Retrieval

Anup Singh, Kris Demuynck, and Vipul Arora

In Proc. Interspeech 2025, 2025

@inproceedings{singh2025language,
  title = {Language-Agnostic Speech Tokenizer for Spoken Term Detection with Efficient Retrieval},
  author = {Singh, Anup and Demuynck, Kris and Arora, Vipul},
  booktitle = {Proc. Interspeech 2025},
  pages = {2630--2634},
  year = {2025},
}

ICASSP

BEST-STD: Bidirectional Mamba-Enhanced Speech Tokenization for Spoken Term Detection

Anup Singh, Kris Demuynck, and Vipul Arora

In ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025

Bib PDF Code Poster Slides

@inproceedings{singh2025best,
  title = {BEST-STD: Bidirectional Mamba-Enhanced Speech Tokenization for Spoken Term Detection},
  author = {Singh, Anup and Demuynck, Kris and Arora, Vipul},
  booktitle = {ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  pages = {1--5},
  year = {2025},
  organization = {IEEE},
}

Under Review

Harmonic Summation-Based Robust Pitch Estimation in Noisy and Reverberant Environments

Anup Singh and Kris Demuynck

arxiv, 2025

PDF
Under Review

BEST-STD2.0: Balanced and Efficient Speech Tokenizer for Spoken Term Detection

Anup Singh, Vipul Arora, and Kris Demuynck

arxiv, 2025

PDF

2024

TASLP

FlowHash: Accelerating Audio Search with Balanced Hashing via Normalizing Flow

Anup Singh, Kris Demuynck, and Vipul Arora

IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024

Bib PDF Code

@article{singh2024flowhash,
  title = {FlowHash: Accelerating Audio Search with Balanced Hashing via Normalizing Flow},
  author = {Singh, Anup and Demuynck, Kris and Arora, Vipul},
  journal = {IEEE/ACM Transactions on Audio, Speech, and Language Processing},
  year = {2024},
  publisher = {IEEE},
}

2023

ICASSP

Simultaneously learning robust audio embeddings and balanced hash codes for query-by-example

Anup Singh, Kris Demuynck, and Vipul Arora

In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023

Bib PDF Video Code Poster Slides

@inproceedings{singh2023simultaneously,
  title = {Simultaneously learning robust audio embeddings and balanced hash codes for query-by-example},
  author = {Singh, Anup and Demuynck, Kris and Arora, Vipul},
  booktitle = {ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  pages = {1--5},
  year = {2023},
  organization = {IEEE},
}

2022

ISMIR

Attention-Based Audio Embeddings for Query-by-Example

Anup Singh, Kris Demuynck, and Vipul Arora

In 23rd International Society for Music Information Retrieval Conference, 2022

Bib PDF Video Poster

@inproceedings{ismir2022,
  title = {Attention-Based Audio Embeddings for Query-by-Example},
  author = {Singh, Anup and Demuynck, Kris and Arora, Vipul},
  booktitle = {23rd International Society for Music Information Retrieval Conference},
  year = {2022},
}