publications

2025

  1. Interspeech
    Language-Agnostic Speech Tokenizer for Spoken Term Detection with Efficient Retrieval
    Anup Singh, Kris Demuynck, and Vipul Arora
    In Proc. Interspeech 2025, 2025
  2. ICASSP
    BEST-STD: Bidirectional Mamba-Enhanced Speech Tokenization for Spoken Term Detection
    Anup Singh, Kris Demuynck, and Vipul Arora
    In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025
  3. Under Review
    Harmonic Summation-Based Robust Pitch Estimation in Noisy and Reverberant Environments
    Anup Singh and Kris Demuynck
    arXiv, 2025
  4. Under Review
    BEST-STD2.0: Balanced and Efficient Speech Tokenizer for Spoken Term Detection
    Anup Singh, Vipul Arora, and Kris Demuynck
    arXiv, 2025

2024

  1. TASLP
    FlowHash: Accelerating Audio Search with Balanced Hashing via Normalizing Flow
    Anup Singh, Kris Demuynck, and Vipul Arora
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024

2023

  1. ICASSP
    Simultaneously learning robust audio embeddings and balanced hash codes for query-by-example
    Anup Singh, Kris Demuynck, and Vipul Arora
    In ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023

2022

  1. ISMIR
    Attention-Based Audio Embeddings for Query-by-Example
    Anup Singh, Kris Demuynck, and Vipul Arora
    In Proc. of the 23rd International Society for Music Information Retrieval Conference, 2022