publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2025

  1. Gender Bias in Instruction-Guided Speech Synthesis Models
    Chun-Yi Kuan , and Hung-yi Lee
    2025
  2. Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning
    Chun-Yi Kuan , and Hung-yi Lee
    In ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , 2025

2024

  1. Dynamic-superb phase-2: A collaboratively expanding benchmark for measuring the capabilities of spoken language models with 180 tasks
    Chien-yu Huang , Wei-Chih Chen , Shu-wen Yang , and 8 more authors
    arXiv preprint arXiv:2411.05361, 2024
  2. Building a Taiwanese Mandarin Spoken Language Model: A First Attempt
    Chih-Kai Yang , Yu-Kuan Fu , Chen-An Li , and 8 more authors
    arXiv preprint arXiv:2411.07111, 2024
  3. Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation
    Chun-Yi Kuan , Chih-Kai Yang , Wei-Ping Huang , and 2 more authors
    2024
  4. Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course
    Cheng-Han Chiang , Wei-Chih Chen , Chun-Yi Kuan , and 2 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing , Nov 2024
  5. Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models
    Yi-Cheng Lin , Tzu-Quan Lin , Chih-Kai Yang , and 4 more authors
    Nov 2024
  6. Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models
    Chun-Yi Kuan , Wei-Ping Huang , and Hung-yi Lee
    Nov 2024
  7. Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech
    Chien-yu Huang , Ke-Han Lu , Shih-Heng Wang , and 8 more authors
    In ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , Nov 2024

2023

  1. Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision
    Chih-Kai Yang , Kuan-Po Huang , Ke-Han Lu , and 3 more authors
    Nov 2023
  2. Towards General-Purpose Text-Instruction-Guided Voice Conversion
    Chun-Yi Kuan , Chen-An Li , Tsu-Yuan Hsu , and 5 more authors
    In 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) , Nov 2023