About Me
I’m a PhD student at the Language Technologies Institute of the School of Computer Science at Carnegie Mellon University. I am advised by Dr. Shinji Watanabe as a member of WAVLab.
I’m generally interested in improving speech & language processing. My current projects involve speech translation and multilingual speech recognition with end-to-end neural networks.
I also completed my master’s degree at CMU SCS where I was advised by Dr. Michael Shamos. Before that, I was a Technology Strategy Consultant at Accenture. I completed my undergraduate degree in Economics and Computer Science at The University of Chicago.
 
Updates
- May 2024: Interning at Meta FAIR with Dr. Michael Auli
- June 2023: Joining the SCALE 2023 Workshop at John’s Hopkins University
- June 2022: Joining the JSALT 2022 Workshop at John’s Hopkins University
- May 2021: Interning at Dr. Dong Yu’s AI lab at Tencent America
Selected Publications
Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking
 Brian Yan, Vineel Pratap, Shinji Watanabe, Michael Auli
 Pre-print, 2024
 paper
Improving Massively Multilingual ASR With Auxiliary CTC Objectives
 William Chen, Brian Yan, Jiatong Shi, Yifan Peng, Soumi Maiti, Shinji Watanabe
 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023
 Best student paper award at IEEE ICASSP 2023
 paper
Exploration of Efficient End-to-End ASR Using Discretized Input from Self-Supervised Learning
 Xuankai Chang, Brian Yan, Yuya Fujita, Takashi Maekaku, Shinji Watanabe
 24th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2023
 paper
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit
 Brian Yan*, Jiatong Shi*, Yun Tang, Hirofumi Inaguma, Yifan Peng, Siddharth Dalmia, Peter Polák, Patrick Fernandes, Dan Berrebbi, Tomoki Hayashi, Xiaohui Zhang, Zhaoheng Ni, Moto Hira, Soumi Maiti, Juan Pino, Shinji Watanabe
 Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL), 2023
 paper / poster
CMU’s IWSLT 2023 Simultaneous Speech Translation System
 Brian Yan*, Jiatong Shi*, Soumi Maiti, William Chen, Xinjian Li, Yifan Peng, Siddhant Arora, Shinji Watanabe
 Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT), 2023
 Winning submission to the IWSLT 2023 Simultaneous Speech-to-Speech Translation Track (English-to-German)
 paper
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
 Puyuan Peng, Brian Yan, Shinji Watanabe, David Harwath
 24th Annual Conference of the International Speech Communication Association (INTERSPEECH), 2023
 paper
CTC Alignments Improve Autoregressive Translation
 Brian Yan, Siddharth Dalmia, Yosuke Higuchi, Graham Neubig, Florian Metze, Alan W Black, Shinji Watanabe
 Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
 paper / talk / poster / TLDR
Towards Zero-Shot Code-Switched Speech Recognition
 Brian Yan, Matthew Wiesner, Ondrej Klejch, Preethi Jyothi, Shinji Watanabe
 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023
 paper / poster / TLDR
CMU’s IWSLT 2022 Dialect Speech Translation System
 Brian Yan, Patrick Fernandes, Siddharth Dalmia, Jiatong Shi, Yifan Peng, Dan Berrebbi, Xinyi Wang, Graham Neubig, Shinji Watanabe
 Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT), 2022
 Winning submission to the IWSLT 2022 Dialectal Track
 paper / talk
Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
 Brian Yan, Chunlei Zhang, Meng Yu, Shi-Xiong Zhang, Siddharth Dalmia, Dan Berrebbi, Chao Weng, Shinji Watanabe, Dong Yu
 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022
 paper / talk / poster / TLDR
My Google Scholar is more comprehensive.
Activities
Talks
Controllable and Explainable End-to-End Speech Translation
 SIG SLT Seminar, 2022
Code-Switched Modeling
 JSALT Workshop, John’s Hopkins University, 2022
Building End-to-End Speech Translation Systems
 JSALT Workshop, John’s Hopkins University, 2022
Teaching
CS 11-751: Speech Recognition and Understanding
 Teaching Assistant
 Carnegie Mellon University, Fall 2023
CS 11-700: Language Technologies Institute Colloquium
 Teaching Assistant
 Carnegie Mellon University, 2021-22 Academic Year
CS 11-737: Multilingual NLP
 Teaching Assistant
 Carnegie Mellon University, Spring 2021 DSTA Course
Academic Service
Reviewer
 ICASSP, Interspeech, ACL, EMNLP, NAACL, SLT, ASRU, APSIPA
Contact
Email: byan[at]cs.cmu.edu
