Selected Publications

  1. Fucci, D., Gaido, M., Savoldi, B., Negri, M., Cettolo, M., & Bentivogli, L. (2026). SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation. Transactions of the Association for Computational Linguistics.
  2. Lam*, T., Gaido*, M., Papi, S., Bentivogli, L., & Haddow, B. (2025). Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison. Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers).
  3. Gaido, M., Papi, S., Bentivogli, L., Brutti, A., Cettolo, M., Gretter, R., Matassoni, M., Nabih, M., & Negri, M. (2024). MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing.
  4. Papi*, S., Gaido*, M., Pilzer, A., & Negri, M. (2024). When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).
  5. Gaido, M., Papi, S., Negri, M., & Bentivogli, L. (2024). Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).
  6. Gaido, M., Tang, Y., Kulikov, I., Huang, R., Gong, H., & Inaguma, H. (2023). Named Entity Detection and Injection for Direct Speech Translation. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
  7. Gaido, M., Rodríguez, S., Negri, M., Bentivogli, L., & Turchi, M. (2021). Is “moby dick” a Whale or a Bird? Named Entities and Terminology in Speech Translation. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing.
  8. Gaido, M., Cettolo, M., Negri, M., & Turchi, M. (2021). CTC-based Compression for Direct Speech Translation. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume.
  9. Gaido, M., Di Gangi, M., Negri, M., & Turchi, M. (2020). End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT2020. Proceedings of the 17th International Conference on Spoken Language Translation.
  10. Gaido, M., Savoldi, B., Bentivogli, L., Negri, M., & Turchi, M. (2020). Breeding Gender-aware Direct Speech Translation Systems. Proceedings of the 28th International Conference on Computational Linguistics.

You can also find the full list of my articles on my Google Scholar profile.