Selected Publications

Fucci, D., Gaido, M., Savoldi, B., Negri, M., Cettolo, M., & Bentivogli, L. (2026). SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation. Transactions of the Association for Computational Linguistics.
Lam*, T., Gaido*, M., Papi, S., Bentivogli, L., & Haddow, B. (2025). Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison. Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers).
Gaido, M., Papi, S., Bentivogli, L., Brutti, A., Cettolo, M., Gretter, R., Matassoni, M., Nabih, M., & Negri, M. (2024). MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing.
Papi*, S., Gaido*, M., Pilzer, A., & Negri, M. (2024). When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).
Gaido, M., Papi, S., Negri, M., & Bentivogli, L. (2024). Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).
Gaido, M., Tang, Y., Kulikov, I., Huang, R., Gong, H., & Inaguma, H. (2023). Named Entity Detection and Injection for Direct Speech Translation. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Gaido, M., Rodríguez, S., Negri, M., Bentivogli, L., & Turchi, M. (2021). Is “moby dick” a Whale or a Bird? Named Entities and Terminology in Speech Translation. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing.
Gaido, M., Cettolo, M., Negri, M., & Turchi, M. (2021). CTC-based Compression for Direct Speech Translation. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume.
Gaido, M., Di Gangi, M., Negri, M., & Turchi, M. (2020). End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT2020. Proceedings of the 17th International Conference on Spoken Language Translation.
Gaido, M., Savoldi, B., Bentivogli, L., Negri, M., & Turchi, M. (2020). Breeding Gender-aware Direct Speech Translation Systems. Proceedings of the 28th International Conference on Computational Linguistics.

You can also find the full list of my articles on my Google Scholar profile.

Marco Gaido

Selected Publications