Marco Baroni received a PhD in Linguistics from the University of California, Los Angeles, in the year 2000. After several experiences in research and industry, he joined the Center for Mind/Brain Sciences of the University of Trento, where he became associate professor in 2013. In 2016, Marco joined the Facebook Artificial Intelligence Research team. In 2019, he became ICREA research professor, affiliated with the Linguistics Department of Pompeu Fabra University in Barcelona. Marco's work in the areas of multimodal and compositional distributed semantics has received widespread recognition, including a Google Research Award, an ERC Starting Grant, and the ICAI-JAIR best paper prize and the ACL test of time award. Marco's current research focuses on universal linguistic and visual representations in deep-learning-trained artificial neural networks such as large language models. In 2021, he was awarded an ERC Advanced Grant to work on related topics.
Research interests
While deep-learning-based artificial neural networks have revolutionized science, engineering and our daily life, we still know surprisingly little about how they work. Indeed, they can sometimes behave in completely unexpected ways, exposing weaknesses that can be used for harmful purposes. My current interest lies in opening the black box of modern neural networks, specifically in the domain of language, where I study large language models such as ChatGPT. I am focusing on two main approaches to this challenge. On the one hand, I am studying how large language models react to inputs that are outside their training distribution. On the other, I use tools from probability, linear algebra and information theory to measure the complexity of the internal representations of large language models.
Selected publications
- Boleda G & Baroni M et al. 2025 'Not a nuisance but a useful heuristic: Outlier dimensions favor frequent tokens in language models', Proceedings of the 8th BlackBoxNLP workshop: analyzing and interpreting neural networks for NLP, pp 109-136.
- Cheng E, Doimo D, Kervadec C, Macocco I, Yu J, Laio A. & Baroni M 2025, 'Emergence of a high-dimensional abstraction phase in language transformers', Proceedings of ICLR 2025 (International Conference on Learning Representations).
- Mahaut M, Franzon F, Dessì R & Baroni M 2025, 'Referential communication in heterogeneous communities of pre-trained visual deep networks', Transactions on Machine Learning Research.
- Ginn Nielsen B, Macocco I & Baroni M 2025. 'Prediction hubs are context-informed frequent tokens in LLMs.' Proceedings of ACL 2025 (63d Annual Meeting of the Association for Computational Linguistics), pp 23715-23745.
- Rakotonirina NC, Kervadec C, Baroni FFM & Baroni M 2025, 'Evil twins are not that evil: Qualitative insights into machine-generated prompts', Proceedings of the 8th blackboxnlp workshop: analyzing and interpreting neural networks for nlp, 48 - 68 - PII 48-68.
Selected research activities
- Co-organized the COLT Symposium on Emergent Features of Language in Minds and Machines
- Member of the ERC Consolidator Grant SH4 Panel (The Human Mind and Its Complexity)
- Provided a "signed definition" for the entry "intelligenza" ("intelligence") to the Vocabolario de la Lingua Italiana Zingarelli