Video Infrastructure, Meta
Course title: "Optimization of video streaming - from past to future"
Ioannis Katsavounidis (IEEE SM) received M.S., EEE and Ph.D. degrees in Electrical Engineering from the University of Southern California (USC), CA, USA. His work and research focused on Signal Processing, as part of the Signal and Image Processing Institute (SIPI). He also worked with Caltech’s High Energy Physics department at the Italian National Laboratory at Gran Sasso (Laboratori Nazionali del Gran Sasso – LNGS), as an engineer for the MACRO (Monopoles, Astrophysics and Cosmic Ray Observatory) large-scale high-energy physics experiment. He was an Associate Professor at the Department of Electrical and Computer Engineering at the University of Thessaly in Volos, Greece (2008 to 2015) and a Senior Research Scientist at Netflix in Los Gatos, CA, USA (2015 to 2018). He is a Research Scientist at Meta Platforms in Menlo Park, CA, USA, supporting all video processing for the popular Facebook, Instagram and Messenger applications.
University of Campinas (Unicamp), Campinas, SP, Brazil
Anderson Rocha is a Full Professor of Artificial Intelligence and Digital Forensics at the Institute of Computing, University of Campinas (Unicamp), Brazil. He is the Head of the Artificial Intelligence Lab., Recod.ai, at Unicamp and was the Director of the Institute for the 2019-2023 term. He has served as an elected member of the IEEE Information Forensics and Security Technical Committee (IFS-TC). He is a Microsoft Research and a Google Research Faculty Fellow. He is ranked among the top 2% of research scientists worldwide, according to PlosOne/Stanford and Research.com studies. Finally, he is now a LinkedIn Top Voice in Artificial Intelligence for continuously raising awareness of AI and its potential impacts on society at large.
Technische Universität München, Germany
Eckehard Steinbach is currently a Full Professor for Media Technology. His current research interests are in the area of audio-visual-haptic information processing and communication as well as networked and interactive multimedia systems. In March 2005, Prof. Steinbach was appointed as a guest professor at the Chinesisch-Deutschen Hochschulkolleg (CDHK) at Tongji University in Shanghai. Prof. Steinbach and his team have received several best paper, best student paper and best poster awards for their work. Prof. Steinbach is the recipient of the 2011 “Forschungspreis Technische Kommunikation” of the Alcatel-Lucent Foundation. He was elected Fellow of the IEEE in 2015 for his contributions to visual and haptic communications.
University of Trento, Italy
Giulia Boato is a Full Professor at the Department of Information Engineering and Computer Science (DISI) of the University of Trento (Italy). In 2006 she was a visiting researcher at the Signal Theory and Communications Department of the University of Vigo (Spain). Since 2006 she has been collaborating with the Signal Processing Department of the Tampere University of Technology (Finland), in particular with Prof. Karen Egiazarian. Since 2009 she has been working with Prof. Hany Farid of Dartmouth College (USA) on digital image and video forensics techniques.
Information Technologies Institute (ITI-CERTH), Greece
Course title: "Foundation models for video understanding tasks"
Vasileios Mezaris is a Greek researcher and expert in multimedia understanding and artificial intelligence. He is currently a Research Director (Senior Researcher Grade A) at the Information Technologies Institute (ITI) of the Centre for Research and Technology Hellas (CERTH) in Thessaloniki, Greece, where he also leads the Intelligent Digital Transformation Laboratory. His research focuses on areas such as image and video analysis and annotation, machine learning and deep learning for multimedia understanding, big data analytics, multimedia indexing and retrieval, as well as explainable and green AI. He has served in editorial roles for major journals including IEEE Signal Processing Letters and the IEEE Transactions on Multimedia. Mezaris has participated in numerous research projects, often as coordinator or principal investigator. He is also a Senior Member of the IEEE.
University College London (UCL), UK
Course title: "Graph signal processing toward graph generative models"
Laura Toni is a Professor in the Department of Electronic and Electrical Engineering at University College London (UCL). She specializes in large-scale signal processing, machine learning, reinforcement learning, and multimedia systems. Toni is a Turing Fellow at the Alan Turing Institute and a member of the European Laboratory for Learning and Intelligent Systems. She earned her Ph.D. in electrical engineering from the University of Bologna and held postdoctoral positions at the University of California, San Diego and École Polytechnique Fédérale de Lausanne. Her work includes research on online adaptive strategies, graph processing, and dynamic network decision-making. She has (co-)authored numerous publications and serves in leadership roles for international conferences and journals.
Concordia University, Université de Montréal, Canada
Course title: "Compact audio and speech representations"
Mirco Ravanelli is a Professor in the Department of Computer Science and Software Engineering at Concordia University. He works on deep learning for sequence processing and Conversational AI. He is also an Adjunct Professor at Université de Montréal and an Associate Member of the Mila – Quebec AI Institute, where he was a postdoctoral researcher under Yoshua Bengio. Ravanelli specializes in speech processing, machine learning, and representation learning. He is the creator and leader of SpeechBrain, an open-source toolkit for speech and conversational AI. His work has been recognized with awards such as the 2022 Amazon Research Award. His research contributes to advancing conversational systems and speech technologies.
FBK Trento, Italy, and formerly Amazon Inc.
Course title part 2: "Compact audio and speech representations"
Maurizio Omologo has worked in speech recognition, artificial intelligence, and microphone array signal processing. His research spans robust speech recognition, deep learning for far-field audio, and acoustic signal enhancement. He has authored and co-authored many influential publications in automatic speech recognition and related fields. He was historically affiliated with Fondazione Bruno Kessler (FBK) in Italy, where he worked on distant-speech interaction projects. He has collaborated on projects involving transformer-based and neural speech models. He has also been involved in the development of practical corpora for speech data analysis. His work bridges speech technology and machine learning, influencing both theory and applications. Maurizio Omologo has worked at Amazon as a Principal Applied Scientist, particularly associated with Amazon Alexa / speech technology research.
École des Ponts ParisTech (ENPC), France
Gül Varol is a computer vision researcher and Associate Professor (permanent researcher) at École des Ponts ParisTech in France. Her work focuses on vision-and-language research, video understanding, human motion synthesis, and sign language analysis. She previously worked as a postdoctoral researcher at the University of Oxford (Visual Geometry Group) and received her PhD from Inria Paris and École Normale Supérieure (ENS).