Tackling multiple tasks with a single visual language model

28 April 2022

We introduce Flamingo, a single visual language model (VLM) that sets a new state of the art in few-shot learning on a wide range of open-ended multimodal tasks.