Visionllm: Large language model is also an open-ended decoder for vision-centric tasks

Publication
Thirty-seventh Annual Conference on Neural Information Processing Systems (NeurIPS) 2023

Related