Try it now — load your own PDF or use the sample: ...
Abstract: Large multimodal models (LMM) have recently shown encouraging progress with visual instruction tuning. In this paper, we present the first systematic study to investigate the design choices ...