Introducing ConTextual: How well can your Multimodal model jointly reason over text and image in text-rich scenes?
2 years ago