Exploring the intersection of vision and language to create richer text representations.
This project focuses on developing text embeddings that are grounded in visual information. More details about the methodology, experiments, and results will be added soon. The poster on the right provides a visual overview of the project.