Pramuditha Perera

Featured in: Favicon

Articles

Benchmarking diverse-modal entity linking with generative models

Jun 9, 2023 | amazon.science | Henry Zhu |Sheng Zhang |Pramuditha Perera

Entities can be expressed in diverse formats, such as texts, images, or column names and cell values in tables. While existing entity linking (EL) models work well on per modality configuration, such as text-only EL, visual grounding, or schema linking, it is more challenging to design a unified model for diverse modality configurations.
Generate then select: Open-ended visual question answering guided by world knowledge

Jun 9, 2023 | amazon.science | Sheng Zhang |Gukyeong Kwon |Pramuditha Perera |Henry Zhu

The open-ended Visual Question Answering (VQA) task requires AI models to jointly reason over visual and natural language inputs using world knowledge. Recently, pre-trained Language Models (PLM) such as GPT-3 have been applied to the task and shown to be powerful world knowledge sources.

Try JournoFinder For Free

Search and contact over 1M+ journalist profiles, browse 100M+ articles, and unlock powerful PR tools.