Pavlo Molchanov

Articles

  • May 3, 2024 | developer.nvidia.com | Yao Lu | Hongxu Yin | Ji Lin | Pavlo Molchanov

    Visual language models have evolved significantly recently. However, existing models typically support only a single image: they cannot reason across multiple images, support in-context learning, or understand videos. They are also not optimized for inference speed. We developed VILA, a visual language model with a holistic pretraining, instruction tuning, and deployment pipeline that helps our NVIDIA clients succeed in their multi-modal products.
