On August 25, Alibaba Cloud launched an open-source Large Vision Language Model (LVLM) named Qwen-VL. The LVLM is based on Alibaba Cloud’s 7 billion parameter foundational language model Qwen-7B. In addition to capabilities such as image-text recognition, description, and question answering, Qwen-VL introduces new features including visual location recognition and image-text comprehension, the company said in a statement. These functions enable the model to identify locations in pictures and to provide users with guidance based on the information extracted from images, the firm added. The model can be applied in various scenarios including image and document-based question answering, image caption generation, and fine-grained visual recognition. Currently, both Qwen-VL and its visual AI assistant Qwen-VL-Chat are available for free and commercial use on Alibaba’s “Model as a Service” platform ModelScope. [Alibaba Cloud statement, in Chinese]