China's Alibaba Qwen has released a competitor to OpenAI's Operator AI agent, which can control a PC and phone

Qwen, the AI division of Alibaba and DeepSeek's main domestic competitor, has released a new family of AI models, Qwen2.5-VL. These models can analyze files, understand videos, count objects in images, and even control a computer, similar to OpenAI's Operator AI agent. As expected, the models are restricted in the topics they are allowed to discuss.


According to benchmark comparisons run by the Qwen team, the best Qwen2.5-VL model outperforms OpenAI's GPT-4o, Anthropic's Claude 3.5 Sonnet, and Google's Gemini 2.0 Flash on a range of evaluations covering video understanding, mathematics, document analysis, and question answering, TechCrunch writes.

Source: TechCrunch

Qwen2.5-VL is available for testing in Alibaba’s Qwen Chat app and for download from the Hugging Face AI developer platform. It can analyze charts and graphs, extract data from scanned invoices and forms, and “understand” hours-long videos, the Qwen team says. It can also recognize “IPs [intellectual properties] from film and TV series, as well as a wide variety of products,” the team says, suggesting the models may have been trained in part on copyrighted works.

Like other AI models from Chinese developers, Qwen2.5-VL is limited in the topics it can discuss. When a TechCrunch reporter asked the largest and most powerful model in the family, Qwen2.5-VL-72B, to talk about “Xi Jinping’s mistakes,” Qwen Chat returned an error message.

One of the most interesting features of Qwen2.5-VL is its ability to interact with software on both PCs and mobile devices. A video posted on X by Philipp Schmid, a technical lead at Hugging Face, showed Qwen2.5-VL launching the Booking.com Android app and booking a flight from Chongqing to Beijing.


In another demo video, Qwen2.5-VL operates applications on a Linux desktop, but it doesn't appear to do anything other than switch tabs. Perhaps tellingly, Qwen2.5-VL scored poorly in the Qwen team's own benchmarking on OSWorld, a test that attempts to simulate a real-world computing environment.

Two smaller models in the Qwen2.5-VL series, Qwen2.5-VL-3B and Qwen2.5-VL-7B, are available under a permissive license. The flagship model, Qwen2.5-VL-72B, is covered by a custom Alibaba license, which requires companies and developers with more than 100 million monthly active users to request permission from Qwen/Alibaba before deploying the model commercially.

Recall that Chinese AI lab DeepSeek attracted a lot of attention after its chatbot rose to the top of the Apple App Store charts. The excitement triggered a drop in the stock prices of technology companies, including leading graphics processor manufacturer Nvidia, and Mark Zuckerberg rushed to announce that Meta plans to invest $60 billion in AI development in 2025.

By Monday evening, tech stocks had lost about $1 trillion in market value in the wake of progress by Chinese AI startup DeepSeek.

Previously, dev.ua published a detailed analysis of how DeepSeek managed to outperform its competitors.
