Skip to main content
info

"Informed AI News" is an AI-curated publications aggregation platform, ensuring you access only the most valuable information, with the aim of eliminating the information gap and transcending the confines of information cocoons. Find out more >>

Alibaba Cloud Launches Qwen2 Series Models, Leading the New Trend in Open-Source Large Models

Alibaba Cloud Launches Qwen2 Series Models, Leading the New Trend in Open-Source Large Models

Summary:

Alibaba Cloud has introduced the Qwen2 series of models, with Qwen2-72B being hailed as the world's most powerful open-source language model. This model has outperformed the strongest American open-source model, Llama3-70B, as well as numerous Chinese closed-source models such as Wenxin 4.0 and Doubao Pro, in multiple international authoritative evaluations. The Qwen2 series includes five size models, supports 128k long text processing, and has significantly improved capabilities in coding and mathematics. Alibaba Cloud has also publicly disclosed the technical details behind it, including the use of GQA for accelerated inference and enhanced multilingual capabilities.

Insight:

The rise of open-source models marks a new era of technology sharing and innovation. The success of Qwen2 not only demonstrates the potential of open-source models but also challenges the traditional advantages of closed-source models. This competition accelerates the rapid development of technology, lowers the barrier for AI applications, and enables more developers and businesses to utilize advanced technologies for innovation.

Explanation of Terms:

  • Open-source model: A software model where the source code is publicly available, allowing anyone to view, use, modify, and distribute it.
  • Closed-source model: A model where the source code is not publicly available and is typically controlled and maintained by a specific company or organization.
  • GQA (Grouped Query Attention): An optimization technique used to enhance the efficiency and speed of a model when processing large amounts of data.
  • Multilingual capabilities: The ability of a model to understand and generate text in multiple languages.

Full article>>