Олександр Кузьменко AI Eng 18 June 2025, 10:55

Chinese startup MiniMax releases M1 AI model that outperforms GPT-4o in token count and DeepSeek R1 in efficiency

Chinese startup MiniMax, known for its Hailuo AI, has released a new large language model, MiniMax-M1. It has 1 million input tokens (80,000 output), making it one of the largest AI models by this metric.

What is a «context window» and tokens?

A «context window» in large language models (LLMs) refers to the maximum number of tokens that a model can process at one time. Tokens are basic units of text that can include whole words, parts of words, punctuation marks, or code characters. These tokens are converted into numeric vectors that the model uses to represent and manipulate values using its parameters (weights and biases).

M1 is available in the open source community of artificial intelligence code sharing Hugging Face and in the code sharing community GitHub under the Apache 2.0 license. This means that companies can use it for commercial purposes and modify it as they see fit without restrictions or payment.

M1 is trained using reinforcement learning using an innovative, inventive, and highly efficient technique. The model is trained using a hybrid Mixture-of-Experts (MoE) architecture with a lightning-fast attention mechanism designed to reduce inference costs.

According to benchmark data, MiniMax-M1 consumes only 25% of the floating point operations (FLOPs) required by DeepSeek R1 at a generation length of 100,000 tokens. M1 also competes with OpenAI o3, Gemini 2.5 Pro, Claude 4 Opus, DeepSeek R1, DeepSeek R1-0528, and Qwen3-235B in various benchmarks (AIME 2024, LiveCodeBench, SWE-bench Verified, Tau-bench, and MRCR), where it outperforms in some metrics and lags in others.

Day 1/5 of #MiniMaxWeek: We’re open-sourcing MiniMax-M1, our latest LLM — setting new standards in long-context reasoning.

— World’s longest context window: 1M-token input, 80k-token output
— State-of-the-art agent use among open-source models
— RL at unmatched efficiency:… pic.twitter.com/bGfDlZA54n
— MiniMax (official) (@MiniMax__AI) June 16, 2025

MiniMax has made it clear in its posts that it is trying to displace DeepSeek as China’s leading AI player. It was founded in late 2021 and is backed by investors including Alibaba and Tencent.

This winter, the startup launched three new AI models: MiniMax-Text-01 for text only, MiniMax-VL-01 for image and text recognition, and T2A-01-HD for generating sound, including speech. The developers claim that all of them are better than AI models from Google and Anthropic.

Read the country's main IT news in our Telegram

Chinese AI startup MiniMax has introduced three new artificial intelligence models: will they be able to compete with Western counterparts?

DeepSeek has updated its R1 AI model — it has become even more powerful in programming and love for the Chinese Communist Party

Chinese programmers travel with AI in suitcases to bypass US chip restrictions

Leave a comment

Text: Олександр Кузьменко Photo: GIGAZINE Source: VentureBeat Tags: ai, china, startup

Found an error in the text? Highlight it and press Ctrl+Enter. Found an error in the text? Highlight it and press the 'Report an error' button.

Розміщення реклами

Advertising Placement

Roosh запускає нову освітню платформу AI HOUSE CLUB для ML/AI-спеціалістів та дата сайнтистів. Розповідаємо, як подати заявку та чому навчатимуть

Як нейромережі бачать вільну та незалежну Україну? Тест dev.ua

Нейронні мережі для генерації зображень бачать світ по-своєму, їхню логіку зрозуміти часом зовсім неможливо. Але таки хочеться. На честь Дня Незалежності України редакція dev.ua вирішила провести невеликий експеримент. Ми задали чотирьом різним нейронним мережам п’ять однакових запитів: «прапор України», «День Незалежності України», «український Крим», «перемога України» та «українці». Отриманими результатами ми ділимося з вами нижче.

У TikTok тепер можна генерувати фон за допомогою нейромережі. Ми протестували її та ділимося результатами

У TikTok з’явилася нова функція «Розумний фон». З її допомогою як фон для тіктоків можна підставляти згенеровані нейромережею зображення. Редакція dev.ua протестувала цю технологію і ділиться своїми враженнями.

1 comment

Які IT-спеціальності будуть потрібні в найближчі п'ять років? Ми з'ясували у голови американського стартапу ADAM Дениса Гурака

Have important news to share? Message our Telegram bot

Key events and useful links in our Telegram channel

No comments yet.

Sign in to leave a comment