![]() |
市場調査レポート
商品コード
1694624
Li Autoの電化、接続性、インテリジェンス、シェアリングにおけるレイアウトの分析(2024年~2025年)Analysis on Li Auto's Layout in Electrification, Connectivity, Intelligence and Sharing, 2024-2025 |
||||||
|
Li Autoの電化、接続性、インテリジェンス、シェアリングにおけるレイアウトの分析(2024年~2025年) |
出版日: 2025年03月02日
発行: ResearchInChina
ページ情報: 英文 270 Pages
納期: 即日から翌営業日
|
Mind GPT:自動車AIの「スーパーブレイン」
Li Xiangは、Mind GPTをLi AutoのAI戦略の中核と位置づけています。2025年1月現在、Mind GPTは2023年12月以来、何度もイテレーションを繰り返し、50万台を超える車両に搭載され、毎日1億2,000万件の対話要求を処理し、98.7%の精度でタスクを実行しています。Li Xiangは、Mind GPTを今後数年間で業界のトップ3にランクインさせるという目標を提案しました。
2025年1月16日、Li Autoは正式にOTA 7.0の配信を開始しました。最大の目玉は、Lixiang TongxueのMind GPTがMind GPT-3oにアップグレードされたことです。最初のバージョンが登場して以来、Li AutoのMind GPTは何度も改良を重ね、現在では知覚から認知、能動的な表現に至るまで、インテリジェントなエージェントを誇っています。
2023年4月、Mind GPT 1.0がリリースされた
2023年12月、Mind GPT 1.0がOTA 5.0対応車に搭載され、国家基盤モデル登録に登録されました。これは自動車用大型言語モデルとしてはもっとも早い時期のものです。
2024年8月、言語理解、Q&A、論理的推論を特徴とするMind GPT 2.0が発表されました。モデルアーキテクチャはMixture of Experts(MoE)とTransformerで構成され、モデルサイズが倍増しました。
2025年1月、Mind GPT-3oがOTA 7.0を通じて配信されました。これは、聞く、見る、記憶すること(Face IDやファミリーアカウントと連携し、家族の選好や要求を記憶すること)、上手に推論すること(複雑な問題を理解し、分解することができ、知覚や思考の過程をアニメーションで直感的に表示すること、すなわちLixiang Tongxue Workflow)、表現すること(より人間的な声を持ち、数十種類の助詞をサポートし、より口語的な会話を行うこと)、300以上のツール(交通規制の問い合わせ、Meituanなど)を使うことができます。
強力なマルチモーダルエンドツーエンド基盤モデルとして、Mind GPT-3oは音声、視覚、言語などの複数のモダリティを理解し、数百ミリ秒でフィードバックを提供することができます。外部情報を正確に認識し、深く理解し、単一のアーキテクチャ内で自然かつ正確に表現し、完全で首尾一貫した知的処理システムを構築することができます。Mind GPT-3oによって、Lixiang Tongxueはインテリジェントインタラクションなどで質的な飛躍を遂げました。Lixiang Tongxueは記憶、計画、道具、表現能力などで総合的に向上しています。タスクをこなし、認知力を向上させ、車内の乗客に情緒的な同伴を提供することができます。
知的な対話体験という点では、Lixiang Tongxueは優れた知覚と豊富な知識を持っています。コックピットの外でも、さまざまな場所に関する質問に正確に答えることができ、動物、植物、車、絵画などを鋭敏に感知し、周囲の建物や地理情報まで知ることができます。
同時に、Lixiang Tongxueはスマートなアシスタントであるだけでなく、思いやりのある家族の一員でもあります。あなたとあなたの家族を正確に識別し、全員の選好や特別な要求を記憶し、あらゆる旅行をより簡単で便利にし、あなたを本当に中心とすることができます。
日常生活では、交通規制の情報を素早くチェックし、スケジュールを明確に提示し、選好やニーズに応じてMeituanで質の高いレストランを絞り込むことができます。それだけでなく、地元で人気のイベント情報を適時に提供することもできます。
コックピットの進化:「機能の積み重ね」から「能動的な先見性」へ
Mind GPT-3oのパワーアップにより、スマートスペースはOTA 7.0を通じて全面的な進化を遂げました。新たにアップグレードされたRGB+IRビジュアルモジュールと豊富なマルチモーダル情報入力により、Lixiang Tongxueはユーザーの指示を理解するだけでなく、車内の状況も把握できます。この機能により、Lixiang Tongxueは利用者の意図をよりよく理解することができます。例えば、車内の乗客がある観光スポットについて話しているとき、Lixiang Tongxueは視覚認識と音声分析を通じて、観光スポットの搭載、ナビゲーションなどの関連情報をユーザーに素早く提供することができます。
当レポートでは、Li Autoについて調査分析し、電化、接続性、インテリジェンス、シェアリングにおける同社の戦略や開発動向などを分析しています。
Mind GPT: The "super brain" of automotive AI
Li Xiang regards Mind GPT as the core of Li Auto's AI strategy. As of January 2025, Mind GPT had undergone multiple iterations since December 2023, been installed in more than 500,000 vehicles, processed 120 million interaction requests daily, and performed tasks with the accuracy of 98.7%. Li Xiang proposed the goal of making Mind GPT rank amog the top three in the industry in the next few years.
On January 16, 2025, Li Auto officially started pushing OTA 7.0. The biggest highlight is that Mind GPT in "Lixiang Tongxue" was upgraded to Mind GPT-3o. Since the first version came out, Li Auto's Mind GPT has undergone multiple iterations and now boasts an intelligent agent from perception to cognition to active expression.
In April 2023, Mind GPT 1.0 was released;
In December 2023, Mind GPT 1.0 landed on vehicles with OTA 5.0 and was listed in the national foundation model registration. It is one of the earliest automotive large language models;
In August 2024, Mind GPT 2.0 was launched, featuring language understanding, Q&A, and logical reasoning. The model architecture is consisted of Mixture of Experts (MoE) and Transformer, doubling the model size;
In January 2025, Mind GPT-3o was pushed via OTA 7.0. It can listen, see and remember (connected with Face ID and family accounts, it can remember the preferences and requirements of family members), excel in reasoning (it can understand and dismantle complex problems, and intuitively display the process of perception and thinking through animation, that is, "Lixiang Tongxue Workflow"), express (it has a more human voice and supports dozens of modal particles; it can conduct more colloquial conversations; it is versatile and can sing and imitate animal sounds), and use 300+ tools (such as traffic restriction inquiry, Meituan, etc.).
As a powerful multi-modal end-to-end foundation model, Mind GPT-3o can understand multiple modalities such as speech, vision, and language, and provide feedback in hundreds of milliseconds. It can accurately perceive external information, deeply understand and express it naturally and accurately within a single architecture, building a complete and coherent intelligent processing system. Empowered by Mind GPT-3o, "Lixiang Tongxue" has achieved a qualitative leap in intelligent interaction and other aspects. "Lixiang Tongxue" has been comprehensively improved in terms of memory, planning, tools, and expression skills. It can complete tasks, improve cognition, and provide emotional companionship for passengers in vehicles.
In terms of intelligent interactive experience, "Lixiang Tongxue" has excellent perception and a wealth of knowledge. Even outside the cockpit, it can accurately answer questions related to various locations, and can keenly perceive animals, plants, cars, paintings, etc., and even know the surrounding buildings and geographical information.
At the same time, "Lixiang Tongxue" is not only a smart assistant, but also a considerate family member. It can accurately identify you and your family, remember everyone's preferences and special requirements, make every trip easier and more convenient, and truly make you the center.
In daily life, it can quickly check information about traffic restrictions, clearly present schedules, and filter high-quality restaurants on Meituan according to your tastes and needs. Not only that, it can also provide timely information on popular local events.
Cockpit evolution: from "function stacking" to "active foresight"
With the empowerment of Mind GPT-3o, the smart space has achieved all-round evolution via OTA 7.0. The newly upgraded RGB+IR visual module and rich multimodal information input allow "Lixiang Tongxue" to not only understand the instructions of users, but also see the situation in vehicles. The function enables "Lixiang Tongxue" to better understand the intentions of users. For example, when passengers in the vehicle are discussing a certain scenic spot, "Lixiang Tongxue" can quickly provide users with relevant information about the scenic spot, including introductions, navigation, etc. through visual recognition and voice analysis.
With the powerful cognitive capabilities of Mind GPT, "Lixiang Tongxue" can also become a 24-hour travel assistant, car assistant and entertainment assistant for the family. It can plan the best route in advance according to the user's daily travel habits and remind traffic information in real time. In terms of car use, AI Task Master adds an individual task, such as "Close the sunshade when parking for a while." It also supports extended services, such as "Turn on the front air conditioner for ten minutes."
For entertainment, "Lixiang Tongxue" can recommend suitable music, movies and other entertainment content according to the user's preferences. When the user wants to listen to a certain type of music, "Lixiang Tongxue" can quickly select songs that suit the user's taste from the massive music library and play them.
The "Lixiang Tongxue" APP goes online, allowing Lixiang AI to spread from IVI to mobile phones, homes and other scenarios, providing general AI services such as Q&A, visual recognition (like menus, animals and plants).
Advancement of Mind GPT-3o
The advancement of Mind GPT-3o stems from Li Auto's full-stack independent R&D, scenario-based deep customization, hybrid deployment architecture, and ecological collaboration.
The "multimodal end-to-end integration" architecture is the most distinctive feature of Mind GPT-3o. Unlike traditional automotive foundation models that rely on modular stacking (such as independent speech recognition and image processing modules), Mind GPT-3o achieves deep integration of speech, vision, and language understanding through a single model. The complete link from perception to cognition to expression is fulfilled in a closed loop in one model. This design greatly reduces system latency (response in hundreds of milliseconds) and reduces information loss caused by multi-module collaboration.
Mind GPT-3o is trained on the basis of 3 trillion tokens of diverse data, covering multiple dimensions such as user habits, road scenarios, and voice interactions, far exceeding the industry average. OTA updates happen frequently. 17 iterations were completed via OTA in 2024, with an average cycle of 19 days. It responds quickly to user feedback and continuously optimizes functions (such as RedNote content call, multi-modal instruction integration), forming a closed loop of "user feedback - model optimization - experience upgrade".
End-to-end architecture collaboration: Mind GPT-3o is deeply coupled with the smart driving system. For example, in scenarios such as highway toll stations and roundabout traffic, vision language models (VLMs) are used to assist end-to-end model decision-making to achieve anthropomorphic driving (such as automatically selecting ETC lanes and dynamically overtaking). However, the cockpits and smart driving systems of traditional OEMs are mostly independent modules with weak collaboration.
All-scenario coverage: It supports "D2D" navigation, AI inference visualization and other functions. Combined with 2.9 billion kilometers of intelligent driving data, it forms a complete closed loop from perception to decision-making, and has better mobility ecosystem integration capabilities than competitors that only focus on cockpit interaction.
Technical path: Li Auto is the first OEM that launched a fully self-developed multi-modal cognitive model. It uses the self-developed Taskformer neural network architecture to achieve unified feature representation of multi-modal data such as voice, vision, and text, avoiding system fragmentation that relies on third-party models.
Scenario focus: It is deeply optimized for the automotive environment, covering 111 fields and more than 1,000 exclusive capabilities (such as semantic understanding, multilingual communication, fuzzy perception, etc.), especially in spatial command execution in family scenarios (such as air-to-air control of the rear screen, voiceprint recognition) and personalized services (such as children's mode, holiday greetings).
Computing efficiency: Through the inference load distribution of cloud GPT and edge NPU, its dependence on hardware computing power is reduced, so that old vehicle models can also run foundation models smoothly. Compared with the edge-only deployment of some competitors (relying on 8295 and Orin), it has stronger compatibility.
Balance between privacy and performance: Critical tasks (such as navigation and vehicle control) are processed by the edge to ensure privacy, and complex tasks (such as Q&A, entertainment) call cloud computing power to improve the experience, taking into account both efficiency and security.
The breakthrough of Mind GPT-3o is to upgrade the foundation model from a "question-answering tool" to a "task planning hub". Its core capabilities include:
Complex problem dismantling: For example, if a user puts forward a vague requirement like "a family outing on the weekend", the model can automatically decompose it into sub-tasks such as route planning, attraction recommendation, diner reservations, weather, etc., and coordinate the automotive system with external APIs (such as Meituan and AutoNavi) to complete these sub-tasks.
Tool chain integration: It has 20+ built-in vertical scenario tools such as traffic restriction query, calendar management, fault diagnosis, etc., and supports third-party service expansion.