Market Research Report
Product code: 1400759

Automotive Voice Industry Report, 2023-2024

Published: | Publisher: ResearchInChina | English, 180 pages | Delivery: within 1-2 business days at the earliest

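The automotive voice interaction market has the following characteristics:

1. In the OEM market, 46 brands fitted automotive voice as standard in 2023

From 2019 through January-September 2023, both the number of vehicles equipped with automotive voice and the installation rate rose. In the first three quarters of 2023, nearly 12 million vehicles were pre-installed with automotive voice, an installation rate of nearly 80%.

In 2023, 46 passenger car brands reached a 100% automotive voice installation rate, including AITO, Avatr, HiPhi, Rising Auto, ZEEKR, Voyah, Li Auto, Lynk & Co, Tank, NIO, and Xpeng. In 2023, more than 20 million vehicles are equipped with automotive voice, an installation rate above 80%.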

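2. Automakers' in-house voice development is reshaping the voice supply chain

OEMs' differentiated demand for intelligent automotive voice, together with their preference for in-house development, lets Tier 2 vendors in the conventional voice supply chain work directly with OEMs. The boundaries between the upstream, midstream, and downstream of the industry chain are blurring. For example, automakers such as GWM, ZEEKR, and Wuling cooperate directly with AISpeech to raise both the installation rate and the intelligence level of their voice functions.

As industry chain relationships change, the competitive landscape of automotive voice changes with them. By installations from January to September 2023, AISpeech ranked third, supporting more than 150 models from over 30 automakers.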

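3. See-and-speak becomes standard, and advanced functions such as parallel instruction, cross-sound-zone inheritance, offline voice, and out-of-vehicle voice reach production cars

The previous edition of this analysis found that see-and-speak was offered only by some emerging carmakers and leading Chinese independent brands, that the longest continuous conversation lasted only 90 seconds, and that dual-sound-zone recognition was still the mainstream solution.

In 2023, see-and-speak has become standard on emerging carmakers' flagship models, with continuous dialogue of up to 120 seconds. Xpeng Motor has also introduced the "Full-time Dialogue at Driver's Seat" function (once it is turned on, the driver can see and speak while looking at the center console screen, without first waking up the on-screen content). Meanwhile, four-sound-zone recognition has become the new mainstream solution, and Li Auto and Xpeng Motor have also introduced six-sound-zone recognition solutions.

In addition, more advanced voice functions became available on cars in 2023.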

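4. Voice interaction: the first stop for foundation models entering vehicles in intelligent cockpit scenarios

With the ChatGPT boom, the underlying foundation model technology is spreading rapidly from AI to other sectors. In 2023, foundation models gained pace in the automotive industry, and quite a few automakers are exploring ways to apply them in intelligent cockpit, intelligent driving, and other scenarios.

In intelligent cockpit scenarios, voice interaction is the first route by which foundation models enter vehicles. In February 2023, Baidu released ERNIE Bot, a Chinese counterpart to ChatGPT, and brands such as GWM, Geely, and Voyah followed. In April 2023, Alibaba disclosed that the AliOS intelligent vehicle operating system had been connected to the Tongyi Qianwen foundation model for testing and would later be applied by IM Motors. In Huawei HarmonyOS 4.0, the intelligent assistant Xiaoyi was connected to the Pangu model for the first time, mainly to improve intelligent interaction, scenario arrangement, language understanding, productivity, and personalized service.

This report analyzes the global and Chinese automotive voice market and industry. It outlines the technology landscape, the basic market structure, how OEMs develop and apply automotive voice, and the main application areas. It also profiles automotive voice providers, covering their overview, core technologies, and business strategies.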

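Table of Contents

Chapter 1 Overview of Automotive Voice Industry

  • Overview of Automotive Voice
  • Application Scenarios of Automotive Voice
  • Automotive Voice Technologies
  • Automotive Voice Interaction Architecture
  • Automotive Voice Common Interaction Functions
  • Automotive Voice Development Factors
  • Development History of Automotive Voice
  • Automotive Voice Industry Chain Evolution
  • Automotive Voice Industry Chain
  • Market Size Forecast (2023-2026)
  • Voice Providers Market Rankings
  • Other Voice Technologies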

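Chapter 2 Automotive Voice Applications for OEMs

  • Voice Function Comparison of OEMs
  • Summary of Voice Development Models by OEMs
  • OTA (Over-the-Air) Voice Functions of OEMs
  • Xpeng Motor
    • Automotive Voice-enabled Benchmark Models
    • Automotive Voice Functions
    • Voice Technology
    • Self-developed Voice Architecture
    • Self-developed Voice Basic Capabilities
    • Automotive Voice Partners
  • Li Auto
    • Automotive Voice-enabled Benchmark Models
    • Automotive Voice Skills
    • Vehicle Control Functions
    • Self-developed Voice Technology
    • Foundation Model
    • Cockpit Interaction Planning
    • Automotive Voice Partners
  • NIO
  • AITO
  • Aion
  • Rising Auto
  • Jiyue
  • ZEEKR
  • IM Motor
  • Denza
  • Leap Motor
  • Neta Auto
  • Geely
  • GWM
  • Changan
  • Chery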

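Chapter 3 Automotive Voice Providers

  • Summary of Automotive Voice Providers: Market Position, Technical Competitiveness, and Foundation Model Layout
  • iFLYTEK
  • Cerence
  • AISpeech
  • Unisound
  • txzing.com
  • VW-Mobvoi
  • Mobvoi
  • Pachira
  • Tencent
  • Baidu
  • Alibaba
  • Huawei
  • Volcano Engine
  • Microsoft
  • VoiceAI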

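Chapter 4 Automotive Voice Industry Chain

  • Platform Integration: PATEO
  • Platform Integration: Tinnove
  • Voice Processing Engine: SinoVoice
  • Voice Processing Engine: Megatronix
  • Data Collection / Annotation: Haitian Ruisheng
  • Data Collection / Annotation: Testin
  • Data Collection / Annotation: DataBaker
  • Corpus: Magic Data
  • Chip: Horizon
  • Chip: ShensiliCon
  • Chip: Chipintelli
  • Voice Chip: Rockchip
  • Voice Chip: WUQi Micro
  • Voice Chip: LAPIS Semiconductor

Chapter 5 Development Trends of Automotive Voice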

Product Code: LMM020

The automotive voice interaction market is characterized by the following:

1. In the OEM market, 46 brands install automotive voice as standard in 2023.

From 2019 to the first nine months of 2023, both the number of automotive voice installations and the installation rate rose. In the first three quarters of 2023, nearly 12 million vehicles were pre-installed with automotive voice, an installation rate of nearly 80%.

In 2023, 46 passenger car brands have an automotive voice installation rate of 100%, including AITO, Avatr, HiPhi, Rising Auto, ZEEKR, Voyah, Li Auto, Lynk & Co, Tank, NIO, and Xpeng. In 2023, over 20 million vehicles are equipped with automotive voice, an installation rate above 80%.
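As a rough cross-check of the figures above, the installation rate is simply installations divided by total vehicle sales, so the implied sales base can be backed out. The sketch below is a back-of-envelope calculation only: the function name is hypothetical, and the inputs are the rounded values quoted in the text ("nearly 12 million", "nearly 80%", "over 20 million"), so the results are approximations and not figures taken from the report.

```python
def implied_total_sales(installed_units: float, installation_rate: float) -> float:
    """Back out total vehicle sales from installations and installation rate.

    installation_rate = installed_units / total_sales, so
    total_sales = installed_units / installation_rate.
    """
    if not 0 < installation_rate <= 1:
        raise ValueError("installation_rate must be in (0, 1]")
    return installed_units / installation_rate


# Rounded inputs quoted in the text; treat the outputs as rough estimates.
q1_q3_sales = implied_total_sales(12_000_000, 0.80)      # first three quarters of 2023
full_year_sales = implied_total_sales(20_000_000, 0.80)  # 2023 as a whole

print(f"Implied Q1-Q3 2023 sales base: about {q1_q3_sales / 1e6:.0f} million")
print(f"Implied 2023 sales base: about {full_year_sales / 1e6:.0f} million")
```

Under these rounded inputs the implied sales base is on the order of 15 million vehicles for the first three quarters and 25 million for the year; because the source gives only "nearly" and "over" values, these are order-of-magnitude checks at best.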

2. Automakers' in-house voice development is reshaping the voice supply chain.

OEMs' differentiated demand for intelligent automotive voice and their preference for independent development enable Tier 2 vendors in the conventional voice supply chain to cooperate directly with OEMs. The boundaries between the upstream, midstream, and downstream of the industry chain are blurring. For example, automakers such as GWM, ZEEKR, and Wuling cooperate directly with AISpeech, which raises both the installation rate and the intelligence level of their voice functions.

As industry chain relationships change, the competitive landscape of automotive voice changes accordingly. By installations from January to September 2023, AISpeech ranked third, supporting more than 150 models from over 30 automakers.

3. The see-and-speak function has become a standard configuration, and advanced functions such as parallel instruction, cross-sound-zone inheritance, offline voice, and out-of-vehicle voice are now available on cars.

In ResearchInChina's China Automotive Voice Industry Report, 2021-2022, "see-and-speak" was only installed by some emerging carmakers and leading Chinese independent brands, the longest continuous conversation duration was only 90 seconds, and dual-sound-zone recognition was still the mainstream solution.

In 2023, "see-and-speak" has become a standard configuration in emerging carmakers' flagship models, with continuous dialogue of up to 120 seconds. Xpeng Motor has also introduced the "Full-time Dialogue at Driver's Seat" function (when turned on, it lets the driver see and speak while looking at the center console screen, without first waking up the content on the screen). Meanwhile, four-sound-zone recognition has become the new mainstream solution, and Li Auto and Xpeng Motor have also introduced six-sound-zone recognition solutions.

In addition, more advanced voice functions became available on cars in 2023.

Parallel instruction: up to 10 actions are supported in one instruction.

Cross-sound-zone inheritance: available on models from Xpeng, ZEEKR, and Li Auto. After one occupant finishes an instruction, other passengers who want the same thing can trigger the function by saying "I want too".

Offline instruction: more content can be controlled offline. Jiyue 01 supports all-zone, fully offline voice, and still delivers extremely fast interaction with occupants when offline.

Out-of-vehicle voice: on the Changan Nevo A07, this function allows voice control of the trunk, windows, music, air conditioning, pull-out/in, and other functions; on the Jiyue 01, it allows voice control of car/parking, air conditioning, audio, lights, windows, doors, tailgate, and charging cover.
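To make parallel instruction and cross-sound-zone inheritance more concrete, the toy sketch below models them as a small dialogue state. It is an illustration under stated assumptions, not any carmaker's implementation: the class names, the action labels, the zone names, and the naive split on "and" (a stand-in for real language understanding) are all hypothetical. Only the cap of 10 actions and the trigger phrase "I want too" come from the text.

```python
from dataclasses import dataclass, field

MAX_PARALLEL_ACTIONS = 10          # cap quoted in the text for one instruction
INHERIT_TRIGGER = "i want too"     # trigger phrase quoted in the text


@dataclass
class Action:
    name: str  # hypothetical action label, e.g. "open_window"
    zone: str  # sound zone (seat position) the action applies to


@dataclass
class DialogueState:
    # Actions from the most recent instruction, from whichever zone issued it.
    last_actions: list[Action] = field(default_factory=list)

    def handle(self, zone: str, utterance: str) -> list[Action]:
        text = utterance.strip().lower()
        if text == INHERIT_TRIGGER:
            # Cross-sound-zone inheritance: replay the previous speaker's
            # actions, retargeted to the zone of the new speaker.
            actions = [Action(a.name, zone) for a in self.last_actions]
        else:
            # Parallel instruction: one utterance carries several actions.
            # Splitting on " and " is a toy substitute for real NLU.
            names = [p.strip().replace(" ", "_") for p in text.split(" and ") if p.strip()]
            if len(names) > MAX_PARALLEL_ACTIONS:
                raise ValueError("too many actions in one instruction")
            actions = [Action(n, zone) for n in names]
        self.last_actions = actions
        return actions


state = DialogueState()
print(state.handle("driver", "open window and turn on seat heating"))
print(state.handle("rear_left", "I want too"))
```

In this sketch the rear-left passenger inherits both of the driver's actions for their own zone. Whether a production system replays every action or only some of them is not stated in the source, so that behavior is an assumption.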

4. Voice interaction is the first stop for foundation models to get on vehicles in intelligent cockpit scenarios.

The ChatGPT boom has allowed the related foundation model technology to spread rapidly from AI to other sectors. In 2023, foundation models gained pace in the automotive industry, and quite a few automakers are exploring opportunities to apply them in intelligent cockpit, intelligent driving, and other scenarios.

In intelligent cockpit scenarios, voice interaction is the first stop for foundation models to get on vehicles. In February 2023, Baidu released ERNIE Bot, a Chinese counterpart to ChatGPT, and brands such as GWM, Geely, and Voyah followed. In April 2023, Alibaba disclosed that the AliOS intelligent vehicle operating system had been connected to the Tongyi Qianwen foundation model for testing and would later be applied by IM Motors. In August 2023, in Huawei HarmonyOS 4.0, the intelligent assistant Xiaoyi was connected to the Pangu model for the first time, mainly to improve intelligent interaction, scenario arrangement, language understanding, productivity, and personalized service.

Besides conventional Internet companies, voice providers such as iFLYTEK, AISpeech, and Unisound are also important foundation model players and have launched related products.

The iFLYTEK Spark cognitive foundation model has six core capabilities: penetrative understanding of multi-round dialogues, knowledge application, empathic chat and dialogue, self-guided replies in multi-round dialogues, file-based rapid learning of new knowledge, and evolution based on corrective feedback from a massive user base.

AISpeech DFM-2 is an industry language foundation model with generalized intelligence. In the field of in-vehicle interaction, AISpeech integrates Lyra automotive voice assistant with DFM-2, which significantly improves capabilities in planning, creation, knowledge, intervention, plug-in, multi-level semantic dialogue, and documentation, and supports multi-modal, multi-intent, multi-sound-zone, and all-scenario multi-round continuous dialogues.

Table of Contents

1 Overview of Automotive Voice Industry

  • 1.1 Overview of Automotive Voice
  • 1.2 Application Scenarios of Automotive Voice
  • 1.3 Automotive Voice Technologies
  • 1.4 Automotive Voice Interaction Architecture
  • 1.5 Automotive Voice Common Interaction Functions
  • 1.6 Automotive Voice Development Factors
  • 1.7 Development History of Automotive Voice
  • 1.8 Automotive Voice Industry Chain Evolution
  • 1.9 Automotive Voice Industry Chain
  • 1.10 Market Size Forecast (2023-2026)
  • 1.11 Voice Providers Market Rankings
  • 1.12 Other Voice Technologies

2 Automotive Voice Applications for OEMs

  • 2.1 Voice Function Comparison of OEMs
  • 2.2 Summary of Voice Development Models by OEMs
  • 2.3 OTA Voice Functions of OEMs
  • 2.4 Xpeng Motor
    • 2.4.1 Automotive Voice-enabled Benchmark Models
    • 2.4.2 Automotive Voice Functions
    • 2.4.3 Voice Technology
    • 2.4.4 Self-developed Voice Architecture
    • 2.4.5 Self-developed Voice Basic Capabilities
    • 2.4.6 Automotive Voice Partners
  • 2.5 Li Auto
    • 2.5.1 Automotive Voice-enabled Benchmark Models
    • 2.5.2 Automotive Voice Skills
    • 2.5.3 Vehicle Control Functions
    • 2.5.4 Self-developed Voice Technology
    • 2.5.5 Foundation Model
    • 2.5.6 Cockpit Interaction Planning
    • 2.5.7 Automotive Voice Partners
  • 2.6 NIO
  • 2.7 AITO
  • 2.8 Aion
  • 2.9 Rising Auto
  • 2.10 Jiyue
  • 2.11 ZEEKR
  • 2.12 IM Motor
  • 2.13 Denza
  • 2.14 Leap Motor
  • 2.15 Neta Auto
  • 2.16 Geely
  • 2.17 GWM
  • 2.18 Changan
  • 2.19 Chery

3 Automotive Voice Providers

  • 3.1 Summary of Automotive Voice Providers: Market Position & Technical Competitiveness & Foundation Model Layout
  • 3.2 iFLYTEK
    • 3.2.1 Profile
    • 3.2.2 Intelligent Vehicle Business Performance
    • 3.2.3 Intelligent Vehicle Core Technology
    • 3.2.4 Voice Interaction Full Link Technology
    • 3.2.5 Automotive Interaction Development Plan
    • 3.2.6 Text-To-Speech (TTS) Technology
    • 3.2.7 Interaction Model
    • 3.2.8 Application of Interaction Foundation Model in Intelligent Cockpit
    • 3.2.9 Cockpit OS Enhanced by Foundation Models
    • 3.2.10 Knowledge Graph of iFLYTEK Interaction Foundation Model
    • 3.2.11 Interaction Foundation Model Core Capabilities
    • 3.2.12 Interaction Foundation Model Enabling Automotive Human-Machine Interaction
    • 3.2.13 Accumulation in Cognitive Intelligent Foundation Model Technology
    • 3.2.14 "1+N" System
    • 3.2.15 Multilingual Interaction System
    • 3.2.16 Support for Automotive Minor Languages
    • 3.2.17 Open Platform Voice Technology Support
    • 3.2.18 Out-of-vehicle Voice Interaction System
  • 3.3 Cerence
    • 3.3.1 Automotive Voice Recognition Hardware Framework
    • 3.3.2 Vehicle-Cloud Integration Solution
    • 3.3.3 Core Technology
    • 3.3.4 ARK Main Content
    • 3.3.5 SSE
    • 3.3.6 Drive
    • 3.3.7 Automotive Voice Interaction + AI Solution
    • 3.3.8 Co-Pilot
    • 3.3.9 Biometrics
    • 3.3.10 ICC
    • 3.3.11 Out-of-vehicle Voice Interaction
    • 3.3.12 TTS
    • 3.3.13 Other Voice Solutions
    • 3.3.14 Product Development Roadmap (2023~)
  • 3.4 AISpeech
    • 3.4.1 Profile
    • 3.4.2 Voice and Language Key Technologies
    • 3.4.3 "Cloud + Chip" Integration Strategy
    • 3.4.4 Customized Development Platform for All-link Intelligent Dialogue System: DUI
    • 3.4.5 Industry Language Model: DFM
    • 3.4.6 Intelligent Telematics Solutions
    • 3.4.7 Automotive Voice Assistant
    • 3.4.8 Intelligent Cockpit Products
    • 3.4.9 Cooperation Model Cases
  • 3.5 Unisound
    • 3.5.1 Intelligent Automotive Solutions
    • 3.5.2 Foundation Model
    • 3.5.3 Voice Technology Capabilities
    • 3.5.4 TTS
    • 3.5.5 Automotive Voice Solution Business Models
    • 3.5.6 Core Technology
    • 3.5.7 Automotive Voice Chip
    • 3.5.8 Automotive Voice Solution Supporting
  • 3.6 txzing.com
  • 3.7 VW-Mobvoi
  • 3.8 Mobvoi
  • 3.9 Pachira
  • 3.10 Tencent
  • 3.11 Baidu
    • 3.11.1 Core Voice Technology
    • 3.11.2 Voice Chip
    • 3.11.3 DuerOS
    • 3.11.4 DuerOS Empowered by Foundation Models
    • 3.11.5 ERNIE Foundation Model Enabled Cockpit Voice Interaction
    • 3.11.6 ERNIE Foundation Model Analysis
  • 3.12 Alibaba
  • 3.13 Huawei
  • 3.14 Volcano Engine
  • 3.15 Microsoft
  • 3.16 VoiceAI

4 Automotive Voice Industry Chain

  • 4.1 Platform Integration: PATEO
  • 4.2 Platform Integration: Tinnove
  • 4.3 Voice Processing Engine: SinoVoice
  • 4.4 Voice Processing Engine: Megatronix
    • 4.4.1 Product Layout
    • 4.4.2 Automotive Voice SmartMega® VOS Module
    • 4.4.3 Automotive Voice Customized and Cooperation Modes
    • 4.4.4 Implemented Model Cases
  • 4.5 Data Collection / Annotation: Haitian Ruisheng
    • 4.5.1 Voice Business
    • 4.5.2 Structure of Training Dataset
    • 4.5.3 Speech Services: Data Collection Services
    • 4.5.4 Speech Services: Data Annotation Services
  • 4.6 Data Collection / Annotation: Testin
  • 4.7 Data Collection / Annotation: DataBaker
  • 4.8 Corpus: Magic Data
  • 4.9 Chip: Horizon
  • 4.10 Chip: ShensiliCon
  • 4.11 Chip: Chipintelli
  • 4.12 Voice Chip: Rockchip
  • 4.13 Voice Chip: WUQi Micro
  • 4.14 Voice Chip: LAPIS Semiconductor

5 Development Trends of Automotive Voice

  • 5.1 Trend 1
  • 5.2 Trend 2
  • 5.3 Trend 3
  • 5.4 Trend 4
  • 5.5 Trend 5
  • 5.6 Trend 6
  • 5.7 Trend 7
  • 5.8 Trend 8