Synthesize Real-Time Streaming Digital Human Video
Introduction
This document describes how to quickly implement real-time streaming digital human video synthesis using the server APIs.
Generating a real-time digital human video stream and publishing it requires the Express Video SDK.
Prerequisites
Before synthesizing real-time streaming digital human videos, ensure that:
- You have created a project in the ZEGOCLOUD Console and obtained its valid AppID and AppSign. For details, see How to view project info in the Admin Console documentation.
- You have contacted ZEGOCLOUD technical support to enable the Digital Human AI platform service and related interface permissions.
- You have obtained the digital human ID and timbre ID. For details, please refer to Query Digital Human Assets.
- Your project has integrated the Express Video SDK.
Implementation steps
1 Generate a signature
Generate a signature according to the signature mechanism described in Accessing Server APIs. The signature needs to be set in the common parameters of the following requests. The ZEGOCLOUD server will verify the signature after receiving the request to validate the legitimacy of the requester.
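As a rough illustration of this step, the sketch below builds a set of common parameters that includes a signature. It assumes the commonly documented ZEGOCLOUD server API scheme, where the signature is the lowercase hex MD5 of AppId + SignatureNonce + ServerSecret + Timestamp. The parameter names (`AppId`, `SignatureNonce`, `Timestamp`, `Signature`, `SignatureVersion`) and the use of a server secret are assumptions, not confirmed by this document. Check them against Accessing Server APIs before relying on this.

```python
import hashlib
import secrets
import time


def generate_signature(app_id: int, signature_nonce: str,
                       server_secret: str, timestamp: int) -> str:
    """Return the lowercase hex MD5 of the concatenated inputs.

    ASSUMPTION: the concatenation order AppId + SignatureNonce +
    ServerSecret + Timestamp follows the scheme described in
    Accessing Server APIs; verify it there.
    """
    raw = f"{app_id}{signature_nonce}{server_secret}{timestamp}"
    return hashlib.md5(raw.encode("utf-8")).hexdigest()


def build_common_params(app_id: int, server_secret: str) -> dict:
    """Build the common request parameters (names are assumptions)."""
    nonce = secrets.token_hex(8)   # random 16-character hex string
    timestamp = int(time.time())   # current Unix time in seconds
    return {
        "AppId": app_id,
        "SignatureNonce": nonce,
        "Timestamp": timestamp,
        "Signature": generate_signature(app_id, nonce, server_secret, timestamp),
        "SignatureVersion": "2.0",
    }
```

Generating a fresh nonce and timestamp for every request keeps each signature single-use, which is the usual reason such schemes include both values.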
2 Get the list of available digital humans
Call the GetDigitalHumanList interface to get the list of available digital humans.
3 Get the list of available timbres
Call the GetTimbreList interface to get the list of available timbres.
4 Create a digital human video stream task
Call the CreateDigitalHumanStreamTask interface to create a digital human video stream task, and generate the real-time digital human video.
5 Get the status of the digital human video stream task
Call the GetDigitalHumanStreamTaskStatus interface to get the status of the digital human video stream task. If the status is 3 (Streaming), the task is running, and the client can enter the room to pull the digital human video stream.
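Because the task may not reach the Streaming status immediately, a client typically polls the status before pulling the stream. The sketch below shows one way to do that. `fetch_status` is a hypothetical callable that you would implement yourself to call GetDigitalHumanStreamTaskStatus and return the numeric status. The timeout and interval values are illustrative, not recommendations from the API.

```python
import time
from typing import Callable

STATUS_STREAMING = 3  # "3: Streaming", as described in this step


def wait_until_streaming(fetch_status: Callable[[], int],
                         timeout_s: float = 30.0,
                         interval_s: float = 1.0,
                         sleep: Callable[[float], None] = time.sleep,
                         clock: Callable[[], float] = time.monotonic) -> bool:
    """Poll until the task reports Streaming or the timeout elapses.

    fetch_status is a hypothetical wrapper around
    GetDigitalHumanStreamTaskStatus that returns the status code.
    Returns True once the status is 3, False on timeout.
    """
    deadline = clock() + timeout_s
    while True:
        if fetch_status() == STATUS_STREAMING:
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)
```

Passing `sleep` and `clock` as parameters is only a testing convenience. In production code the defaults are sufficient.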
6 Drive the digital human to speak
Use Text, Audio, RTC Audio Stream, or the WebSocket Interface to drive the digital human to speak.
7 Stop the digital human video stream task
Call the StopDigitalHumanStreamTask interface to stop the digital human video stream task.
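A stream task that is never stopped keeps running, so it can help to tie the create and stop calls together. The sketch below is one possible pattern, not part of the API: `create_task` and `stop_task` are hypothetical wrappers around CreateDigitalHumanStreamTask and StopDigitalHumanStreamTask, and it assumes the create call yields a task identifier that the stop call accepts.

```python
from contextlib import contextmanager
from typing import Callable, Iterator


@contextmanager
def stream_task(create_task: Callable[[], str],
                stop_task: Callable[[str], None]) -> Iterator[str]:
    """Create a stream task and always stop it on exit.

    create_task and stop_task are hypothetical wrappers around
    CreateDigitalHumanStreamTask and StopDigitalHumanStreamTask.
    ASSUMPTION: the task is identified by a string ID.
    """
    task_id = create_task()
    try:
        yield task_id
    finally:
        # Runs even if driving the digital human raises an error.
        stop_task(task_id)
```

Inside the `with stream_task(...) as task_id:` block you would wait for the Streaming status and drive the digital human. The stop call then runs whether the block finishes normally or raises.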