Set Video Encoding Method
Introduction
When developers publish and play video streams, they can configure encoding and decoding in detail, including enabling layered video encoding, enabling video large and small stream encoding, using hardware encoding and decoding, and setting the video encoding method.
Layered Video Encoding
Layered video encoding divides the bitstream into a base layer and an enhancement layer, providing a better experience for users in different network states. The base layer guarantees the most basic video quality, while the enhancement layer supplements the base layer. Users with good network conditions can pull the enhancement layer for a better experience, while users with poor network conditions can pull only the base layer to ensure basic video quality.
When developers encounter the following situations in co-hosting or stream mixing business, it is recommended to use the layered video encoding function:
- Need to display video streams of different qualities on different terminals.
- Need to maintain the smoothness of co-hosting in poor network environments.
- Need to adaptively pull the quality of video streams according to network state.
Layered video encoding uses ZEGO's private protocol. The playing stream end can only pull video streams of different layers from the ZEGO server.
Video Large and Small Stream Encoding
Video large and small stream encoding works together with layered video encoding to divide the bitstream into large resolution type and small resolution type.
The most significant difference is that layered video encoding uses one encoder to encode base layer and enhancement layer bitstreams, while video large and small stream encoding uses two encoders to encode base layer and enhancement layer bitstreams.
For the specific differences, advantages, and disadvantages of the two, please refer to Video Large and Small Stream and Layered Encoding. Developers can choose layered video encoding or video large and small stream encoding based on these differences and their specific business requirements.
Video large and small stream encoding uses ZEGO's private protocol. The playing stream end can only pull video streams of different layers from the ZEGO server.
Hardware Encoding and Decoding
Developers can choose to enable hardware encoding and hardware decoding. Once enabled, the GPU is used for encoding and decoding, reducing CPU usage. If certain device models heat up severely when publishing or playing large-resolution audio and video streams, enabling hardware encoding and decoding is recommended.
Video Encoding Method
Developers can perform video encoding configuration to align encoding between different ends, thereby achieving multi-end interoperability.
Usage scenarios:
- Generally, the default encoding can be used.
- When the bitrate needs to be reduced under the same resolution and frame rate, H.265 can be used.
- When interoperability with mini-programs is required, H.264 must be used.
Download Sample Source Code
Please refer to Download Sample Source Code to obtain the source code.
For related source code, please check the files in the "/ZegoExpressExample/Examples/AdvancedVideoProcessing/EncodingAndDecoding" directory.
Prerequisites
Before implementing video encoding and decoding functions, ensure that:
- A project has been created in the ZEGOCLOUD Console, and valid AppID and AppSign have been obtained. For details, please refer to Console - Project Information.
- ZEGO Express SDK has been integrated into the project, and basic audio and video streaming functionality has been implemented. For details, please refer to Quick Start - Integration and Quick Start - Implementation Process.
Implementation Steps
Layered Video Encoding (H.264 SVC)
Using layered video encoding requires the following two steps:
- Enable layered video encoding by specifying a specific encoder before publishing stream.
- Specify the layered video to be pulled when playing stream.
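The two steps above can be sketched end-to-end with the interfaces shown in this document (setVideoConfig, startPublishingStream, setPlayStreamVideoType); the stream ID is illustrative, and room login and rendering setup are omitted:

```cpp
// Publishing end: enable layered video encoding (H.264 SVC) before publishing.
ZegoVideoConfig videoConfig;
videoConfig.codecID = ZEGO_VIDEO_CODEC_ID_SVC;
engine->setVideoConfig(videoConfig);
engine->startPublishingStream("MultiLayer-1");

// Playing end: select which layer to pull (can be called before or after
// playing the stream); DEFAULT lets the SDK adapt to the network state.
std::string playStreamID = "MultiLayer-1";
engine->setPlayStreamVideoType(playStreamID, ZEGO_VIDEO_STREAM_TYPE_DEFAULT);
```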
Enable Layered Video Encoding
Before publishing stream (startPublishingStream), call the setVideoConfig interface and set the codecID parameter of the ZegoVideoConfig class to enable or disable the layered video encoding function.
- Setting "codecID" to "ZEGO_VIDEO_CODEC_ID_SVC" can enable this function.
- Setting "codecID" to "ZEGO_VIDEO_CODEC_ID_DEFAULT", "ZEGO_VIDEO_CODEC_ID_VP8", or "ZEGO_VIDEO_CODEC_ID_H265" can disable this function.
```cpp
ZegoVideoConfig videoConfig;
videoConfig.codecID = ZEGO_VIDEO_CODEC_ID_SVC;
engine->setVideoConfig(videoConfig);
std::string streamID = "MultiLayer-1";
engine->startPublishingStream(streamID);
```

Specify the Layered Video to Pull
After the publishing stream end enables layered video encoding, the playing stream end can call the setPlayStreamVideoType interface before or after playing stream. By default, the playing stream end pulls the appropriate video layers according to network conditions, for example, only the base layer on weak networks. Developers can also pass in specific playing stream parameters to pull specific video layers. Currently, the supported video layers are as follows:
| Enumeration Value | Description |
|---|---|
| ZEGO_VIDEO_STREAM_TYPE_DEFAULT | Select layers according to network state |
| ZEGO_VIDEO_STREAM_TYPE_SMALL | Specify pulling base layer (small resolution) |
| ZEGO_VIDEO_STREAM_TYPE_BIG | Specify pulling enhancement layer (large resolution) |
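For example, a playing end can start with adaptive selection and later drop to the base layer when bandwidth is constrained. The sketch below uses only the setPlayStreamVideoType interface described above; how the application detects poor network quality is left as an assumption:

```cpp
// Default: let the SDK choose the layer according to network state.
engine->setPlayStreamVideoType(playStreamID, ZEGO_VIDEO_STREAM_TYPE_DEFAULT);

// Later, e.g. when the app detects sustained poor network quality,
// explicitly pull only the base layer (small resolution).
engine->setPlayStreamVideoType(playStreamID, ZEGO_VIDEO_STREAM_TYPE_SMALL);
```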
Taking pulling the enhancement layer as an example:
```cpp
engine->setPlayStreamVideoType(playStreamID, ZEGO_VIDEO_STREAM_TYPE_BIG);
```

Video Large and Small Stream Encoding (H.264 DualStream)
The implementation of video large and small stream encoding (H.264 DualStream) is similar to that of layered video encoding (H.264 SVC), requiring the following two steps:
- Before publishing stream, enable video large and small stream encoding by specifying a specific encoder.
- When playing stream, specify the video bitstream to be pulled.
Enable Video Large and Small Stream Encoding
Before publishing stream (startPublishingStream), call the setVideoConfig interface and set the codecID parameter of the ZegoVideoConfig class to ZEGO_VIDEO_CODEC_ID_H264_DUAL_STREAM to enable the video large and small stream encoding function.
```cpp
ZegoVideoConfig videoConfig;
videoConfig.codecID = ZEGO_VIDEO_CODEC_ID_H264_DUAL_STREAM;
engine->setVideoConfig(videoConfig);
std::string streamID = "MultiLayer-1";
engine->startPublishingStream(streamID);
```

Specify the Video Stream to Pull
After the publishing stream end enables video large and small stream encoding, the playing stream end can call the setPlayStreamVideoType interface before or after playing stream. By default, the playing stream end pulls the appropriate video stream according to network conditions, for example, only the base layer on weak networks. Developers can also pass in specific playing stream parameters to pull specific video layers. Currently, the supported video layers are as follows:
| Enumeration Value | Description |
|---|---|
| ZEGO_VIDEO_STREAM_TYPE_DEFAULT | Select layers according to network state |
| ZEGO_VIDEO_STREAM_TYPE_SMALL | Specify pulling base layer (small resolution) |
| ZEGO_VIDEO_STREAM_TYPE_BIG | Specify pulling enhancement layer (large resolution) |
Taking pulling the enhancement layer as an example:
```cpp
engine->setPlayStreamVideoType(playStreamID, ZEGO_VIDEO_STREAM_TYPE_BIG);
```

Hardware Encoding and Decoding
Since a small number of device models have poor support for hardware encoding/decoding, the SDK uses software encoding and software decoding by default. If developers have requirements for using hardware encoding, they can refer to this section to set it themselves.
Enable Hardware Encoding
This function must be set before publishing stream to take effect. If set after publishing stream, it will take effect only after stopping publishing stream and republishing stream.
If developers need to enable hardware encoding, they can call the enableHardwareEncoder interface.
```cpp
// Enable hardware encoding
engine->enableHardwareEncoder(true);
```

Enable Hardware Decoding
This function must be set before playing stream to take effect. If set after playing stream, it will take effect only after stopping playing stream and replaying stream.
If developers need to enable hardware decoding, they can call the enableHardwareDecoder interface.
```cpp
// Enable hardware decoding
engine->enableHardwareDecoder(true);
```

Set Video Encoding Method
Before publishing stream (startPublishingStream), call the setVideoConfig interface and set the codecID parameter of the ZegoVideoConfig class to set the video encoding method. Currently, the supported video encoding methods are as follows:
| Enumeration Value | Encoding Method | Usage Scenarios |
|---|---|---|
| ZEGO_VIDEO_CODEC_ID_DEFAULT | Default encoding (H.264) | H.264 is a widely used high-precision video recording, compression, and publishing format with good compatibility. |
| ZEGO_VIDEO_CODEC_ID_SVC | Layered encoding (H.264 SVC) | Scenarios that require using layered encoding. |
| ZEGO_VIDEO_CODEC_ID_H264_DUAL_STREAM | Video large and small stream encoding (H.264 DualStream) | The bitstream needs to be divided into a base layer and an enhancement layer, but SVC does not meet business needs (for example, hardware encoding is also required). |
| ZEGO_VIDEO_CODEC_ID_VP8 | VP8 | Commonly used for Web video. Cannot be used in CDN recording scenarios, as it causes abnormal recording files. |
| ZEGO_VIDEO_CODEC_ID_H265 | H.265 | Has better compression rate, but compatibility needs to be considered. |
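Since H.265 compatibility varies by device, it can be worth probing encoder support before selecting it. The sketch below assumes the isVideoEncoderSupported interface is available in the SDK version in use; if it is not, keep the default codec:

```cpp
ZegoVideoConfig videoConfig;
// Assumption: isVideoEncoderSupported exists in this SDK version.
if (engine->isVideoEncoderSupported(ZEGO_VIDEO_CODEC_ID_H265)) {
    videoConfig.codecID = ZEGO_VIDEO_CODEC_ID_H265;    // lower bitrate at the same resolution and frame rate
} else {
    videoConfig.codecID = ZEGO_VIDEO_CODEC_ID_DEFAULT; // fall back to H.264 for compatibility
}
engine->setVideoConfig(videoConfig);
```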
Taking setting the encoding method to H.265 as an example:
```cpp
ZegoVideoConfig videoConfig;
videoConfig.codecID = ZEGO_VIDEO_CODEC_ID_H265;
engine->setVideoConfig(videoConfig);
std::string streamID = "MultiLayer-1";
engine->startPublishingStream(streamID);
```

FAQ
- When relaying to or directly publishing to CDN, viewers pull streams from CDN. Are layered video encoding and video large and small stream encoding effective? What are the bitrate, resolution, and frame rate of streams pulled from CDN?
  - Layered video encoding and video large and small stream encoding use ZEGO's private protocol. The playing stream end can only pull video streams of different layers when pulling RTC streams or L3 streams from the ZEGO server.
  - In the direct publish to CDN scenario, since the stream does not go through the ZEGO server, the layering of the bitstream is invalid and the SDK falls back to H.264 encoding. The resolution and bitrate of streams pulled from CDN are consistent with those set by the publishing stream user.
  - In the relay to CDN scenario, since CDN playing stream does not use ZEGO's private protocol, the stream relayed by the ZEGO server to the CDN server does not support layered video encoding or video large and small stream encoding; only the base layer or the enhancement layer can be relayed. The resolution, bitrate, and frame rate when playing the stream from CDN depend on whether the base layer or the enhancement layer is relayed.

  Warning: When relaying to CDN, the enhancement layer is relayed by default. If the business needs to relay the base layer to CDN, please contact ZEGOCLOUD Technical Support for configuration.
- What is the difference between layered video encoding and video large and small stream encoding?

  For details, please refer to Video Large and Small Stream and Layered Encoding.