Product Features
2026-03-05
Communication Capabilities
Basic Features
| Basic Features | Feature Description | Business Scenarios |
|---|---|---|
| Audio and Video Call | Users join the same room and conduct audio and video calls. |
|
| Audio and Video Live Streaming | In the same room, including anchors and viewers, anchors can conduct audio and video live streaming, and viewers in the room can watch the live streaming. |
|
| User Permission Control | Use Token to control user permissions, such as: specifying users can enter/leave rooms; specifying users to speak/mute; specifying users. | Video conference |
| Pre-call Detection | Before conducting audio and video calls or live streaming, detect devices such as cameras, microphones, and displays to ensure normal operation of calls or live streaming. | Normal call function detection |
| Call Quality Monitoring | Detect the quality of audio and video, such as resolution, frame rate, bitrate, sampling rate, and other multi-indicator detection to ensure quality stability. | Bank account opening, remote authentication, and other scenarios with high requirements and limitations on audio and video quality |
| Network Speed Testing | Before users publish/play streams, detect uplink and downlink network speeds to determine the bitrate of audio and video streams suitable for publishing/playing under the current network environment. | Call scenarios, education scenarios, live streaming scenarios |
Advanced Features
| Advanced Features | Feature Description | Business Scenarios |
|---|---|---|
| Live Streaming Co-hosting | In a room, multiple anchors can appear and conduct same-screen co-hosting live streaming. |
|
| Multi-source Capture | Provides flexible and easy-to-use audio and video capture source and channel management capabilities, reducing developers' development and maintenance costs. | Video conference, online education |
| Publish Multiple Streams Simultaneously | A user can publish multiple audio and video streams, such as sharing the screen while sending the camera's video stream. | Playing PPT in a video conference while seeing the speaker's picture |
| Supplemental Enhancement Information (SEI) | Text information is packaged with audio and video content and transmitted through the streaming media channel to achieve precise synchronization between text data and audio and video content. |
|
| Traffic Control | ZEGO industry-leading technology. The SDK dynamically adjusts video publishing stream bitrate, frame rate, resolution, and audio bitrate according to its own and the peer's current network environment status, automatically adapting to the current network environment and network fluctuations, thereby ensuring smooth video publishing. | All scenarios that hope to have high-quality real-time audio and video services |
| Cloud Proxy | By setting the SDK's cloud proxy interface, all corresponding SDK traffic is relayed through cloud proxy servers to communicate with RTC and L3 (Ultra Low Latency Live Streaming). | Hospitals, government, company internal, and other restrictive network environments with intranets |
| Geofencing | Limits audio and video and signaling data transmission to a specific region to meet regional data privacy and security related regulations, that is, limits access to audio and video services in a specific region. | Call scenarios |
| Audio and Video Stream Encryption | Encrypt the stream when publishing, and must have a decryption key consistent with the encryption key when playing. | Scenarios that need to encrypt stream information to protect communication security |
| Range Voice | Imitates the real world. People have different auditory experiences based on factors such as the direction and distance of sound. For example, the farther the distance, the smaller the sound. At the same time, people who can receive the sound source can be grouped and limited. For example, in a room, group discussions, and different groups cannot hear each other's sounds. |
|
| Mass Scale Range Audio and Video | ZEGO industry-leading technology. Automatically pulls remote audio and video within the listening range based on user locations in the cloud and provides spatial audio effects (defaults to pulling the closest 12 routes). A single scenario supports 10,000 users to enable microphones and cameras at the same time. | Virtual office, virtual exhibitions, open virtual worlds, and other virtual scenarios |
| Multi-user Status Real-time Synchronization | ZEGO industry-leading technology. Provides ordered, high-frequency, low-latency, large-scale status synchronization services, helping developers quickly implement real-time information synchronization capabilities such as player positions, actions, and appearances in virtual gameplay, while supporting 10,000 users online simultaneously in a single scenario. | Virtual office, virtual exhibitions, virtual social networking, virtual KTV, and other metaverse scenarios, as well as general scenarios that require ultra-high frequency, low latency, and large-scale synchronization of information or control commands |
Room Capabilities
Basic Features
| Basic Features | Feature Description | Business Scenarios |
|---|---|---|
| Room Connection Status Description | Determine the user's connection status in the room and the conversion process of each connection status. | - |
| Real-time Messaging and Signaling | Real-time messaging mainly provides the function of sending and receiving pure text messages. You can send broadcast messages and barrage messages to other users in the same room, or send custom messages to specified users, and implement interactive functions such as likes, gifts, and quizzes according to your needs. |
|
Advanced Features
| Advanced Features | Feature Description | Business Scenarios |
|---|---|---|
| Login to Multiple Rooms | A user can enter multiple rooms at the same time for audio and video calls or watch live streaming. | Teacher multi-class online teaching |
Audio Capabilities
Basic Features
| Basic Features | Feature Description | Business Scenarios |
|---|---|---|
| Audio Spectrum and Volume Change | Audio spectrum: The energy value of digital audio signals at each frequency point. Volume change: The volume of a certain stream. |
|
| Ear Return and Channel Settings |
|
|
| Audio 3A Processing | During real-time audio and video calls or live streaming, 3A processing can be performed on audio to improve call or live streaming quality and user experience.
| All scenarios that hope to have high-quality real-time audio and video services |
| Voice Changer/Reverb/Stereo | To increase fun and interactivity, users can use voice changers to be funny, use reverb to enhance the atmosphere, and use stereo to make sound more three-dimensional. ZEGO Express SDK provides multiple preset voice changer, reverb, reverb echo, and stereo effects. Developers can flexibly set the sound they want. |
|
Advanced Features
| Advanced Features | Feature Description | Business Scenarios |
|---|---|---|
| Scenario-based AI Noise Reduction | Real-time automatic identification of different scenarios, intelligently adjusting AI noise reduction strategies to provide the best noise reduction and audio quality effects. In call scenarios, all sounds except human voice are identified as noise and eliminated. In music scenarios, automatically adjust noise reduction effects to restore music audio quality. | Voice rooms, conferences, voice team-up, and other 1v1 or multiplayer audio and video call scenarios, as well as live streaming or online KTV scenarios with sound cards, impromptu singing, and near-field music |
| Custom Audio Capture | Developers can obtain audio information themselves and then hand it over to the SDK for transmission. |
|
| Custom Audio Rendering | Audio is rendered by developers themselves and then played. | Developers have their own special rendering requirements |
| Custom Audio Processing | Developers can perform special audio processing themselves. | When there are special sound processing requirements that the SDK cannot meet, such as special voice changers |
| Get the Raw Audio Data | The function of obtaining raw audio recording. The obtained raw audio data format is PCM. | Audio data retention or special processing |
| AI Voice Changer | "Conan Voice Changer Bow Tie" in real-time calls, perfectly reproducing the target character's timbre and rhythm, while retaining the user's speech rate, emotion, and tone, switching timbres at will, with ultra-low latency. |
|
Video Capabilities
Basic Features
| Basic Features | Feature Description | Business Scenarios |
|---|---|---|
| Common Video Configuration | During video calls or live streaming, customize and set related configurations for captured and played video, such as video capture resolution, video encoding output resolution, video frame rate, bitrate, view mode, and mirror mode, etc. |
|
| Video Capture Rotation | For mobile devices, provides 4 capture rotation modes (fixed ratio mode, adaptive mode, alignment mode, and custom mode), simplifying the complex adaptation problems developers face when implementing multi-terminal rotation performance, such as camera angle, resolution, automatic rotation, statusbar position adaptation, etc. | - |
| Screen Sharing | Share screen content with other users in the room as video during video calls or interactive live streaming. |
|
| Watermark and Screenshot | Can add watermarks such as copyright Logos to the video picture. | Video sharing with copyright, etc. |
Advanced Features
| Advanced Features | Feature Description | Business Scenarios |
|---|---|---|
| Set Video Encoding Method | Can set video encoding and decoding in detail, including enabling layered video encoding, using hardware encoding and decoding, and setting encoding methods, etc. | When there are special requirements for encoding and decoding |
| Custom Video Capture | Customize providing video input sources to ZEGO Express SDK to input video data, and ZEGO Express SDK encodes and publishes the stream. |
|
| Custom Video Rendering | Custom video rendering refers to the SDK providing local preview and remote playing stream video frame data to the outside for users to render themselves. |
|
| Custom Video Preprocessing | Developers perform custom preprocessing on video data themselves. | Beautification, adding accessories, and other operations |
| Super Resolution | Double the width and height of the played video stream picture at the playing end. For example: The original picture resolution pulled by the playing end is 640p x 360p. After super-resolution processing of the picture, the resolution will be improved to 1280p x 720p. | 1V1 video call scenarios, live streaming scenarios, online education |
| Object Segmentation | ZEGO industry-leading technology. At the publishing end, separate the subject (mostly a person) in the rectangular video through AI algorithms and transmit it in the RTC network, and render at the playing end. | Multi-person remote same-stage, show live streaming same-stage PK, multiplayer online study, and other multi-person same-stage scenarios |
| H.265 | Through more advanced H.265 encoding technology, provides higher clarity at the same bitrate. | Poor network environment requires higher audio and video call, live streaming experience is more sensitive to bandwidth |
| Video Small and Large Streams and Layered Encoding | Divide the stream into a base layer and an enhancement layer, which can provide better experience for users with different network states and device performance. | Video call |
| Publishing Stream Video Enhancement | ZEGO Express SDK provides multiple video preprocessing enhancement capabilities. Developers can adjust picture effects at the publishing end according to business needs.
|
|
Live Streaming Capabilities
Basic Features
| Basic Features | Feature Description | Business Scenarios |
|---|---|---|
| Stream Mixing | Mix multiple video streams from multiple people into one stream, so you only need to pull one stream to see pictures of all members in the room and hear sounds of all members in the room. | Multi-person call anchor co-hosting |
| Using CDN for Live Streaming | Unify access to multiple CDN capabilities. This function supports publishing to CDN, connecting RTC products and CDN live streaming products, making it convenient for users to watch live streaming content directly from webpages or third-party players. | Basic live streaming with high concurrency, scenarios without strong requirements for live streaming latency |
| CDN Publishing Stream Authentication | To prevent attackers from stealing the developer's publishing stream URL address to use elsewhere, or forging the developer's server to generate the publishing stream URL address, thereby causing traffic loss, you can configure CDN publishing stream authentication through ZEGOCLOUD Console. After enabling authentication, you need to splice relevant authentication parameters in the publishing stream URL address, otherwise you cannot publish the stream. | - |
| Playing Stream by URL | When the publishing end uses third-party publishing tools (such as OBS software, network camera IP Camera, etc.) to publish the stream to a CDN, or uses the ZEGO SDK's forward-to-CDN function to push audio and video content to a third-party CDN, you can use the method of directly passing in the URL address to play the stream. | Obtaining third-party live streaming pictures |
Advanced Features
| Advanced Features | Feature Description | Business Scenarios |
|---|---|---|
| Ultra Low Latency Live Streaming | Focuses on providing stable and reliable live streaming services. Compared with standard video live streaming products, it has lower audio and video latency, stronger synchronization, better weak network resistance, and can bring users millisecond-level live streaming experience. |
|
| Single Stream Transcoding | Convert a single original stream into transcoded streams with different encoding formats and resolutions on the cloud. In live streaming and other scenarios, viewers can choose streams of different resolutions to watch based on access network quality, terminal devices, etc., to ensure playback smoothness. | Live streaming scenarios |
Other Capabilities
Basic Features
| Basic Features | Feature Description | Business Scenarios |
|---|---|---|
| Media Player | Provides the ability to play audio and video media files and supports publishing the audio and video data of played media files. |
|
| Audio Effect Player | Provides an audio effect player and performs unified management of sound effects, implementing enhanced realism or setting the atmosphere by playing short effect sounds. |
|
| Audio and Video Recording | During video calls, live streaming, or online teaching, users often need to record and save videos for subsequent on-demand viewing by other users. ZEGO provides multiple recording solutions to meet recording needs in different scenarios. |
|
| Camera Zoom | By setting the camera zoom factor through the SDK, you can achieve the effect of enlarging distant objects when shooting. | Outdoor live streaming |
Advanced Features
| Advanced Features | Feature Description | Business Scenarios |
|---|---|---|
| Push Whiteboard to Third-party Platforms | Utilize the stream mixing feature of ZEGO Express SDK to merge audio and video streams and ZegoSuperBoard content into one stream and output it to third-party platforms, such as WeChat, Video Account, etc., thereby achieving better dissemination and marketing effects. |
|
| Play Transparent Gift Effects | ZEGO Express SDK media player provides the function of playing MP4 materials (MP4 materials after RGB and Alpha splicing) with separated RGB channel and Alpha channel, realizing the dynamic effect of playing transparent gifts, that is, when playing gift effects, it will not block live streaming room content, greatly improving user experience. |
|
2024-02-05
