Product Features
Communication Capabilities
Basic Features
| Basic Features | Feature Description | Use Cases |
|---|---|---|
| Voice Call | Users join the same Room and hold audio calls. | |
| Audio Live Streaming | A Room contains Hosts and Audience members: Hosts stream live audio, and Audience members in the Room listen to the stream. | |
| User Permission Control | Use Tokens to control user permissions, for example which users may join or leave a Room and which users may speak or are muted. | Video conference |
| Pre-call Detection | Before an audio/video call or live stream, test devices such as cameras, microphones, and monitors to make sure the call or stream will work properly. | Checking that calls work before they start |
| Call Quality Monitoring | Monitor audio/video quality across multiple indicators, such as resolution, frame rate, bitrate, and sampling rate, to keep quality stable. | Bank account opening, remote authentication, and other scenarios with strict audio/video quality requirements |
| Network Speed Test | Before users publish/play streams, detect the upstream and downstream network speeds to determine the bitrate of audio/video streams suitable for publishing/playing in the current network environment. | Call scenarios, education scenarios, live streaming scenarios |
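
The Network Speed Test row describes choosing a bitrate that suits the measured network speed. A minimal sketch of that selection step is shown here, assuming the speed has already been measured. The bitrate ladder, the `headroom` factor, and the function name are illustrative assumptions, not ZEGO presets or SDK APIs.

```python
from typing import Optional

# Hypothetical bitrate ladder in kbps (illustrative values, not ZEGO presets).
BITRATE_LADDER_KBPS = [300, 600, 1200, 2000, 3000]


def pick_bitrate_kbps(measured_kbps: float, headroom: float = 0.75) -> Optional[int]:
    """Return the highest ladder bitrate that fits within a fraction of the measured speed.

    `headroom` reserves part of the measured bandwidth for fluctuations.
    Returns None when even the lowest ladder entry does not fit.
    """
    budget = measured_kbps * headroom
    candidates = [b for b in BITRATE_LADDER_KBPS if b <= budget]
    return max(candidates) if candidates else None
```

For example, a measured uplink of 2000 kbps with the default headroom leaves a 1500 kbps budget, so the sketch selects the 1200 kbps entry.
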
Advanced Features
| Advanced Features | Feature Description | Use Cases |
|---|---|---|
| Live Co-hosting | Multiple Hosts can appear in one Room and co-host a live stream. | |
| Multi-Source Capture | Provides flexible, easy-to-use management of audio/video capture sources and channels, reducing development and maintenance costs for developers. | Video conferences, online education |
| Publishing Multiple Streams Simultaneously | One user can publish multiple audio/video streams at once, such as sharing the screen while also sending the camera video stream. | Presenting slides in a video conference while the host's camera view stays visible |
| Supplemental Enhancement Information (SEI) | Text information is packaged with audio/video content and transmitted through the streaming media channel to achieve precise synchronization between text data and audio/video content. | |
| Traffic Control | ZEGO industry-leading technology. The SDK dynamically adjusts the video publishing bitrate, frame rate, and resolution, as well as the audio bitrate, based on the current network conditions of both the local and the remote side. It adapts automatically to the network environment and to network fluctuations, keeping video publishing smooth. | All scenarios that require high-quality real-time audio/video services |
| Cloud Proxy | Once the SDK's cloud proxy interface is set, all SDK traffic is relayed through cloud proxy servers to communicate with RTC and L3 (Ultra-low Latency Live Streaming). | Restricted network environments such as intranets in hospitals, government agencies, and companies |
| Geo-fencing | Limits the transmission of audio/video and signaling data to a certain region to meet regional data privacy and security regulations, i.e., limiting access to audio/video services in a specific region. | Call scenarios |
| Audio/Video Stream Encryption | Streams are encrypted when published. To play an encrypted stream, the player must use a decryption key that matches the encryption key. | Scenarios that need to encrypt stream information to protect communication security |
| Game Voice | Mimics the real world, where what a listener hears depends on factors such as the direction and distance of the sound source; for example, the farther away the source, the quieter the sound. Reception can also be restricted by group: for example, during group discussions in a Room, different groups cannot hear each other. | |
| Mass-Scale Range Audio/Video | ZEGO industry-leading technology. Based on user locations, the cloud automatically fetches the remote audio/video within a user's listening range and provides spatial audio effects (by default, the 12 closest streams are fetched). Supports 10,000 users with microphones and cameras enabled simultaneously in a single scenario. | Virtual offices, virtual exhibitions, open virtual worlds, and other virtual scenarios |
| Real-time Synchronization of Multi-User Status | ZEGO industry-leading technology. Provides an ordered, high-frequency, low-latency, large-scale status synchronization service, helping developers quickly implement real-time information synchronization capabilities such as player position, actions, and appearance in virtual gameplay. Supports 10,000 users online simultaneously in a single scenario. | Virtual offices, virtual exhibitions, virtual social networking, virtual KTV, and other metaverse scenarios, as well as general scenarios that require ultra-high frequency, low-latency, and large-scale synchronization of information or control commands |
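
The Game Voice and Mass-Scale Range Audio/Video rows describe hearing only nearby sources, with volume falling off over distance and a default of the 12 closest streams. The following is a minimal sketch of that idea, not ZEGO's algorithm: the function name, the linear falloff, and the data layout are assumptions chosen for illustration.

```python
import math
from typing import Dict, List, Tuple

Position = Tuple[float, float, float]


def audible_streams(
    listener: Position,
    sources: Dict[str, Position],
    listen_range: float,
    max_streams: int = 12,
) -> List[Tuple[str, float]]:
    """Return (stream_id, gain) for the closest sources within listen_range.

    Gain uses a simple linear falloff (an assumption): 1.0 at the listener's
    position, 0.0 at the edge of the listening range.
    """
    if listen_range <= 0:
        raise ValueError("listen_range must be positive")
    in_range = []
    for stream_id, position in sources.items():
        distance = math.dist(listener, position)
        if distance <= listen_range:
            gain = 1.0 - distance / listen_range
            in_range.append((distance, stream_id, gain))
    in_range.sort()  # closest sources first
    return [(stream_id, gain) for _, stream_id, gain in in_range[:max_streams]]
```

A grouping restriction, as mentioned for Game Voice, could be layered on top by filtering `sources` to the listener's group before calling the function.
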
Room Capabilities
Basic Features
| Basic Features | Feature Description | Use Cases |
|---|---|---|
| Room Connection Status Description | Determine the user's connection status in the Room and the transition process of each connection status. | - |
| Real-time Messaging and Signaling | Real-time messaging mainly provides the ability to send and receive plain-text messages. You can send broadcast messages and bullet screen messages to other users in the same Room, or send custom messages to specified users, and build interactive features such as likes, gifts, and quizzes as needed. | |
Advanced Features
| Advanced Features | Feature Description | Use Cases |
|---|---|---|
| Login to Multiple Rooms | One user can be in multiple Rooms at the same time to hold audio/video calls or watch live streams. | A teacher teaching multiple online classes at once |
Audio Capabilities
Basic Features
| Basic Features | Feature Description | Use Cases |
|---|---|---|
| Audio Spectrum and Sound Level | Audio spectrum: the energy value of digital audio signals at each frequency point. Sound level: the volume of a specific stream. | |
| Ear Monitor and Channel Setting | | |
| Audio 3A Processing | During real-time audio/video calls or live streaming, 3A processing (acoustic echo cancellation, automatic gain control, and noise suppression) can be applied to the audio to improve call or stream quality and the user experience. | All scenarios that require high-quality real-time audio/video services |
| Voice Changer/Reverb/Stereo | To make sessions more fun and interactive, users can apply a voice changer for entertainment, reverb to create atmosphere, and stereo to make the sound more three-dimensional. ZEGO Express SDK provides a variety of preset voice changer, reverb, reverb echo, and stereo effects, which developers can combine flexibly to get the sound they want. | |
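
The Audio Spectrum and Sound Level row defines the spectrum as the energy at each frequency point and the sound level as a stream's volume. As an illustration of what those two quantities mean, here is a minimal sketch that computes them from raw samples. It is not how the SDK computes them: the function names, the dBFS scale, and the naive DFT are assumptions.

```python
import cmath
import math
from typing import List, Sequence


def sound_level_dbfs(samples: Sequence[int], full_scale: int = 32768) -> float:
    """RMS level of 16-bit PCM samples in dB relative to full scale."""
    if not samples:
        return float("-inf")
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20 * math.log10(rms / full_scale) if rms > 0 else float("-inf")


def spectrum(samples: Sequence[float]) -> List[float]:
    """Magnitude at each frequency bin via a naive DFT.

    O(n^2), so only suitable for short frames; real code would use an FFT.
    Bin k corresponds to frequency k * sample_rate / len(samples).
    """
    n = len(samples)
    return [
        abs(sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)))
        for k in range(n // 2 + 1)
    ]
```
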
Advanced Features
| Advanced Features | Feature Description | Use Cases |
|---|---|---|
| Audio Mixing | The SDK takes one stream of audio data from the App and mixes it with the audio captured by the SDK into a single audio stream. This makes it possible to play custom sounds and music files during calls or live streaming so that others in the Room can hear them. | |
| Scenario-Based AI Noise Reduction | Recognizes the current scenario automatically in real time and adjusts the AI noise reduction strategy to deliver the best noise reduction and sound quality. In call scenarios, every sound other than the human voice is treated as noise and removed. In music scenarios, noise reduction is adjusted automatically to preserve the sound quality of the music. | Voice rooms, conferences, voice gaming, and other 1v1 or multi-person audio/video call scenarios, as well as live streaming or online KTV scenarios involving sound cards, singing, or near-field music |
| Custom Audio Capture | Developers can capture audio data themselves and then hand it to the SDK for transmission. | |
| Custom Audio Rendering | Audio is rendered by developers themselves and then played. | Developers have their own special rendering requirements |
| Custom Audio Processing | Developers can perform special audio processing on their own. | When there are special sound processing requirements that the SDK cannot meet, such as special voice changers |
| Getting Original Audio Data | Obtains the original recorded audio data. The data is provided in PCM format. | Audio data retention or special processing |
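
The Audio Mixing row describes combining App-provided audio with SDK-captured audio into a single stream. A minimal sketch of that operation on 16-bit PCM samples follows. It assumes both inputs share the same sample rate and channel layout; the function name and the `app_gain` parameter are hypothetical, not SDK APIs.

```python
from typing import List, Sequence

INT16_MIN, INT16_MAX = -32768, 32767


def mix_pcm16(
    captured: Sequence[int],
    app_audio: Sequence[int],
    app_gain: float = 1.0,
) -> List[int]:
    """Mix two 16-bit PCM sample sequences into one.

    The shorter input is padded with silence, and each summed sample is
    clamped to the int16 range so that loud passages clip instead of wrapping.
    """
    length = max(len(captured), len(app_audio))
    mixed = []
    for i in range(length):
        a = captured[i] if i < len(captured) else 0
        b = app_audio[i] if i < len(app_audio) else 0
        total = int(round(a + b * app_gain))
        mixed.append(max(INT16_MIN, min(INT16_MAX, total)))
    return mixed
```

Lowering `app_gain` keeps background music from overpowering the captured voice.
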
Live Streaming Capabilities
Basic Features
| Basic Features | Feature Description | Use Cases |
|---|---|---|
| Stream Mixing | Mixes the audio/video streams of multiple people into one stream, so that playing a single stream is enough to see and hear every member in the Room. | Multi-person calls, Host co-hosting |
| Using CDN for Live Streaming | Provides unified access to multiple CDNs. This feature supports publishing streams to a CDN, connecting RTC products with CDN live streaming products so that users can watch live content directly from web pages or third-party players. | Basic high-concurrency live streaming and scenarios without strict latency requirements |
| CDN Stream Publishing Authentication | Attackers may steal a developer's publishing URL for use elsewhere, or impersonate the developer's server to generate a publishing URL, causing traffic loss. To prevent this, you can configure CDN publishing authentication in the ZEGO Console. Once authentication is enabled, you must append the relevant authentication parameters to the publishing URL; otherwise the stream cannot be published. | - |
| Playing Stream by URL | When the publisher pushes a stream to a CDN with a third-party publishing tool (such as OBS or an IP camera), or uses the ZEGO SDK to relay audio/video to a third-party CDN, you can play the stream by passing in its URL directly. | Playing streams from third-party live sources |
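
The CDN Stream Publishing Authentication row mentions appending authentication parameters to the publishing URL. The sketch below shows the general shape of such a scheme so the idea is concrete. Everything in it is an assumption: the `sign` and `expire` parameter names, the MD5-over-key-path-expiry formula, and the function name are generic illustrations, and the actual parameters and algorithm are whatever is configured for your CDN in the ZEGO Console.

```python
import hashlib
from urllib.parse import urlencode, urlsplit, urlunsplit


def signed_publish_url(base_url: str, secret_key: str, expire_at: int) -> str:
    """Append hypothetical `sign` and `expire` parameters to a publishing URL.

    sign = MD5(secret_key + url_path + hex(expire_at)); this formula is an
    illustrative assumption, not a documented ZEGO or CDN algorithm.
    """
    parts = urlsplit(base_url)
    expire_hex = format(expire_at, "x")
    digest = hashlib.md5(
        f"{secret_key}{parts.path}{expire_hex}".encode("utf-8")
    ).hexdigest()
    query = urlencode({"sign": digest, "expire": expire_hex})
    if parts.query:
        # Keep any parameters already present on the URL.
        query = f"{parts.query}&{query}"
    return urlunsplit((parts.scheme, parts.netloc, parts.path, query, parts.fragment))
```

Because the signature covers the path and an expiry time, a copied URL stops working after it expires and cannot be reused for a different stream path.
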
Advanced Features
| Advanced Features | Feature Description | Use Cases |
|---|---|---|
| Ultra-low Latency Live Streaming | Focuses on providing stable and reliable live streaming services. Compared with standard live video streaming products, it offers lower audio/video latency, tighter synchronization, and better resistance to weak networks, giving users a millisecond-level live streaming experience. | |
| Direct to CDN | Pushes audio/video streams directly from the local client to a CDN, so that users can watch from web pages or third-party players by playing the stream URL. | Developers who already work with third-party CDNs for audio/video distribution |
Other Capabilities
Basic Features
| Basic Features | Feature Description | Use Cases |
|---|---|---|
| Media Player | Plays audio/video media files and supports publishing the audio and video data of the media being played. | |
| Audio Effect Player | Provides an audio effect player with unified management of audio effects, for example to play short sound effects that add realism or set the atmosphere of a scene. | |
| Audio/Video Recording | During video calls, live streaming, or online teaching, users often need to record and save videos so that others can watch them on demand later. ZEGO provides several recording solutions to meet the needs of different scenarios. | |
