Product Features
Communication Capabilities
Basic Functions
| Basic Functions | Function Description | Business Scenarios |
|---|---|---|
| Voice Call | Users join the same room and conduct audio calls. |
|
| Audio Live Streaming | In the same room, including a host and audience, the host can conduct audio live streaming, and the audience in the room can watch the live stream. |
|
| User Permission Control | Use Token to control user permissions, such as: specifying users can enter/exit rooms; specifying users to speak/mute; specifying users. | Video conference |
| Pre-call Detection | Before conducting audio and video calls or live streaming, perform device detection on cameras, microphones, monitors, etc. to ensure the normal operation of calls or live streaming. | Normal call function detection |
| Call Quality Monitoring | Detect the quality of audio and video, such as resolution, frame rate, bitrate, sampling rate, and other multi-indicator detection to ensure stable quality. | Bank account opening, remote appraisal, and other scenarios with high requirements and limitations on audio and video quality |
| Network Speed Test | Before users publish/play streams, detect the upstream and downstream network speeds to determine the suitable bitrate of audio and video streams for publishing/playing in the current network environment. | Call scenarios, education scenarios, live streaming scenarios |
Advanced Functions
| Advanced Functions | Function Description | Business Scenarios |
|---|---|---|
| Live Co-hosting | In one room, multiple hosts can appear to conduct co-hosting on the same screen. |
|
| Multi-source Capture | Provide flexible and easy-to-use audio and video capture source and channel management capabilities, reducing developers' development and maintenance costs. | Video conferences, online education |
| Publish Multiple Streams Simultaneously | One user can publish multiple audio and video streams, such as sharing the screen while sending the camera video stream. | Playing PPT during a video conference while seeing the presenter's screen |
| Supplemental Enhancement Information (SEI) | Text information is packaged with audio and video content and transmitted through the streaming media channel to achieve precise synchronization between text data and audio and video content. |
|
| Traffic Control | ZEGO industry-leading technology. The SDK dynamically adjusts the video publishing bitrate, frame rate, and resolution, as well as audio bitrate based on its own and the peer's current network environment status, automatically adapting to the current network environment and network fluctuations, thereby ensuring smooth video publishing. | All scenarios that hope to have high-quality real-time audio and video services |
| Cloud Proxy | By setting the SDK's cloud proxy interface, all traffic corresponding to the SDK is relayed through cloud proxy servers to communicate with RTC and L3 (ultra-low latency live streaming). | Hospitals, government, companies, and other restrictive network environments with intranets |
| Geofencing | Limits audio and video and signaling data transmission to a certain region to meet regional data privacy and security-related regulations, that is, limiting access to audio and video services in a specific region. | Call scenarios |
| Audio and Video Stream Encryption | Encrypt the stream when publishing, and must have a decryption key consistent with the encryption key when playing. | Scenarios that need to encrypt stream information to protect communication security |
| Game Voice | Mimic the real world, where people have different auditory experiences based on factors such as the direction and distance of the sound, for example, the farther the distance, the smaller the sound, etc. At the same time, people who can receive the sound source can be grouped and limited, for example, in a room, group discussions, and different groups cannot hear each other's sounds. |
|
| Mass-scale Range Audio and Video | ZEGO industry-leading technology. Automatically pulls remote audio and video within the listening range based on user locations in the cloud and provides spatial audio effects (pulling the nearest 12 channels by default), supporting 10,000 users to turn on microphones and cameras simultaneously in a single scene. | Virtual offices, virtual exhibitions, open virtual worlds, and other virtual scenarios |
| Real-time Multi-user Status Sync | ZEGO industry-leading technology. Provides an ordered, high-frequency, low-latency, large-scale status synchronization service, helping developers quickly implement real-time information synchronization capabilities such as player positions, actions, and appearances in virtual gameplay, while supporting 10,000 users online simultaneously in a single scene. | Virtual offices, virtual exhibitions, virtual social networking, virtual KTV, and other metaverse scenarios, as well as general scenarios that require ultra-high frequency, low latency, and large-scale synchronization of information or control commands |
Room Capabilities
Basic Functions
| Basic Functions | Function Description | Business Scenarios |
|---|---|---|
| Room Connection Status Description | Determine the user's connection status in the room and the conversion process of each connection status. | - |
| Real-time Messaging and Signaling | Real-time messaging mainly provides the function of sending and receiving pure text messages, which can send broadcast messages and barrage messages to other users in the same room, or send custom messages to specified users, and can implement interactive functions such as likes, gift giving, and answering according to needs. |
|
Advanced Functions
| Advanced Functions | Function Description | Business Scenarios |
|---|---|---|
| Login to Multiple Rooms | One user can enter multiple rooms at the same time for audio and video calls or watching live streaming. | Teacher multi-class online teaching |
Audio Capabilities
Basic Functions
| Basic Functions | Function Description | Business Scenarios |
|---|---|---|
| Audio Spectrum and Sound Level Changes | Audio spectrum: the energy value of digital audio signals at each frequency point. Sound level change: the volume level of a certain stream. |
|
| Ear Monitor and Channel Settings |
|
|
| Audio 3A Processing | During real-time audio and video calls or live streaming, 3A processing can be performed on audio to improve call or live streaming quality and user experience.
| All scenarios that hope to have high-quality real-time audio and video services |
| Voice Changer/Reverb/Stereo | To increase fun and interactivity, users can use voice changers to be funny, use reverb to set the atmosphere, and use stereo to make the sound more three-dimensional. ZEGO Express SDK provides a variety of preset voice changer, reverb, reverb echo, and stereo effects. Developers can flexibly set the sound they want. |
|
Advanced Functions
| Advanced Functions | Function Description | Business Scenarios |
|---|---|---|
| Audio Mixing | The SDK obtains one channel of audio data from the App and integrates the audio data provided by the App with the audio data collected by the SDK into one channel of audio data, thereby enabling the ability to play custom sounds and music files during calls or live streaming, and letting others in the room hear them. |
|
| Scenario-based AI Noise Reduction | Real-time automatic identification of different scenarios, intelligently adjusting AI noise reduction strategies to provide the best noise reduction and sound quality effects. In call scenarios, all sounds except human voice are identified as noise and eliminated. In music scenarios, the noise reduction effect is automatically adjusted to restore music sound quality. | Voice rooms, conferences, voice chat for games and other 1v1 or multi-person audio and video call scenarios, as well as sound cards, singing while playing, near-field music live streaming or online KTV scenarios |
| Custom Audio Capture | Developers can obtain audio information by themselves and hand it over to the SDK for transmission. |
|
| Custom Audio Rendering | Audio is rendered by developers themselves and then played. | Developers have their own special rendering requirements |
| Custom Audio Processing | Developers can perform special audio processing by themselves. | When there are special sound processing requirements that the SDK cannot meet, such as special voice changing |
| Get Original Audio Data | Get the function of original audio recording. The obtained original audio data format is PCM. | Audio data retention or special processing |
| AI Voice Changer | The "Conan Voice Changer Bow Tie" in real-time calls, perfectly reproducing the target character's timbre and rhythm, while retaining the user's speech rate, emotion, and tone, switching timbres at will, with ultra-low latency. |
|
Live Streaming Capabilities
Basic Functions
| Basic Functions | Function Description | Business Scenarios |
|---|---|---|
| Stream Mixing | Mix multiple users' multiple video streams into one stream, so that you only need to play one stream to see the screens of all members in the room and hear the sounds of all members in the room. | Multi-person call host co-hosting |
| Using CDN Live Streaming | Unified access to multiple CDN capabilities. This function supports pushing streams to CDN, connecting RTC products and CDN live streaming products, making it convenient for users to watch live content directly from web pages or third-party players. | Basic live streaming with high concurrency, scenarios with no strong requirements for live streaming latency |
| CDN Publishing Authentication | To prevent attackers from stealing the developer's publishing URL address to use it elsewhere, or forging the developer's server to generate the publishing URL address, thereby causing traffic loss, you can configure CDN publishing authentication by yourself through the ZEGOCLOUD Console. When authentication is enabled, relevant authentication parameters need to be spliced into the publishing URL address, otherwise publishing will fail. | - |
| Playing Stream by URL | When the publishing end uses third-party publishing tools (such as OBS software, network camera IP Camera, etc.) to push streams to the CDN, or when the audio and video screen is pushed to a third-party CDN by using the ZEGO SDK re-publishing CDN function, you can play streams by directly passing in the URL address. | Third-party live streaming screen acquisition |
Advanced Functions
| Advanced Functions | Function Description | Business Scenarios |
|---|---|---|
| Ultra-low Latency Live Streaming | Focuses on providing stable and reliable live streaming services. Compared with standard video live streaming products, audio and video latency is lower, synchronization is stronger, weak network resistance is better, bringing users millisecond-level live streaming experience. |
|
| Direct to CDN | The process of pushing audio and video streams directly from the local client to the CDN, and users can watch directly from web pages or third-party players through the playing URL address. | Developers who have audio and video distribution service cooperation with third-party CDNs can use |
Other Capabilities
Basic Functions
| Basic Functions | Function Description | Business Scenarios |
|---|---|---|
| Media Player | Provides the ability to play audio and video media files, and supports pushing the audio and video data of the played media files to streams. |
|
| Audio Effect Player | Provides an audio effect player and performs unified management of audio effects to achieve effects such as enhancing realism or setting the atmosphere for playing short sound effects. |
|
| Audio and Video Recording | During video calls, live streaming, or online teaching, users often need to record and save videos for subsequent on-demand viewing by other users. ZEGO provides a variety of recording solutions to meet recording needs in different scenarios. |
|
