Product Features
2024-02-05
Communication Capabilities
Basic Features
| Basic Features | Feature Description | Business Scenarios |
|---|---|---|
| Audio and Video Call | Users join the same Room and conduct audio and video calls. | - |
| Audio and Video Live Streaming | In a Room with hosts and audience members, hosts can live stream audio and video, and the audience in the Room can watch the live stream. | - |
| User Permission Control | Use Token to control user permissions, such as: specifying users can join/leave rooms; specifying users to speak/mute; specifying users. | Video Conference |
| Pre-call Detection | Before conducting audio and video calls or live streaming, perform device detection on cameras, microphones, monitors, etc., to ensure the normal operation of calls or live streaming. | Normal call function detection |
| Call Quality Monitoring | Detect the quality of audio and video, such as resolution, frame rate, bitrate, sampling rate, and other multi-indicator detections, to ensure stable quality. | Bank account opening, remote authentication, and other scenarios with high requirements and limitations on audio and video quality |
| Network Speed Test | Before users publish/play streams, detect uplink and downlink network speeds to determine the bitrate of audio and video streams suitable for publishing/playing under the current network environment. | Call scenarios, education scenarios, live streaming scenarios |
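
As a rough illustration of how a speed test result might feed into stream configuration, the sketch below picks the highest video preset whose bitrate fits within a safety margin of the measured uplink speed. The preset table, the 0.7 headroom factor, and the name `pick_publish_preset` are illustrative assumptions, not values or APIs from the ZEGO SDK.

```python
from typing import NamedTuple, Optional


class Preset(NamedTuple):
    name: str
    width: int
    height: int
    bitrate_kbps: int


# Hypothetical presets, ordered from highest to lowest bitrate.
PRESETS = [
    Preset("1080p", 1920, 1080, 3000),
    Preset("720p", 1280, 720, 1500),
    Preset("360p", 640, 360, 600),
    Preset("180p", 320, 180, 300),
]


def pick_publish_preset(uplink_kbps: float, headroom: float = 0.7) -> Optional[Preset]:
    """Return the highest preset that fits in `headroom` of the measured uplink.

    Returns None when even the lowest preset does not fit, in which case an
    application might fall back to audio only.
    """
    budget = uplink_kbps * headroom
    for preset in PRESETS:
        if preset.bitrate_kbps <= budget:
            return preset
    return None
```

The same idea applies on the playing side: compare the measured downlink speed against the bitrates of the available streams before choosing one.
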
Advanced Features
| Advanced Features | Feature Description | Business Scenarios |
|---|---|---|
| Live Streaming Co-hosting | Multiple hosts can appear in the same Room and co-host a live stream on the same screen. | - |
| Multi-source Capture | Provides flexible and easy-to-use audio and video capture sources and channel management capabilities, reducing developers' development and maintenance costs. | Video conferences, online education |
| Publishing Multiple Streams Simultaneously | One user can publish multiple audio and video streams, such as sending the camera's video stream while sharing the screen. | In a video conference, see the presenter's screen while playing PPT |
| Supplemental Enhancement Information (SEI) | Text information is packaged with audio and video content and transmitted through the streaming media channel to achieve precise synchronization between text data and audio and video content. | - |
| Traffic Control | ZEGO industry-leading technology. The SDK dynamically adjusts the video publishing stream's bitrate, frame rate, resolution, and audio bitrate based on its own and the peer's current network environment status, automatically adapting to the current network environment and network fluctuations, thereby ensuring smooth video publishing. | All scenarios that require high-quality real-time audio and video services |
| Cloud Proxy | By setting the SDK's cloud proxy interface, all traffic corresponding to the SDK is relayed through cloud proxy servers to communicate with RTC and L3 (Ultra-low Latency Live Streaming). | Hospitals, government, company internal, and other restrictive network environments such as intranets |
| Geo-fencing | Restricts audio, video, and signaling data transmission to a specific region to meet regional data privacy and security regulations, that is, restricting access to audio and video services in a specific region. | Call scenarios |
| Audio and Video Stream Encryption | Encrypt the stream when publishing, and the stream must have a decryption key consistent with the encryption key when playing. | Scenarios that need to encrypt stream information to protect communication security |
| Game Voice | Simulates the real world, where what people hear depends on the direction and distance of the sound source. For example, the farther away the source, the quieter the sound. Users who can hear a sound source can also be grouped and restricted. For example, users in a Room can hold group discussions in which different groups cannot hear each other. | - |
| Ten-thousand User Range Audio and Video | ZEGO industry-leading technology. Automatically pulls remote audio and video within the listening range based on the user's location in the cloud and provides spatial audio effects (by default, pulls the 12 closest streams). A single scenario supports 10,000 users to enable microphones and cameras at the same time. | Virtual offices, virtual exhibitions, open virtual worlds, and other virtual scenarios |
| Multi-person State Real-time Synchronization | ZEGO industry-leading technology. Provides an ordered, high-frequency, low-latency, large-scale state synchronization service to help developers quickly realize real-time information synchronization capabilities such as player positions, actions, and images in virtual gameplay. At the same time, it supports 10,000 users online at the same time in a single scenario. | Virtual offices, virtual exhibitions, virtual social networking, virtual KTV, and other metaverse scenarios, as well as general scenarios that require ultra-high frequency, low latency, and large-scale synchronization of information or control commands |
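
The range audio and video behaviour described above (play only nearby users, capped at the 12 closest streams by default) can be sketched as follows. This is a minimal illustration, not the SDK's implementation: the function names, the 3D tuple positions, and the linear volume falloff are all assumptions, since the source does not describe the actual attenuation curve.

```python
import math
from typing import Dict, List, Tuple

Position = Tuple[float, float, float]


def streams_to_play(
    listener: Position,
    others: Dict[str, Position],
    listening_range: float,
    max_streams: int = 12,
) -> List[str]:
    """Return the IDs of the closest users within range, nearest first."""
    in_range = []
    for user_id, pos in others.items():
        distance = math.dist(listener, pos)
        if distance <= listening_range:
            in_range.append((distance, user_id))
    in_range.sort()  # nearest first; ties are broken by user ID
    return [user_id for _, user_id in in_range[:max_streams]]


def attenuated_gain(distance: float, listening_range: float) -> float:
    """Assumed linear falloff: 1.0 at the listener, 0.0 at the edge of the range."""
    if listening_range <= 0 or distance >= listening_range:
        return 0.0
    return 1.0 - distance / listening_range
```

Capping the number of played streams keeps client bandwidth and decoding cost bounded no matter how many users share the scenario.
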
Room Capabilities
Basic Features
| Basic Features | Feature Description | Business Scenarios |
|---|---|---|
| Room Connection Status Description | Determine the user's connection status in the Room and the conversion process of each connection status. | - |
| Real-time Messaging and Signaling | Real-time messaging sends and receives plain text messages. Users can send broadcast messages and barrage (bullet-screen) messages to other users in the same Room, or send custom messages to specified users, to implement interactive features such as likes, gifts, and quizzes as needed. | - |
Advanced Features
| Advanced Features | Feature Description | Business Scenarios |
|---|---|---|
| Login to Multiple Rooms | One user can enter multiple Rooms at the same time for audio and video calls or watch live streaming. | Teachers teaching multiple classes online |
Audio Capabilities
Basic Features
| Basic Features | Feature Description | Business Scenarios |
|---|---|---|
| Audio Spectrum and Volume Changes | Audio spectrum: the energy value of digital audio signals at each frequency point. Volume changes: the volume of a certain stream. | - |
| Ear Monitoring and Sound Channel Settings | - | - |
| Audio 3A Processing | During real-time audio and video calls or live streaming, 3A processing can be performed on audio to improve the quality of calls or live streaming and user experience. | All scenarios that require high-quality real-time audio and video services |
| Voice Changer/Reverb/Stereo | To make calls more fun and interactive, users can apply a voice changer for comic effect, reverb to heighten the atmosphere, and stereo to make the sound more three-dimensional. ZEGO Express SDK provides a variety of preset voice changer, reverb, reverb echo, and stereo effects, and developers can flexibly configure the sound they want. | - |
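
To make the terms "audio spectrum" and "volume" above concrete, the sketch below computes both for one frame of audio samples using only the standard library. It is a conceptual illustration under stated assumptions: the function names are hypothetical, the naive DFT is chosen for readability rather than speed, and none of this reflects how the SDK computes or reports these values.

```python
import cmath
import math
from typing import List, Sequence


def spectrum(samples: Sequence[float]) -> List[float]:
    """Magnitude of each DFT bin from 0 up to the Nyquist bin (naive O(n^2) DFT)."""
    n = len(samples)
    if n == 0:
        return []
    bins = []
    for k in range(n // 2 + 1):
        acc = sum(
            s * cmath.exp(-2j * math.pi * k * i / n) for i, s in enumerate(samples)
        )
        bins.append(abs(acc) / n)
    return bins


def rms_volume(samples: Sequence[float]) -> float:
    """Root-mean-square level of the frame, a common proxy for volume."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))
```

For a pure tone, the spectrum peaks at the bin matching the tone's frequency, which is the kind of per-frequency data a spectrum visualizer would draw.
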
Advanced Features
| Advanced Features | Feature Description | Business Scenarios |
|---|---|---|
| Scenario-based AI Noise Suppression | Real-time automatic recognition of different scenarios, intelligently adjusting AI noise suppression strategies to provide the best noise suppression and sound quality effects. In call scenarios, all sounds except human voices are identified as noise and eliminated. In music scenarios, automatically adjust the noise suppression effect to restore music sound quality. | Voice rooms, conferences, voice chat gaming, and other 1v1 or multi-person audio and video call scenarios, as well as live streaming or online KTV scenarios with sound cards, singing and playing, near-field music |
| Custom Audio Capture | Developers capture audio data themselves and hand it to the SDK for transmission. | - |
| Custom Audio Render | Developers render and play the audio themselves. | Developers with their own special rendering requirements |
| Custom Audio Processing | Developers can perform special audio processing by themselves. | When there are special sound processing requirements that the SDK cannot meet, such as special voice changing |
| Original Audio Data Acquisition | Obtain the original recorded audio data. The data is delivered in PCM format. | Audio data retention or special processing |
| AI Voice Changer | Works like Conan's voice-changing bow tie, but for real-time calls: reproduces the target character's timbre and rhythm while preserving the user's speech rate, emotion, and tone. Timbres can be switched at will with ultra-low latency. | - |
Video Capabilities
Basic Features
| Basic Features | Feature Description | Business Scenarios |
|---|---|---|
| Common Video Configurations | During video calls or live streaming, customize the settings for capturing and playing video, such as video capture resolution, video encoding output resolution, video frame rate, bitrate, view mode, and mirror mode. | - |
| Video Capture Rotation | For mobile devices, 4 capture rotation modes are provided (fixed ratio mode, adaptive mode, alignment mode, and custom mode), simplifying the complex adaptation problems developers face when implementing rotation across multiple platforms, such as camera angle, resolution, auto-rotation, and status bar position adaptation. | - |
| Screen Sharing | Share screen content with other users in the Room in the form of video during video calls or interactive live streaming. | - |
| Basic Face Beautification | During video calls or live streaming, use the SDK together with AI Effects to apply basic face beautification, so that users can present better-looking skin to the other party. | - |
| Watermark and Screenshot | You can add watermarks such as copyright logos to the video screen. | Video sharing with copyrights, etc. |
Advanced Features
| Advanced Features | Feature Description | Business Scenarios |
|---|---|---|
| Set Video Encoding Method | Detailed settings can be made on video encoding and decoding, including enabling layered video encoding, using hardware encoding and decoding, and setting encoding methods, etc. | When there are special requirements for encoding and decoding |
| Custom Video Capture | Developers provide their own video input source and pass the video data to ZEGO Express SDK, which then performs encoding and stream publishing. | - |
| Custom Video Rendering | The SDK exposes the video frame data of the local preview and of played remote streams so that developers can render it themselves. | - |
| Custom Video Pre-processing | Developers perform custom pre-processing on video data by themselves. | Operations such as beauty effects, adding accessories, etc. |
| Super Resolution | Doubles the width and height of the played video stream on the playing end. For example, if the original video played on the playing end has a resolution of 640 x 360, super-resolution processing increases it to 1280 x 720. | 1v1 video call scenarios, live streaming scenarios, online education |
| Subject Segmentation | ZEGO industry-leading technology. At the publishing stream end, the subject in the rectangular video (mostly human) is separated through AI algorithms and transmitted in the RTC network, and rendered at the playing stream end. | Multi-person remote co-hosting, showcase live streaming co-hosting PK, multi-person online study, and other multi-person co-hosting scenarios |
| Low-light Enhancement | When ambient light is dim and the image captured by the camera is too dark to meet business requirements, such as seeing the other party's face clearly or performing face recognition, the image brightness is enhanced. | Video calls in dim ambient light |
| H.265 | The more advanced H.265 encoding technology provides higher clarity at the same bitrate. | Bandwidth-sensitive scenarios that need a better audio and video call or live streaming experience in poor network environments |
| Video Large and Small Streams and Layered Encoding | Divide the stream into a base layer and an enhancement layer, which can provide better experience for users with different network statuses and device performance. | Video calls |
Live Streaming Capabilities
Basic Features
| Basic Features | Feature Description | Business Scenarios |
|---|---|---|
| Stream Mixing | Mix the audio and video streams of multiple users into one stream, so that playing a single stream is enough to see and hear all members in the Room. | Multi-person calls, host co-hosting |
| Use CDN Live Streaming | Unify access to multiple CDN capabilities. This function supports publishing to CDN, opening up RTC products and CDN live streaming products, making it convenient for users to watch live streaming content directly from web pages or third-party players. | Basic live streaming with high concurrency, scenarios without strong requirements for live streaming latency |
| CDN Publishing Authentication | Attackers may steal the developer's publishing URL to use it elsewhere, or impersonate the developer's server to generate publishing URLs, resulting in traffic loss. To prevent this, you can configure CDN publishing authentication yourself in ZEGOCLOUD Console. When authentication is enabled, the relevant authentication parameters must be appended to the publishing URL, otherwise publishing will fail. | - |
| Play Stream by URL | When the publishing end uses third-party publishing tools (such as OBS or an IP camera) to push streams to the CDN, or uses the ZEGO SDK's relay-to-CDN function to push audio and video to a third-party CDN, you can play the stream directly by passing in its URL. | Playing live streams from third-party sources |
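
The idea of appending authentication parameters to a publishing URL can be sketched as below. The source does not specify the parameter names or the signing algorithm, and these differ between CDN vendors, so everything here is a hypothetical scheme for illustration only: the `expires` and `sign` parameter names, the MD5-over-key-path-expiry construction, and the function name `sign_publish_url` are all assumptions.

```python
import hashlib
from urllib.parse import urlencode, urlsplit, urlunsplit


def sign_publish_url(url: str, secret_key: str, expires_at: int) -> str:
    """Append hypothetical `expires` and `sign` query parameters to a publishing URL.

    `expires_at` is a Unix timestamp after which the URL should be rejected.
    """
    parts = urlsplit(url)
    expires_hex = format(expires_at, "x")
    # Hypothetical signature: MD5 over secret key + stream path + expiry.
    digest = hashlib.md5(
        (secret_key + parts.path + expires_hex).encode("utf-8")
    ).hexdigest()
    auth = urlencode({"expires": expires_hex, "sign": digest})
    query = f"{parts.query}&{auth}" if parts.query else auth
    return urlunsplit((parts.scheme, parts.netloc, parts.path, query, parts.fragment))
```

Because the signature depends on a secret known only to the developer's server and the CDN, a stolen URL stops working once it expires and a forged one fails verification. Consult the CDN vendor's documentation for the actual parameter format.
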
Advanced Features
| Advanced Features | Feature Description | Business Scenarios |
|---|---|---|
| Ultra-low Latency Live Streaming | Focuses on providing stable and reliable live streaming services. Compared with standard video live streaming products, it offers lower audio and video latency, tighter synchronization, and better resistance to weak networks, bringing a millisecond-level live streaming experience to users. | - |
| Direct CDN Publishing | Pushes audio and video streams directly from the local client to the CDN, allowing users to watch directly from web pages or third-party players through the playing URL. | Developers who already have an audio and video distribution arrangement with a third-party CDN |
| Single Stream Transcoding | Convert a single original stream into transcoded streams with different encoding formats and resolutions in the cloud. In scenarios such as live streaming, audiences can choose streams with different resolutions to watch based on access network quality, terminal devices, etc., to ensure playback smoothness. | Live streaming scenarios |
Other Capabilities
Basic Features
| Basic Features | Feature Description | Business Scenarios |
|---|---|---|
| Media Player | Provides the ability to play audio and video media files and supports pushing the audio and video data of the played media files out as streams. | - |
| Sound Effect Player | Provides a sound effect player that manages sound effects centrally and controls playback of short sound effects that enhance realism or heighten the atmosphere of a scene. | - |
| Audio and Video Recording | When conducting video calls, live streaming, or online teaching, users often need to record and save videos for subsequent on-demand viewing by other users. ZEGO provides a variety of recording solutions to meet recording needs in different scenarios. | - |
| Camera Zoom | By setting the camera's zoom ratio through the SDK, the effect of enlarging distant objects during shooting can be achieved. | Outdoor live streaming |
