logo
On this page

Product Features


Communication Capabilities

Basic Functions

Basic FunctionsFunction DescriptionBusiness Scenarios
Voice CallUsers join the same room and conduct audio calls.
  • 1v1 voice call
  • Multi-person voice conference
Audio Live StreamingIn the same room, including a host and audience, the host can conduct audio live streaming, and the audience in the room can watch the live stream.
  • Emotional FM
  • Voice chat room
User Permission ControlUse Token to control user permissions, such as: specifying users can enter/exit rooms; specifying users to speak/mute; specifying users.Video conference
Pre-call DetectionBefore conducting audio and video calls or live streaming, perform device detection on cameras, microphones, monitors, etc. to ensure the normal operation of calls or live streaming.Normal call function detection
Call Quality MonitoringDetect the quality of audio and video, such as resolution, frame rate, bitrate, sampling rate, and other multi-indicator detection to ensure stable quality.Bank account opening, remote appraisal, and other scenarios with high requirements and limitations on audio and video quality
Network Speed TestBefore users publish/play streams, detect the upstream and downstream network speeds to determine the suitable bitrate of audio and video streams for publishing/playing in the current network environment.Call scenarios, education scenarios, live streaming scenarios

Advanced Functions

Advanced FunctionsFunction DescriptionBusiness Scenarios
Live Co-hostingIn one room, multiple hosts can appear to conduct co-hosting on the same screen.
  • Multi-host co-hosting from different regions
  • Multi-person KTV chorus
  • Showcase live streaming
Multi-source CaptureProvide flexible and easy-to-use audio and video capture source and channel management capabilities, reducing developers' development and maintenance costs.Video conferences, online education
Publish Multiple Streams SimultaneouslyOne user can publish multiple audio and video streams, such as sharing the screen while sending the camera video stream.Playing PPT during a video conference while seeing the presenter's screen
Supplemental Enhancement Information (SEI)Text information is packaged with audio and video content and transmitted through the streaming media channel to achieve precise synchronization between text data and audio and video content.
  • Precise layout of video screens
  • Remote lyrics synchronization
  • Live quiz
Traffic ControlZEGO industry-leading technology. The SDK dynamically adjusts the video publishing bitrate, frame rate, and resolution, as well as audio bitrate based on its own and the peer's current network environment status, automatically adapting to the current network environment and network fluctuations, thereby ensuring smooth video publishing.All scenarios that hope to have high-quality real-time audio and video services
Cloud ProxyBy setting the SDK's cloud proxy interface, all traffic corresponding to the SDK is relayed through cloud proxy servers to communicate with RTC and L3 (ultra-low latency live streaming).Hospitals, government, companies, and other restrictive network environments with intranets
GeofencingLimits audio and video and signaling data transmission to a certain region to meet regional data privacy and security-related regulations, that is, limiting access to audio and video services in a specific region.Call scenarios
Audio and Video Stream EncryptionEncrypt the stream when publishing, and must have a decryption key consistent with the encryption key when playing.Scenarios that need to encrypt stream information to protect communication security
Game VoiceMimic the real world, where people have different auditory experiences based on factors such as the direction and distance of the sound, for example, the farther the distance, the smaller the sound, etc. At the same time, people who can receive the sound source can be grouped and limited, for example, in a room, group discussions, and different groups cannot hear each other's sounds.
  • Metaverse
  • Same room, grouped communication or battle
Mass-scale Range Audio and VideoZEGO industry-leading technology. Automatically pulls remote audio and video within the listening range based on user locations in the cloud and provides spatial audio effects (pulling the nearest 12 channels by default), supporting 10,000 users to turn on microphones and cameras simultaneously in a single scene.Virtual offices, virtual exhibitions, open virtual worlds, and other virtual scenarios
Real-time Multi-user Status SyncZEGO industry-leading technology. Provides an ordered, high-frequency, low-latency, large-scale status synchronization service, helping developers quickly implement real-time information synchronization capabilities such as player positions, actions, and appearances in virtual gameplay, while supporting 10,000 users online simultaneously in a single scene.Virtual offices, virtual exhibitions, virtual social networking, virtual KTV, and other metaverse scenarios, as well as general scenarios that require ultra-high frequency, low latency, and large-scale synchronization of information or control commands

Room Capabilities

Basic Functions

Basic FunctionsFunction DescriptionBusiness Scenarios
Room Connection Status DescriptionDetermine the user's connection status in the room and the conversion process of each connection status.-
Real-time Messaging and SignalingReal-time messaging mainly provides the function of sending and receiving pure text messages, which can send broadcast messages and barrage messages to other users in the same room, or send custom messages to specified users, and can implement interactive functions such as likes, gift giving, and answering according to needs.
  • Showcase live streaming
  • Voice chat room

Advanced Functions

Advanced FunctionsFunction DescriptionBusiness Scenarios
Login to Multiple RoomsOne user can enter multiple rooms at the same time for audio and video calls or watching live streaming.Teacher multi-class online teaching

Audio Capabilities

Basic Functions

Basic FunctionsFunction DescriptionBusiness Scenarios
Audio Spectrum and Sound Level ChangesAudio spectrum: the energy value of digital audio signals at each frequency point.
Sound level change: the volume level of a certain stream.
  • Determine which user on the mic is speaking, whether the microphone, speaker, etc. are available
  • Audio spectrum animation display, etc.
Ear Monitor and Channel Settings
  • Headphone acquisition monitoring: after inserting headphones (ordinary headphones or Bluetooth headphones) into the device, you can hear the sound collected by the device's microphone from the local headphone side.
  • Stereo: refers to two sound channels. When hearing sounds, you can determine the specific position of the sound source based on the phase difference of the left and right ears.
  • Showcase live streaming
  • Emotional FM
  • Music teaching and other relatively professional scenarios
Audio 3A ProcessingDuring real-time audio and video calls or live streaming, 3A processing can be performed on audio to improve call or live streaming quality and user experience.
  • AEC (Acoustic Echo Cancellation): Filter the collected audio data to reduce echoes in the audio.
  • AGC (Automatic Gain Control): After enabling this function, the SDK can automatically adjust the microphone volume, adapt to near and far sound pickup, and keep the volume stable.
  • ANS (Audio Noise Suppression): Identify background noise in the sound and eliminate it. After enabling this function, the human voice can be clearer.
All scenarios that hope to have high-quality real-time audio and video services
Voice Changer/Reverb/Stereo

To increase fun and interactivity, users can use voice changers to be funny, use reverb to set the atmosphere, and use stereo to make the sound more three-dimensional.

ZEGO Express SDK provides a variety of preset voice changer, reverb, reverb echo, and stereo effects. Developers can flexibly set the sound they want.

  • Live streaming
  • Voice chat room
  • Karaoke room
  • Anonymous social networking
  • Game entertainment
  • Role-playing

Advanced Functions

Advanced FunctionsFunction DescriptionBusiness Scenarios
Audio MixingThe SDK obtains one channel of audio data from the App and integrates the audio data provided by the App with the audio data collected by the SDK into one channel of audio data, thereby enabling the ability to play custom sounds and music files during calls or live streaming, and letting others in the room hear them.
  • Social voice chat
  • Live streaming
Scenario-based AI Noise ReductionReal-time automatic identification of different scenarios, intelligently adjusting AI noise reduction strategies to provide the best noise reduction and sound quality effects.
In call scenarios, all sounds except human voice are identified as noise and eliminated.
In music scenarios, the noise reduction effect is automatically adjusted to restore music sound quality.
Voice rooms, conferences, voice chat for games and other 1v1 or multi-person audio and video call scenarios, as well as sound cards, singing while playing, near-field music live streaming or online KTV scenarios
Custom Audio CaptureDevelopers can obtain audio information by themselves and hand it over to the SDK for transmission.
  • Online or local audio file transmission
  • Transmission of audio files from customized acquisition systems
Custom Audio RenderingAudio is rendered by developers themselves and then played.Developers have their own special rendering requirements
Custom Audio ProcessingDevelopers can perform special audio processing by themselves.When there are special sound processing requirements that the SDK cannot meet, such as special voice changing
Get Original Audio DataGet the function of original audio recording. The obtained original audio data format is PCM.Audio data retention or special processing
AI Voice ChangerThe "Conan Voice Changer Bow Tie" in real-time calls, perfectly reproducing the target character's timbre and rhythm, while retaining the user's speech rate, emotion, and tone, switching timbres at will, with ultra-low latency.
  • Social voice chat
  • Live streaming
  • Game voice

Live Streaming Capabilities

Basic Functions

Basic FunctionsFunction DescriptionBusiness Scenarios
Stream MixingMix multiple users' multiple video streams into one stream, so that you only need to play one stream to see the screens of all members in the room and hear the sounds of all members in the room.Multi-person call host co-hosting
Using CDN Live StreamingUnified access to multiple CDN capabilities. This function supports pushing streams to CDN, connecting RTC products and CDN live streaming products, making it convenient for users to watch live content directly from web pages or third-party players.Basic live streaming with high concurrency, scenarios with no strong requirements for live streaming latency
CDN Publishing AuthenticationTo prevent attackers from stealing the developer's publishing URL address to use it elsewhere, or forging the developer's server to generate the publishing URL address, thereby causing traffic loss, you can configure CDN publishing authentication by yourself through the ZEGOCLOUD Console. When authentication is enabled, relevant authentication parameters need to be spliced into the publishing URL address, otherwise publishing will fail.-
Playing Stream by URLWhen the publishing end uses third-party publishing tools (such as OBS software, network camera IP Camera, etc.) to push streams to the CDN, or when the audio and video screen is pushed to a third-party CDN by using the ZEGO SDK re-publishing CDN function, you can play streams by directly passing in the URL address.Third-party live streaming screen acquisition

Advanced Functions

Advanced FunctionsFunction DescriptionBusiness Scenarios
Ultra-low Latency Live StreamingFocuses on providing stable and reliable live streaming services. Compared with standard video live streaming products, audio and video latency is lower, synchronization is stronger, weak network resistance is better, bringing users millisecond-level live streaming experience.
  • Online education
  • Showcase live streaming
  • E-commerce live streaming
  • Watch together
  • Online auction
Direct to CDNThe process of pushing audio and video streams directly from the local client to the CDN, and users can watch directly from web pages or third-party players through the playing URL address.Developers who have audio and video distribution service cooperation with third-party CDNs can use

Other Capabilities

Basic Functions

Basic FunctionsFunction DescriptionBusiness Scenarios
Media PlayerProvides the ability to play audio and video media files, and supports pushing the audio and video data of the played media files to streams.
  • Play test audio
  • Play background music
  • Play video files
Audio Effect PlayerProvides an audio effect player and performs unified management of audio effects to achieve effects such as enhancing realism or setting the atmosphere for playing short sound effects.
  • Showcase live streaming
  • Game entertainment
Audio and Video RecordingDuring video calls, live streaming, or online teaching, users often need to record and save videos for subsequent on-demand viewing by other users. ZEGO provides a variety of recording solutions to meet recording needs in different scenarios.
  • Conference recording
  • Live streaming recording
  • Call recording
  • Online classroom recording

Previous

Overview

Next

Voice Call Pricing