What’s New in GIPS VoiceEngine 3.4?
A new version of GIPS VoiceEngine is about to be released. Therefore I would like to give a brief overview of some of the most important feature updates.
From a customer point of view, the two main additions are an expansion of our HD Voice support for Super Wideband (SWB), as well as support for Stereo Playout. Let me start by describing the new integration of SWB in VoiceEngine 3.4.
In order to expand support for SWB, all of VoiceEngine’s core functions now work efficiently at a 32 kHz sampling frequency. Currently, G.722.1 Annex C (or G.722.1C) is added to VoiceEngine 3.4, but the new architecture enables any codec using 32 kHz as the sample rate. G.722.1C provides 14 kHz audio bandwidth using 32 kHz sample rate at three different bitrates: 24, 32 and 48 kbps. However, as this blog has argued before, only supporting a SWB codec is not sufficient to providing high quality voice if the right additional components are not present. The new VoiceEngine from GIPS ensures true SWB quality since it contains SWB upgrades of all core components such as:
- adaptive jitter buffer and error-concealment unit (GIPS NetEQ)
- echo cancellation
- automatic gain control
- noise suppression
- voice-activity detection
- comfort noise generation
- mixers
In addition, utilization of the new SWB components are only activated when needed, hence the footprints for 8 and 16 kHz modes are not increased compared with the previous version of VoiceEngine.
VoiceEngine’s other main new feature in is the added support of stereo playout. The existing version of VoiceEngine allows “stereo modifications”, such as panning, but all actions must be performed on the client side. In the upcoming VoiceEngine 3.4, it will also be possible to play out a received stereo-signal packetized according to IETF RFC 3551. In essence, this means that a GIPS client will now be able to play out a dual-channel RTP stream, where a conference server, for instance, has performed some sort of spatial filtering of the conference participants. The end result would be that a user gets a feeling of all participants sitting around a conference table, with their voices coming from different directions. Note that the client is not performing any stereo intelligence, as the actual stereo effect must be generated at the transmitting side. As this feature pertains to the new SWB capability, the new stereo features are activated and deactivated dynamically and no new API calls are required. The only action needed on the client side to enable true stereo playout is to register a certain codec (payload type, name etc.) as a dual-channel codec.
In addition to SWB and stereo playout, the following features will also be added to the latest release of VoiceEngine:
- “RTP-dump” APIs which allow recording of received and transmitted RTP streams into rtpdump-compatible format.
- Complete Windows 7 and MAC OS X 10.6 (Snow Leopard) support.
- Automatic ducking or stream attenuation is a new feature in Windows 7 that is intended for VoIP. By default, the operating system reduces the intensity of an audio stream when a communication stream, such as a phone call, is received on the communication device through the computer. The latest VoiceEngine 3.4 fully exploits this new functionality in Windows 7 and allows the user to define a certain audio device as the default communication device.
- Possibility to build 64-bit versions of VoiceEngine for MAC OS X.
As the Technical Area Manager for voice technology, I can say that I am personally very excited about these new features, and look forward to seeing (and hearing) them enable some really cool and innovative products.





