Container Images (release 230705)
deepgram/onprem-api:1.95.0
deepgram/onprem-engine:3.53.0
deepgram/onprem-license-proxy:1.4.1
deepgram/onprem-billing:1.7.1
deepgram/onprem-metrics-server:2.0.6
deepgram/onprem-dgtools:2.1.4
Deepgram On-premises Release Tags
deepgram/onprem-api:release-230705
deepgram/onprem-engine:release-230705
deepgram/onprem-license-proxy:release-230705
deepgram/onprem-billing:release-230705
deepgram/onprem-metrics-server:release-230705
deepgram/onprem-dgtools:release-230705
This Release Contains The Following Changes
Support for license keys created and managed from Deepgram Console.
Support for new Domain-Specific Language Model powered summarization. Learn more.
The minimum supported CUDA runtime version for onprem-engine has changed from 11.0.3 to 11.3.1. Systems using NVIDIA drivers before version 450.80.02 might encounter errors when attempting to start this release of onprem-engine. Deepgram recommends installing the latest NVIDIA drivers for maximum compatibility, stability, and performance.
The onprem-engine container size has been significantly reduced.
Reduction in frequency of hallucinations when using Deepgram enhanced models.
Improvements to accuracy of reported word times when using existing Whisper models.
Duration values specified in the onprem-api configuration file can now include unit suffixes. For example, instead of writing 480 it is now possible to write 4m. Values with no suffix are assumed to be seconds.
Stability improvements and bug fixes.🐛