
Supermicro Revolutionizes Edge AI: Real-Time Decision-Making with Cloud-Level Performance
Supermicro is introducing new AI solutions that enable customers to harness AI capabilities in edge locations like public spaces, stores, and industrial sites. By combining Supermicro’s application-optimized servers with NVIDIA GPUs, customers can easily fine-tune AI models and deploy AI inference solutions at the edge, resulting in faster response times and enhanced decision-making capabilities.
These advancements allow users to process data at the edge, eliminating the need to send data back to the cloud. Customers can now leverage pre-trained large language models (LLMs) provided by NVIDIA AI Enterprise, optimizing performance and facilitating accurate, real-time decision-making close to the data source. Supermicro’s Hyper-E server, based on dual 5th Gen Intel Xeon processors, supports up to three NVIDIA H100 Tensor Core GPUs, delivering data center-level AI processing power to edge locations. The Supermicro SYS-221HE server, with front or rear servicing options, can be installed in various environments and is capable of handling AI workloads at the edge. An example of such a solution is Eviden’s AI-powered retail solution, which uses Supermicro edge systems and NVIDIA technologies to enhance the shopping experience with interactive 3D models and personalized chatbots.