
Cloudera launches AI inference service with Nvidia integration

  • October 10, 2024

Cloudera has launched Cloudera AI Inference, a new service that uses Nvidia technology to speed up the processing of AI models. The service focuses on securely managing and deploying large AI applications, including generative AI (GenAI), and delivers up to 36x faster performance.

Cloudera AI Inference is one of the first AI inference services to use Nvidia NIM microservices. This integration, part of the Nvidia AI Enterprise platform, enables faster deployment and management of large AI models, so organizations can move GenAI out of the pilot phase and into production more efficiently. The service helps developers create and manage Large Language Models (LLMs) with advanced security and scalability.

The collaboration between Cloudera and Nvidia improves performance through the use of Nvidia’s Tensor Core GPUs, resulting in up to 36 times faster processing than traditional methods. The service's UI and APIs integrate directly with Nvidia NIM microservice containers. This reduces the need for command-line interfaces (CLIs) and other complex tooling, and simplifies the management and monitoring of AI models.

Security and scalability are the focus

A key feature of Cloudera AI Inference is its emphasis on security and privacy. The service prevents sensitive data from being shared with vendor-hosted AI model services by giving companies control over the development and deployment of their own AI models. Additionally, the service supports both on-premises and cloud-based deployments, providing flexibility for organizations that require strict regulatory compliance.

The service includes scaling, monitoring and security features that let companies deploy AI models efficiently while meeting compliance standards and governance requirements. Automatic scaling and real-time performance tracking help identify and resolve issues quickly and keep resource usage under control. Cloudera’s integration with Nvidia provides a solution for companies that want to deploy reliable AI without complex DIY approaches.

Source: IT Daily
