Generative AI continues to reshape industries through advanced foundational models that improve content creation and data interpretation. According to the NVIDIA Technology Blog, NVIDIA has launched two new model families from the NVIDIA AI Foundation: Phi-3 and Granite Code.
Phi-3 language model
Developed in collaboration with Microsoft, the Phi-3 series includes a Small Language Model (SLM) optimized for high performance and computational efficiency. These models excel at tasks such as content generation, summarization, question answering, and sentiment analysis. Its powerful reasoning capabilities make it ideal for a variety of applications that require logical reasoning and accurate responses.
Phi-3 Vision Model
The Phi-3 family also features the Phi-3 Vision model, a 4.2 billion parameter multimodal model designed to process and interpret both textual and visual data. Supporting 128K tokens, this model can analyze complex visual elements within images, such as charts, graphs, and tables, making it ideal for data-intensive tasks.
granite cord
IBM provided the Granite Code model, an open programming model designed to support a variety of coding tasks. Trained in 116 programming languages, these models can generate code examples, identify and fix errors, and provide explanations for code segments. Its performance on coding benchmarks is state-of-the-art and it is trained on licensable data, making it ideal for enterprise use.
Optimized for performance
Both Phi-3 and Granite Code models are optimized for latency and throughput using NVIDIA TensorRT-LLM. These models join more than 36 popular AI models supported by NVIDIA NIM, a microservice designed to simplify large-scale deployment of performance-optimized models. NVIDIA NIM allows you to significantly increase the number of enterprise application developers who can contribute to AI innovation.
NVIDIA continues to collaborate with leading model builders to support models in a fully accelerated stack while ensuring optimal performance and ease of deployment.
start
Visit the API Catalog to experience, customize, and deploy these models in your enterprise applications. With free NVIDIA cloud credits, developers can start testing models at scale and build proofs of concept by connecting their applications to NVIDIA-hosted API endpoints running on a fully accelerated stack.
Image source: Shutterstock
. . .
tag