NVIDIA said it has launched the NeMo Retriever microservices, a new suite of tools designed to power enterprises’ multilingual generative AI capabilities. The microservices use embedding and reranking techniques to support accurate, context-aware information retrieval across languages, improving AI’s ability to process diverse data sets.
Strengthening multilingual AI systems
The NeMo Retriever microservices are now accessible through the NVIDIA API catalog, giving enterprises the ability to extract and analyze data in a variety of languages and formats. They let businesses connect generative AI to a wide range of data sources to deliver more accurate and actionable insights.
By integrating NeMo Retriever, organizations can achieve a 35x improvement in data storage efficiency thanks to advancements such as long-context support and dynamic embedding resizing. With embeddings that take up less space, large-scale processing and storage can fit on a single server, making AI solutions more scalable and cost-effective.
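To make the link between embedding size and storage cost concrete, here is a rough sketch of the arithmetic. The vector count, dimensions, and 4-byte values are illustrative assumptions, not NVIDIA's figures, and the sketch does not reproduce the 35x number. The `resize_embedding` helper is hypothetical: it shows one simple way to shorten a vector (keep the leading values, then rescale to unit length), which only preserves retrieval quality for models trained to support it.

```python
import math


def storage_bytes(num_vectors, dims, bytes_per_value=4):
    """Raw storage for dense vectors, ignoring any index overhead."""
    return num_vectors * dims * bytes_per_value


def resize_embedding(vector, target_dims):
    """Hypothetical helper: keep the first target_dims values and
    rescale the result to unit length."""
    head = list(vector[:target_dims])
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return head
    return [x / norm for x in head]


# Assumed workload: 10 million vectors, shortened from 2048 to 384 dimensions.
full = storage_bytes(10_000_000, 2048)
small = storage_bytes(10_000_000, 384)
print(f"full size:  {full / 1e9:.2f} GB")
print(f"resized:    {small / 1e9:.2f} GB")
print(f"reduction:  {full / small:.2f}x")
```

Under these assumptions, shrinking the dimension count alone cuts raw vector storage by the ratio of the two sizes. Larger overall gains would have to come from combining this with other factors, such as embedding longer passages per vector.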
Industry adoption and impact
Leading industry players, including DataStax, Cloudera, and SAP, are already implementing these microservices to enhance their AI products. For example, Wikimedia worked with DataStax to vectorize more than 10 million Wikidata items with NeMo Retriever in 3 days, a task that previously took 30 days. The faster turnaround supports real-time updates and expands multilingual accessibility for global users.
Additionally, companies like Cloudera and Cohesity are integrating NeMo Retriever into their platforms to improve multilingual data processing and search accuracy. These integrations demonstrate the potential of the microservices to drive significant business impact by overcoming linguistic and contextual barriers.
Breaking the language barrier
NeMo Retriever addresses critical challenges in enterprise AI, including processing massive amounts of data and ensuring accurate text search across languages. Designed for a variety of applications including search, question answering, and recommendation systems, it improves the adaptability and efficiency of AI solutions globally.
The microservices’ ability to accurately process long documents, such as contracts or medical records, helps deliver reliable and consistent results in complex scenarios, while also making better use of resources as deployments scale.
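The embedding and reranking stages described in this article follow a common two-stage retrieval pattern, which can be sketched as follows. This is a toy illustration only: `toy_embed` and `toy_rerank_score` are hypothetical stand-ins based on word counts, not multilingual models, and nothing here calls or represents the NeMo Retriever API. In a real system both functions would be replaced by calls to embedding and reranking models.

```python
import math
from collections import Counter


def toy_embed(text):
    """Stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def toy_rerank_score(query, passage):
    """Stand-in for a reranking model: share of query words in the passage."""
    q_words = set(query.lower().split())
    p_words = set(passage.lower().split())
    return len(q_words & p_words) / len(q_words) if q_words else 0.0


def retrieve_then_rerank(query, passages, embed=toy_embed,
                         rerank_score=toy_rerank_score, top_k=3, top_n=1):
    """Two-stage retrieval: shortlist by vector similarity, then rescore."""
    q_vec = embed(query)
    # Stage 1: cheap vector similarity over every passage, keep the top_k.
    shortlist = sorted(passages, key=lambda p: cosine(q_vec, embed(p)),
                       reverse=True)[:top_k]
    # Stage 2: apply the more expensive scorer to the shortlist only.
    return sorted(shortlist, key=lambda p: rerank_score(query, p),
                  reverse=True)[:top_n]


passages = [
    "the contract renewal date is in march",
    "medical records are stored securely",
    "renewal of the contract requires a signature",
    "the weather is nice today",
]
print(retrieve_then_rerank("contract renewal date", passages))
```

The design point is that the first stage is cheap enough to run over the whole collection, while the second, costlier stage only sees a small shortlist.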
Availability
Developers can explore the capabilities of NeMo Retriever and other NIM microservices through the NVIDIA API catalog. Additionally, a free 90-day developer license for NVIDIA AI Enterprise is available to facilitate the development of efficient multilingual information retrieval systems.