Crypto Flexs
  • DIRECTORY
  • CRYPTO
    • ETHEREUM
    • BITCOIN
    • ALTCOIN
  • BLOCKCHAIN
  • EXCHANGE
  • TRADING
  • HACKING
  • SLOT
  • CASINO
  • SUBMIT
Crypto Flexs
  • DIRECTORY
  • CRYPTO
    • ETHEREUM
    • BITCOIN
    • ALTCOIN
  • BLOCKCHAIN
  • EXCHANGE
  • TRADING
  • HACKING
  • SLOT
  • CASINO
  • SUBMIT
Crypto Flexs
Home»ADOPTION NEWS»NVIDIA Unveils Generative AI-Based Visual AI Agent for Edge Deployments
ADOPTION NEWS

NVIDIA Unveils Generative AI-Based Visual AI Agent for Edge Deployments

By Crypto FlexsJuly 17, 20243 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
NVIDIA Unveils Generative AI-Based Visual AI Agent for Edge Deployments
Share
Facebook Twitter LinkedIn Pinterest Email

Timothy Morano
July 17, 2024 18:22

NVIDIA powers AI at the edge with Jetson Orin platform, launches Vision Language Models (VLM) for dynamic video analytics





According to the NVIDIA Technical Blog, Vision Language Models (VLM), an exciting innovation in AI technology, provide a more dynamic and flexible way to analyze video. VLM makes the technology more accessible and adaptable by allowing users to interact with image and video inputs using natural language. These models can run on the NVIDIA Jetson Orin edge AI platform or on discrete GPUs via NIM.

What is a Visual AI Agent?

Visual AI agents are powered by VLM, which allows users to ask a wide range of questions in natural language and gain insights that reflect true intent and context from recorded or live video. These agents can be interacted with and integrated with other services and mobile apps via easy-to-use REST APIs. This new generation of visual AI agents helps summarize scenes using natural language, create broad alerts, and extract actionable insights from videos.

NVIDIA Metropolis provides a visual AI agent workflow, a reference solution that accelerates the development of AI applications powered by VLM, extracting insights through contextual understanding from video, whether deployed at the edge or in the cloud.

For cloud deployments, developers can power visual AI agents using NVIDIA NIM, a set of inference microservices that include industry-standard APIs, domain-specific code, optimized inference engines, and enterprise runtimes. Visit the API catalog to explore and try out basic models directly in your browser.

Building Visual AI Agents for the Edge

Jetson Platform Services is a set of pre-built microservices that provide essential built-in capabilities for building computer vision solutions on NVIDIA Jetson Orin. These microservices include AI services that support generative AI models such as zero-shot detection and state-of-the-art VLM. VLM combines large-scale language models with vision transformers to enable complex inference on text and visual inputs.

The VLM of choice for Jetson is VILA, which optimizes tokens per image to deliver cutting-edge inference capabilities and speed. Combining VLM with Jetson Platform Services allows you to create VLM-based visual AI agent applications that detect events on live streaming cameras and send notifications to users via mobile apps.

Integration with mobile apps

The entire end-to-end system can now be integrated with mobile apps to build VLM-based Visual AI Agents. To receive video input for VLM, Jetson Platform Services networking services and VST automatically discover and provide network-connected IP cameras. These cameras are available to VLM services and mobile apps via the VST REST API.

In the app, users can set up custom notifications in natural language, such as “Is there a fire?”, on selected live streams. Once the notification rules are set, VLM evaluates the live stream and notifies the user in real time via a WebSocket connected to the mobile app. This triggers a pop-up notification on the mobile device, allowing the user to ask follow-up questions in chat mode.

conclusion

This development highlights the potential of VLM combined with Jetson Platform Services to build advanced Visual AI agents. The full source code for the VLM AI service is available on GitHub, which developers can use to learn how to use VLM and build their own microservices.

For more information, visit the NVIDIA Technology Blog.

Image source: Shutterstock


Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Ether Lee (ETH) tests major support for $ 4,453 after the highest rejection.

August 31, 2025

Bitcoin analysts bet on $ 200K after hints of Fed.

August 23, 2025

‘Self -transactions, dressed in capital layout’: The cryptocurrency financial craze divides the industry.

August 15, 2025
Add A Comment

Comments are closed.

Recent Posts

TOKEN2049 Singapore stops all records with the world’s largest Web3 event with 25,000 attendees in unprecedented demand.

September 3, 2025

Simultaneously Mine Dogecoin (DOGE), Ripple (XRP), And SOL

September 3, 2025

Simultaneously Mine Dogecoin (DOGE), Ripple (XRP), And SOL

September 3, 2025

Cango Inc. Announces August 2025 Bitcoin Production And Mining Operations Update

September 2, 2025

BitMine Immersion (BMNR) Announces Release Of August Investor Presentation And Latest Video Message From Tom Lee, Chairman

September 2, 2025

Pioneering AI Visionary Vincent Boucher & AGI Alpha Announce A Meta‑Agentic AGI Jobs Marketplace Platform

September 2, 2025

Meme Coin Little Pepe Raises Above $24M In Presale With Over 39,000 Holders

September 2, 2025

Bybit WSOT 2025 Attracts Quadruple Squads As $8M Main Competition Commences

September 2, 2025

Duration Of The Process And Important Nuances

September 2, 2025

PrimeXBT Launches “Empowering Traders To Succeed” Campaign, Leading A New Era Of Trading

September 2, 2025

Korean sleeves cut Tesla and pivot with encryption stocks.

September 2, 2025

Crypto Flexs is a Professional Cryptocurrency News Platform. Here we will provide you only interesting content, which you will like very much. We’re dedicated to providing you the best of Cryptocurrency. We hope you enjoy our Cryptocurrency News as much as we enjoy offering them to you.

Contact Us : Partner(@)Cryptoflexs.com

Top Insights

TOKEN2049 Singapore stops all records with the world’s largest Web3 event with 25,000 attendees in unprecedented demand.

September 3, 2025

Simultaneously Mine Dogecoin (DOGE), Ripple (XRP), And SOL

September 3, 2025

Simultaneously Mine Dogecoin (DOGE), Ripple (XRP), And SOL

September 3, 2025
Most Popular

NEAR Protocol surged 54% in one week, and analysts predict further gains in 2024.

December 21, 2023

Solana’s $160 Resistance Rejected – Will SOL Fall to $120?

May 10, 2024

Hyra Network Wins Prestigious Chairman’s Award 2025 At WITSA Global AI Summit

August 21, 2025
  • Home
  • About Us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms and Conditions
© 2025 Crypto Flexs

Type above and press Enter to search. Press Esc to cancel.