As artificial intelligence continues to progress at a lightning-fast pace, new technologies are making these powerful tools more accessible than ever before. NVIDIA, the company behind graphics technologies powering some of the world’s fastest supercomputers, just released a breakthrough conversational AI demo app – Chat with RTX. Chat with RTX allows users to run a personalized AI chatbot powered by an RTX GPU directly on their Windows PC. This article provides an overview of Chat with RTX and step-by-step instructions on how to use the app to connect your own data to a large language model through this app for fast, generative AI capabilities on your local device.
Table of Contents
Introducing NVIDIA Chat with RTX Demo App
NVIDIA Chat with RTX allows users to run their own personal AI chatbot on Windows PCs and workstations. It allows users to connect the data stored on their PC as a dataset to query an AI model, like Mistral, Llama 2, etc., to retrieve contextually relevant answers from this personalized knowledge base. By running entirely on a user’s local RTX-powered Windows PC, Chat with RTX provides generative AI capabilities without requiring an internet connection or sending private data to the cloud.
How Chat with RTX Works?
Under the hood, Chat with RTX leverages powerful techniques like retrieval augmented generation (RAG) to search your data and provide the most pertinent answers quickly. NVIDIA’s TensorRT optimization further accelerates the model for lightning-fast inferences directly on your RTX GPU’s tensor cores. This unique combination of local processing and custom data integration results in an AI assistant that understands you and your interests in an intensely personal way. By connecting your data to an LLM on your RTX-powered PC, you can experience the speed and efficiency of generative AI in action.
System Requirements
Chat with RTX has the following minimum system requirements:
- Windows 10 or 11
- GeForce RTX 30 series or higher GPU with at least 8GB of VRAM
- 16GB of system RAM
- Latest NVIDIA graphics drivers
How to Use NVIDIA Chat with RTX
Step 1: Download the Demo App
To get started, you’ll need to download the demo app. Head over to the official NVIDIA website to download the app.
Step 2: Unzip the App
Once the download is complete, locate the downloaded file and unzip it.
Step 3: Start the App
The Chat with RTX executable file will automatically launch and open a web interface running locally on your system. It starts up instantly. Upon opening, you’ll notice two essential elements at the top: the “AI model” on the left and the “dataset” on the right.
Step 4: Select an AI Model
Choose an AI model from the “AI Model” dropdown that suits your needs.
Step 5: Attach Files for Questioning
You can attach your files to the dataset and ask questions about them. To do this, navigate to the dataset section and select the Folder Path option. Click on the “change folder” button to attach the desired folder containing your files. Supported file formats include .txt, .doc, and .pdf files.
Step 6: Ask Questions
With your files attached, you can now start asking questions in the chat window. The AI model you chose will extract information from your attached files to provide relevant, contextual responses. Rest assured, your documents remain safe and private on your PC throughout the entire process.
Step 7: Explore YouTube Videos
NVIDIA Chat with RTX goes beyond local files. You can also extract valuable insights from YouTube videos. Under the Dataset option, select “YouTube URL.” Copy the URL of the desired video and paste it into the box under YouTube URL. Click on the “Download” button, and the video transcription will be added. You can now ask questions about the video’s content. This personal AI assistant is best for analyzing long YouTube videos.
Develop Custom Chatbots with NVIDIA
Chat with RTX chatbot utilizes the open-source TensorRT-LLM RAG reference project available on GitHub. Developers can use this codebase to build their own personalized chatbot applications optimized for RTX GPU acceleration.
Limitations and Future Plans
While Chat with RTX is a promising early demonstration of locally accelerated generative AI capabilities, the initial release does have some limitations that NVIDIA is actively working to address. The application currently requires installation to the default directory for compatibility reasons. An identified issue causes the installation to fail if a custom location is selected. NVIDIA is developing a fix for this limitation to provide more flexibility in a future version. For now, users should use the default installation directory (“C:\Users<username>\AppData\Local\NVIDIA\ChatWithRTX”)
Experience the Power of NVIDIA Chat with RTX Today
With NVIDIA Chat with RTX AI App, the possibilities are endless. Create a personalized chatbot that harnesses the power of generative AI and connects your data seamlessly. This groundbreaking technology showcases the power of personalized generative AI, all directly on your local device. So, whether you’re seeking information from local files or analyzing YouTube videos, NVIDIA Chat with RTX empowers you to get the answers you need quickly and securely. Try NVIDIA Chat with RTX now!
| Also Read: