课程: Advanced LLMs with Retrieval Augmented Generation (RAG): Practical Projects for AI Applications

免费学习该课程!

今天就开通帐号,24,700 门业界名师课程任您挑!

Hands-on lab: Multimodal PDF retrieval

Hands-on lab: Multimodal PDF retrieval

百度 穆迪则表示,中国企业的评级不会受贸易争端影响。

- To see the example of how to index and search image based document, we will go over this notebook, the multimodal PDF. We have the same visual improvements, as always. Now, let's load some PDF. We have here a PDF for a manual for a device that I purchased. So remember, we're not going to use the text aspect of this PDF because we are ready. We might miss a lot of the visual cues that are in the image. Therefore we'll just take the PDF and convert it into image. We'll do that for all the PDFs in this directory. I downloaded all the manuals of this device. We see that we have a few images here, all of from this user guide. Let's see a quick view of how it looks. So these are the pages. As you can guess, there are some titles and images and explanation. Each one of the buttons and options in this device. It's a simple example, but it represent nicely the capabilities of the image models. So now let's say "download the image model", the "ColPali" model. We are loading it is usually from…

内容