8000 content-indexing · GitHub Topics · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
#

content-indexing

Here is 1 public repository matching this topic...

A Python tool that uses Google's Gemini AI to automatically extract structured metadata from PDF and DOCX documents, saving results to Excel for easy analysis and organizing raw responses as JSON files.

  • Updated Mar 21, 2025
  • Python

Improve this page

Add a description, image, and links to the content-indexing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the content-indexing topic, visit your repo's landing page and select "manage topics."

Learn more

0