VLM Parsing Example Bento

This is a BentoML service that demonstrates how to parse image using multi-modal LLM and extract useful information from them.

The work is based on @PsiACE's blog post.

Start the development server

This project is managed by PDM, install it first.

Install dependencies:

pdm install

Create .env file with credentials:

cp .env.example .env
# Complete the OPENAI_API_KEY in the .env file

Start the development server:

pdm dev

Deploy to BentoCloud

Go to BentoCloud and get an account.
Login to BentoCloud:
```
pdm run bentoml cloud login
```
Click "Secret" in the sidebar and create an "OpenAI" secret with your API key.
Deploy the service to BentoCloud:
```
pdm deploy --secret <your-secret-name>
```

License

This work is released under Unlicense. You can use it for any purpose without any restriction.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.env.template		.env.template
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
image.png		image.png
pdm.lock		pdm.lock
pyproject.toml		pyproject.toml
service.py		service.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

VLM Parsing Example Bento

Start the development server

Deploy to BentoCloud

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

frostming/BentoVLM

Folders and files

Latest commit

History

Repository files navigation

VLM Parsing Example Bento

Start the development server

Deploy to BentoCloud

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages