Feature: Multi-modal support, process image & document attachments, etc. · Issue #3 · coreylane/slackrock · GitHub
Feature: Multi-modal support, process image & document attachments, etc. #3
Open
@coreylane

Description

  1. The Bedrock Converse API currently accepts only image data (png | jpeg | gif | webp), so files like PDFs and MS Office docs would be out of scope.

  2. Claude 3 models are currently the only models that support "Vision" capabilities, per the Bedrock docs: https://docs.aws.amazon.com/bedrock/latest/userguide/conversation-inference.html#conversation-inference-supported-models-features
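
The format restriction in point 1 could be enforced before anything is sent to Bedrock. The following is a minimal sketch, not slackrock's actual code: `image_format_for` and `build_image_block` are hypothetical helper names, and treating `.jpg` as `jpeg` is an assumption about how attachment filenames would be normalized.

```python
from pathlib import Path
from typing import Optional

# Image formats listed in this issue as supported for image input.
SUPPORTED_IMAGE_FORMATS = {"png", "jpeg", "gif", "webp"}


def image_format_for(filename: str) -> Optional[str]:
    """Return the image format for a filename, or None if it is out of scope."""
    ext = Path(filename).suffix.lower().lstrip(".")
    if ext == "jpg":  # assumption: treat .jpg as jpeg
        ext = "jpeg"
    return ext if ext in SUPPORTED_IMAGE_FORMATS else None


def build_image_block(filename: str, data: bytes) -> Optional[dict]:
    """Build an image content block, or None for unsupported files (PDF, docx, ...)."""
    fmt = image_format_for(filename)
    if fmt is None:
        return None
    return {"image": {"format": fmt, "source": {"bytes": data}}}
```

A caller could skip any attachment for which `build_image_block` returns `None` and tell the user that the file type is not supported.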

Code example:

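    import boto3

    # Assumed setup, not part of the original snippet: the client, model ID,
    # prompt, and image path below are placeholder values.
    bedrock_client = boto3.client("bedrock-runtime")
    model_id = "anthropic.claude-3-sonnet-20240229-v1:0"
    input_text = "Describe this image."
    with open("image.png", "rb") as f:
        image = f.read()  # raw image bytes

    # One user message: a text block followed by an image block.
    message = {
        "role": "user",
        "content": [
            {"text": input_text},
            {
                "image": {
                    "format": "png",  # png | jpeg | gif | webp
                    "source": {"bytes": image},
                }
            },
        ],
    }

    messages = [message]

    # Send the message.
    response = bedrock_client.converse(
        modelId=model_id,
        messages=messages,
    )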
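
To post the model's answer back to Slack, the reply text has to be pulled out of the Converse response. This is a rough sketch under the assumption that the response follows the documented `output.message.content` shape; `extract_text` is a hypothetical helper name, not something from the repo.

```python
def extract_text(response: dict) -> str:
    """Join the text blocks of a Converse response into a single string."""
    content = response["output"]["message"]["content"]
    # Non-text blocks are skipped; only blocks carrying a "text" key are kept.
    return "".join(block["text"] for block in content if "text" in block)
```

With the `response` from the call above, `extract_text(response)` would give the string to send as the Slack reply.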
