8000 ZhijunLStudio · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View ZhijunLStudio's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report ZhijunLStudio

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ZhijunLStudio/README.md

Hi there 👋

This is Zhijun Li!

CSDN Gmail

Welcome to my Github page! I'm passionate about programming and technology, focusing on the artificial intelligence domain. I love turning ideas into code and sharing knowledge with the community.

AI Research

🔭 Current Work:

  • Multimodal Image Understanding
  • Text-to-Image Generation Models
  • Text-to-Video Generation Systems

🌱 Previous Experience:

  • Object Detection and Segmentation
  • Video Understanding Algorithms
  • Computer Vision Applications

💪 Things I am challenging myself with:

  • Contributing to open-source AI projects
  • Sharing knowledge through technical blogs
  • Exploring emerging multimodal architectures

💻 Programming languages and tools:



Pinned Loading

  1. YOLOv10-TensorRT-CUDA YOLOv10-TensorRT-CUDA Public

    YOLOV10+CUDA+TENSORRT

    C++ 3

  2. yolov7_tensorrt_opencv_queue yolov7_tensorrt_opencv_queue Public

    Use onnx to export tensorrt model to achieve high-performance deployment

    Python 3

  3. DAMO-YOLO-ONNX DAMO-YOLO-ONNX Public

    DAMO-YOLO uses onnxruntime inference deployment

    Python 4 1

  4. PaddleMIX PaddleMIX Public

    Forked from PaddlePaddle/PaddleMIX

    Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …

    Python 1

  5. MultiModal-Evaluator MultiModal-Evaluator Public

    MultiModal-Evaluator 是一个用于评估视觉-语言模型(VLM)在图像理解任务上性能的工具。该工具通过异步处理图像样本,支持多种提示词输入,并使用语言模型进行自动评分,为模型性能比较提供客观数据支持。

    Python

  6. Rejection-Sampling-Evaluator Rejection-Sampling-Evaluator Public

    一个用于多模态模型输出优化的拒绝采样评估工具,专注于系统框图和电路图的网表提取任务。该工具通过生成多个候选结果并使用评分模型选择最佳结果,提高生成质量。

    Python

0