- Tokyo, Japan
- https://www.linkedin.com/in/hironsan
- @Hironsan13
More
Stars
Embedded property graph database built for speed. Vector search and full-text search built in. Implements Cypher.
Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
Matplotlib styles for scientific plotting
Toolkit for linearizing PDFs for LLM datasets/training
This SDK is now deprecated, use the new unified Google GenAI SDK.
Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search
Neo4j graph construction from unstructured data using LLMs
OpenTelemetry Instrumentation for AI Observability
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
A course on aligning smol models.
Japanese translation of Open Source AI Definition
Task-Aware Agent-driven Prompt Optimization Framework
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured data, assuming a real-world scenario. The sample aims to be …
This lab is a 1-day/2-day end-to-end SLM workshop led and developed by AI GBB. Attendees will learn how to quickly and easily perform the data preparation-fine tuning-serving-LLMOps series of proce…
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Omnivore is a complete, open source read-it-later solution for people who like reading.
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
Tools for merging pretrained large language models.
The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)
Convert PDF to markdown + JSON quickly with high accuracy