8000 sunxuening (sunxuening) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View sunxuening's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report sunxuening

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

大语言模型微调,Qwen2VL、Qwen2、GLM4指令微调

Jupyter Notebook 385 57 Updated Jan 25, 2025
Python 217 21 Updated Jan 31, 2024

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 33,210 2,660 Updated May 9, 2025

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 7,584 544 Updated Jan 3, 2025

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 55,776 7,472 Updated Nov 13, 2024

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,125 546 Updated May 23, 2024

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

Python 4,120 382 Updated Aug 13, 2024

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,146 768 Updated Oct 16, 2024

百度NLP:分词,词性标注,命名实体识别,词重要性

C++ 3,931 596 Updated May 25, 2021

一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda

Python 1,829 169 Updated Mar 18, 2025

keras implement of transformers for humans

Python 5,405 928 Updated Nov 11, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,212 2,620 Updated Mar 4, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 144,155 28,897 Updated May 11, 2025

Graphene SQLAlchemy integration

Python 987 227 Updated Apr 7, 2025

最小二乘法—多项式拟合非线性函数

Python 58 33 Updated Sep 10, 2018

pytorch handbook是一本开源的书籍,目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门,其中包含的Pytorch教程全部通过测试保证可以成功运行

Jupyter Notebook 20,909 5,432 Updated Jul 25, 2024

"table2JSON", "table2XML", "table2PNG","table2CSV","table2Excel","table2Word","table2Powerpoint","table2txt","table2PDF"

JavaScript 1,252 579 Updated Nov 28, 2021

”深蓝词库转换“ 一款开源免费的输入法词库转换程序

C# 8,563 653 Updated Dec 23, 2024

中文拼写检查工具,用于对中文文本中的错误用语进行检测并给出纠正建议

Java 36 14 Updated Jan 7, 2018

hankcs / text-classification-svm

The missing SVM-based text classification module implementing HanLP's interface

Java 47 25 Updated Dec 28, 2017

Extract Keywords from sentence or Replace keywords in sentences.

Python 5,648 603 Updated Apr 13, 2025

HanLP Analyzer for Elasticsearch

Java 1 Updated Dec 28, 2018

HanLP Analyzer for Elasticsearch

Java 842 227 Updated Jul 12, 2024

for algorithm implementation and testing.

Jupyter Notebook 23 27 Updated Aug 27, 2020

一个生产级、高性能、模块化、可扩展的中文NLP工具包。(中文分词、平均感知机、fastText、拼音、新词发现、分词纠错、BM25、人名识别、命名实体、自定义词典)

Java 681 90 Updated Dec 21, 2023

Samples for Spring Cloud Contract project

Java 388 317 Updated Apr 7, 2025

Implementing Facebook's FastText with java

158 23 Updated Apr 23, 2020

Flight rules for git

42,071 3,198 Updated Apr 11, 2025

Java JNA wrapper for Tesseract OCR API

Java 1,672 381 Updated Feb 15, 2025
Next
0