sunxuening

🎯

Focusing

sunxuening sunxuening

🎯

Focusing

5 followers · 5 following

wow
China Beijing
sunxuening

Stars

Zeyi-Lin / LLM-Finetune

大语言模型微调，Qwen2VL、Qwen2、GLM4指令微调

Jupyter Notebook 385 57 Updated Jan 25, 2025

HIT-SCIR-SC / QiaoBan

Python 217 21 Updated Jan 31, 2024

opendatalab / MinerU

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具，将PDF转换成Markdown和JSON格式。

Python 33,210 2,660 Updated May 9, 2025

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 7,584 544 Updated Jan 3, 2025

zylon-ai / private-gpt

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 55,776 7,472 Updated Nov 13, 2024

CLUEbenchmark / CLUE

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,125 546 Updated May 23, 2024

IDEA-CCNL / Fengshenbang-LM

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系，成为中文AIGC和认知智能的基础设施。

Python 4,120 382 Updated Aug 13, 2024

LianjiaTech / BELLE

BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）

HTML 8,146 768 Updated Oct 16, 2024

baidu / lac

百度NLP：分词，词性标注，命名实体识别，词重要性

C++ 3,931 596 Updated May 25, 2021

425776024 / nlpcda

一键中文数据增强包； NLP数据增强、bert数据增强、EDA：pip install nlpcda

Python 1,829 169 Updated Mar 18, 2025

bojone / bert4keras

keras implement of transformers for humans

Python 5,405 928 Updated Nov 11, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,212 2,620 Updated Mar 4, 2025

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 144,155 28,897 Updated May 11, 2025

graphql-python / graphene-sqlalchemy

Graphene SQLAlchemy integration

Python 987 227 Updated Apr 7, 2025

privateEye-zzy / Nonlinear_function_fitting

最小二乘法—多项式拟合非线性函数

Python 58 33 Updated Sep 10, 2018

zergtant / pytorch-handbook

pytorch handbook是一本开源的书籍，目标是帮助那些希望和使用PyTorch进行深度学习开发和研究的朋友快速入门，其中包含的Pytorch教程全部通过测试保证可以成功运行

Jupyter Notebook 20,909 5,432 Updated Jul 25, 2024

kayalshri / tableExport.jquery.plugin

"table2JSON", "table2XML", "table2PNG","table2CSV","table2Excel","table2Word","table2Powerpoint","table2txt","table2PDF"

JavaScript 1,252 579 Updated Nov 28, 2021

studyzy / imewlconverter

”深蓝词库转换“ 一款开源免费的输入法词库转换程序

C# 8,563 653 Updated Dec 23, 2024

zhangsonglei / ChineseSpellingCheck

中文拼写检查工具，用于对中文文本中的错误用语进行检测并给出纠正建议

Java 36 14 Updated Jan 7, 2018

hankcs / text-classification-svm

The missing SVM-based text classification module implementing HanLP's interface

Java 47 25 Updated Dec 28, 2017

vi3k6i5 / flashtext

Extract Keywords from sentence or Replace keywords in sentences.

Python 5,648 603 Updated Apr 13, 2025

sunxuening / elasticsearch-analysis-hanlp

Forked from KennFalcon/elasticsearch-analysis-hanlp

HanLP Analyzer for Elasticsearch

Java 1 Updated Dec 28, 2018

KennFalcon / elasticsearch-analysis-hanlp

HanLP Analyzer for Elasticsearch

Java 842 227 Updated Jul 12, 2024

fs302 / Algorithms

for algorithm implementation and testing.

Jupyter Notebook 23 27 Updated Aug 27, 2020

jimichan / mynlp

一个生产级、高性能、模块化、可扩展的中文NLP工具包。（中文分词、平均感知机、fastText、拼音、新词发现、分词纠错、BM25、人名识别、命名实体、自定义词典）

Java 681 90 Updated Dec 21, 2023

spring-cloud-samples / spring-cloud-contract-samples

Samples for Spring Cloud Contract project

Java 388 317 Updated Apr 7, 2025

spring-cloud-samples / eureka

Java 511 436 Updated Feb 24, 2025

jimichan / fastText4j

Implementing Facebook's FastText with java

158 23 Updated Apr 23, 2020

k88hudson / git-flight-rules

Flight rules for git

42,071 3,198 Updated Apr 11, 2025

nguyenq / tess4j

Java JNA wrapper for Tesseract OCR API

Java 1,672 381 Updated Feb 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly