📊 Rapid Table

🌟 简介

RapidTable库是专门用来文档类图像的表格结构还原，表格结构模型均属于序列预测方法，结合RapidOCR，将给定图像中的表格转化对应的HTML格式。

slanet_plus是paddlex内置的SLANet升级版模型，准确率有大幅提升

unitable是来源unitable的transformer模型，精度最高，暂仅支持pytorch推理，支持gpu推理加速,训练权重来源于 OhMyTable项目

📅 最近动态

2025-06-22 update: 发布v2.x，适配rapidocr v3.x
2025-01-09 update: 发布v1.x，全新接口升级。
2024.12.30 update：支持Unitable模型的表格识别，使用pytorch框架
2024.11.24 update：支持gpu推理，适配 rapidOCR 单字识别匹配,支持逻辑坐标返回及可视化
2024.10.13 update：补充最新paddlex-SLANet-plus 模型(paddle2onnx原因暂不能支持onnx)

📸 效果展示

🖥️ 支持设备

通过ONNXRuntime推理引擎支持：

DirectML
昇腾NPU

具体使用方法：

安装（需要卸载其他onnxruntime）:

# DirectML
pip install onnxruntime-directml

# 昇腾NPU
pip install onnxruntime-cann

使用：

from rapidocr import RapidOCR

from rapid_table import ModelType, RapidTable, RapidTableInput

# DirectML
ocr_engine = RapidOCR(params={"EngineConfig.onnxruntime.use_dml": True})
input_args = RapidTableInput(
    model_type=ModelType.SLANETPLUS, engine_cfg={"use_dml": True}
)

# 昇腾NPU
ocr_engine = RapidOCR(params={"EngineConfig.onnxruntime.use_cann": True})

input_args = RapidTableInput(
    model_type=ModelType.SLANETPLUS,
    engine_cfg={"use_cann": True, "cann_ep_cfg.gpu_id": 1},
)

table_engine = RapidTable(input_args)

img_path = "https://raw.githubusercontent.com/RapidAI/RapidTable/refs/heads/main/tests/test_files/table.jpg"

ori_ocr_res = ocr_engine(img_path)
ocr_results = [ori_ocr_res.boxes, ori_ocr_res.txts, ori_ocr_res.scores]

results = table_engine(img_path, ocr_results=ocr_results)
results.vis(save_dir="outputs", save_name="vis")

🧩 模型列表

`model_type`	模型名称	推理框架	模型大小	推理耗时(单图 60KB)
`ppstructure_en`	`en_ppstructure_mobile_v2_SLANet.onnx`	onnxruntime	7.3M	0.15s
`ppstructure_zh`	`ch_ppstructure_mobile_v2_SLANet.onnx`	onnxruntime	7.4M	0.15s
`slanet_plus`	`slanet-plus.onnx`	onnxruntime	6.8M	0.15s
`unitable`	`unitable(encoder.pth,decoder.pth)`	pytorch	500M	cpu(6s) gpu-4090(1.5s)

模型来源
PaddleOCR 表格识别
 PaddleX-SlaNetPlus 表格识别
 Unitable

模型下载地址：link

🛠️ 安装

版本依赖关系如下：

`rapid_table`	OCR
v0.x	`rapidocr_onnxruntime`
v1.0.x	`rapidocr>=2.0.0,<3.0.0`
v2.x	`rapidocr>=3.0.0`

由于模型较小，预先将slanet-plus表格识别模型(slanet-plus.onnx)打包进了whl包内。其余模型在初始化RapidTable类时，会根据model_type来自动下载模型到安装包所在models目录下。当然也可以通过RapidTableInput(model_path='')来指定自己模型路径。注意仅限于我们现支持的model_type。

⚠️注意：rapid_table>=v1.0.0之后，不再将rapidocr依赖强制打包到rapid_table中。使用前，需要自行安装rapidocr包。

⚠️注意：rapid_table>=v0.1.0,<1.0.0之后，不再将rapidocr依赖强制打包到rapid_table中。使用前，需要自行安装rapidocr_onnxruntime包。

pip install rapidocr
pip install rapid_table

# 基于torch来推理unitable模型
pip install rapid_table[torch] # for unitable inference

# onnxruntime-gpu推理
pip uninstall onnxruntime
pip install onnxruntime-gpu # for onnx gpu inference

🚀 使用方式

🐍 Python脚本运行

⚠️注意：在rapid_table>=1.0.0之后，模型输入均采用dataclasses封装，简化和兼容参数传递。输入和输出定义如下：

ModelType支持已有的4个模型 (source)：

class ModelType(Enum):
    PPSTRUCTURE_EN = "ppstructure_en"
    PPSTRUCTURE_ZH = "ppstructure_zh"
    SLANETPLUS = "slanet_plus"
    UNITABLE = "unitable"

CPU使用

from rapidocr import RapidOCR

from rapid_table import ModelType, RapidTable, RapidTableInput

ocr_engine = RapidOCR()

input_args = RapidTableInput(model_type=ModelType.UNITABLE)
table_engine = RapidTable(input_args)

img_path = "https://raw.githubusercontent.com/RapidAI/RapidTable/refs/heads/main/tests/test_files/table.jpg"

# # 使用单字识别
# ori_ocr_res = ocr_engine(img_path, return_word_box=True)
# ocr_results = [
#     [word_result[0][2], word_result[0][0], word_result[0][1]]
#     for word_result in ori_ocr_res.word_results
# ]
# ocr_results = list(zip(*ocr_results))

ori_ocr_res = ocr_engine(img_path)
ocr_results = [ori_ocr_res.boxes, ori_ocr_res.txts, ori_ocr_res.scores]
results = table_engine(img_path, ocr_results=ocr_results)
results.vis(save_dir="outputs", save_name="vis")

GPU使用

from rapidocr import RapidOCR

from rapid_table import ModelType, RapidTable, RapidTableInput

ocr_engine = RapidOCR()

# onnxruntime-gpu
input_args = RapidTableInput(
    model_type=ModelType.SLANETPLUS, engine_cfg={"use_cuda": True, "gpu_id": 1}
)

# torch gpu
# input_args = RapidTableInput(
#     model_type=ModelType.UNITABLE,
#     engine_cfg={"use_cuda": True, "cuda_ep_cfg.gpu_id": 1},
# )
table_engine = RapidTable(input_args)

img_path = "https://raw.githubusercontent.com/RapidAI/RapidTable/refs/heads/main/tests/test_files/table.jpg"

ori_ocr_res = ocr_engine(img_path)
ocr_results = [ori_ocr_res.boxes, ori_ocr_res.txts, ori_ocr_res.scores]

results = table_engine(img_path, ocr_results=ocr_results)
results.vis(save_dir="outputs", save_name="vis")

📦 终端运行

rapid_table test_images/table.jpg -v

📝 结果

📎 返回结果

<html>
<body>
<table>
    <tr>
        <td>Methods</td>
        <td></td>
        <td></td>
        <td></td>
        <td>FPS</td>
    </tr>
    <tr>
        <td>SegLink [26]</td>
        <td>70.0</td>
        <td>86d>
            <td.0
        </td>
        <td>77.0</td>
        <td>8.9</td>
    </tr>
    <tr>
        <td>PixelLink [4]</td>
        <td>73.2</td>
        <td>83.0</td>
        <td>77.8</td>
        <td></td>
    </tr>
    <tr>
        <td>TextSnake [18]</td>
        <td>73.9</td>
        <td>83.2</td>
        <td>78.3</td>
        <td>1.1</td>
    </tr>
    <tr>
        <td>TextField [37]</td>
        <td>75.9</td>
        <td>87.4</td>
        <td>81.3</td>
        <td>5.2</td>
    </tr>
    <tr>
        <td>MSR[38]</td>
        <td>76.7</td>
        <td>87.87.4</td>
        <td>81.7</td>
        <td></td>
    </tr>
    <tr>
        <td>FTSN [3]</td>
        <td>77.1</td>
        <td>87.6</td>
        <td>82.0</td>
        <td></td>
    </tr>
    <tr>
        <td>LSE[30]</td>
        <td>81.7</td>
        <td>84.2</td>
        <td>82.9</td>
        <>
        <ttd></td>
    </tr>
    <tr>
        <td>CRAFT [2]</td>
        <td>78.2</td>
        <td>88.2</td>
        <td>82.9</td>
        <td>8.6</td>
    </tr>
    <tr>
        <td>MCN[16]</td>
        <td>79</td>
        <td>88</td>
        <td>83</td>
        <td></td>
    </tr>
    <tr>
        <td>ATRR</
        >[35]</td>
        <td>82.1</td>
        <td>85.2</td>
        <td>83.6</td>
        <td></td>
    </tr>
    <tr>
        <td>PAN [34]</td>
        <td>83.8</td>
        <td>84.4</td>
        <td>84.1</td>
        <td>30.2</td>
    </tr>
    <tr>
        <td>DB[12]</td>
        <td>79.2</t91/d>
        <td>91.5</td>
        <td>84.9</td>
        <td>32.0</td>
    </tr>
    <tr>
        <td>DRRG[41]</td>
        <td>82.30</td>
        <td>88.05</td>
        <td>85.08</td>
        <td></td>
    </tr>
    <tr>
        <td>Ours (SynText)</td>
        <td>80.68</td>
        <td>85
            <t..40
        </td>
        <td>82.97</td>
        <td>12.68</td>
    </tr>
    <tr>
        <td>Ours (MLT-17)</td>
        <td>84.54</td>
        <td>86.62</td>
        <td>85.57</td>
        <td>12.31</td>
    </tr>
</table>
</body>
</html>

🖼️ 可视化结果

<>

Methods				FPS
SegLink [26]	70.0	86d>	77.0	8.9
PixelLink [4]	73.2	83.0	77.8
TextSnake [18]	73.9	83.2	78.3	1.1
TextField [37]	75.9	87.4	81.3	5.2
MSR[38]	76.7	87.87.4	81.7
FTSN [3]	77.1	87.6	82.0
LSE[30]	81.7	84.2	82.9
CRAFT [2]	78.2	88.2	82.9	8.6
MCN[16]	79	88	83
ATRR[35]	82.1	85.2	83.6
PAN [34]	83.8	84.4	84.1	30.2
DB[12]	79.2	91.5	84.9	32.0
DRRG[41]	82.30	88.05	85.08
Ours (SynText)	80.68	85	82.97	12.68
Ours (MLT-17)	84.54	86.62	85.57	12.31

🔄 与TableStructureRec关系

TableStructureRec库是一个表格识别算法的集合库，当前有wired_table_rec有线表格识别算法和lineless_table_rec无线表格识别算法的推理包。

RapidTable是整理自PP-Structure中表格识别部分而来。由于PP-Structure较早，这个库命名就成了rapid_table。

总之，RapidTable和TabelStructureRec都是表格识别的仓库。大家可以都试试，哪个好用用哪个。由于每个算法都不太同，暂时不打算做统一处理。

关于表格识别算法的比较，可参见TableStructureRec测评

📌 更新日志 (more)

2024.12.30 update

支持Unitable模型的表格识别，使用pytorch框架

2024.11.24 update

支持gpu推理，适配 rapidOCR 单字识别匹配,支持逻辑坐标返回及可视化

2024.10.13 update

补充最新paddlex-SLANet-plus 模型(paddle2onnx原因暂不能支持onnx)

2023-12-29 v0.1.3 update

优化可视化结果部分

2023-12-27 v0.1.2 update

添加返回cell坐标框参数
完善可视化函数

2023-07-17 v0.1.0 update

将rapidocr_onnxruntime部分从rapid_table中解耦合出来，给出选项是否依赖，更加灵活。
增加接口输入参数ocr_result：
- 如果在调用函数时，事先指定了ocr_result参数值，则不会再走OCR。其中ocr_result格式需要和rapidocr_onnxruntime返回值一致。
- 如果未指定ocr_result参数值，但是事先安装了rapidocr_onnxruntime库，则会自动调用该库，进行识别。
- 如果ocr_result未指定，且rapidocr_onnxruntime未安装，则会报错。必须满足两个条件中一个。

2023-07-10 v0.0.13 updata

更改传入表格还原中OCR的实例接口，可以传入其他OCR实例，前提要与rapidocr_onnxruntime接口一致

2023-07-06 v0.0.12 update

去掉返回表格的html字符串中的<thead></thead><tbody></tbody>元素，便于后续统一。
采用Black工具优化代码

Name		Name	Last commit message	Last commit date
Latest commit History 95 Commits
.github		.github
docs		docs
rapid_table		rapid_table
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
demo.py		demo.py
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Repository files navigation

📊 Rapid Table

🌟 简介

📅 最近动态

📸 效果展示

🖥️ 支持设备

🧩 模型列表

🛠️ 安装

🚀 使用方式

🐍 Python脚本运行

CPU使用

GPU使用

📦 终端运行

📝 结果

📎 返回结果

🖼️ 可视化结果

🔄 与TableStructureRec关系

📌 更新日志 (more)

2024.12.30 update

2024.11.24 update

2024.10.13 update

2023-12-29 v0.1.3 update

2023-12-27 v0.1.2 update

2023-07-17 v0.1.0 update

2023-07-10 v0.0.13 updata

2023-07-06 v0.0.12 update

About

Uh oh!

Releases 8

Sponsor this project

Uh oh!

Contributors 4

Languages

Uh oh!

License

RapidAI/RapidTable

Folders and files

Latest commit

History

Repository files navigation

📊 Rapid Table

🌟 简介

📅 最近动态

📸 效果展示

🖥️ 支持设备

🧩 模型列表

🛠️ 安装

🚀 使用方式

🐍 Python脚本运行

CPU使用

GPU使用

📦 终端运行

📝 结果

📎 返回结果

🖼️ 可视化结果

🔄 与TableStructureRec关系

📌 更新日志 (more)

2024.12.30 update

2024.11.24 update

2024.10.13 update

2023-12-29 v0.1.3 update

2023-12-27 v0.1.2 update

2023-07-17 v0.1.0 update

2023-07-10 v0.0.13 updata

2023-07-06 v0.0.12 update

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 8

Sponsor this project

Uh oh!

Contributors 4

Languages