Embodied Image Quality Assessment for Robotic Intelligence
The first embodied image quality assessment (EIQA) benchmark

Jianbo Zhang¹, Chunyi Li¹, Liang Yuan¹^, Guoquan Zheng², Jie Hao², Guangtao Zhai¹^,

¹Shanghai Jiaotong University, ²Beijing University of Chemical Technology. ^*Corresponding author.

Why we do this? Image quality assessment (IQA) of user-generated content (UGC) is a critical technique for human quality of experience (QoE). However, for robot-generated content (RGC), will its image quality be consistent with the Moravec paradox and counter to human common sense? Human subjective scoring is more based on the attractiveness of the image. Embodied agent are required to interact and perceive in the environment, and finally perform specific tasks. Visual images as inputs directly influence downstream tasks. In this paper, we first propose an embodied image quality assessment (EIQA) frameworks.

Release

[2024/12/26] 🔥 Github repo for EPD-Bench is online.
[To Do] [ ] Expand richer data for datasets.

EPD-Bench Construction

In contrast to traditional IQA image collection methods, embodied AI requires interaction with the surrounding environment. The ultimate goal is robot-oriented image quality assessment, and thus, image collection is also done by the robot itself. Two classical reinforcement learning algorithms, the Proximal Policy Optimization (PPO) and the Soft Actor Critic (SAC), and a state-of-the-art method, TDMPC2, are used to perform the 2 tasks in the SAPIEN simulator, respectively. A monocular camera is used to capture RGB images as sensor data input to the model.

Based on a simulated environment ManiSkill, a robotic arm acts as an embodied intelligence to perform simple push and pick tasks. For the robot, different quality of image inputs have different impacts on the robot to complete the task, which is also directly related to the performance of the robot.

HVS & RVS are different

is a gap between the robot vision system and the human vision system, and the current image quality assessment from the human perspective is limited.

Evaluation

Comparison of 14 IQA methods for BL (Baseline), FR (Full Reference), NR (No reference) respectively on EPD benchmarks. For detail on differnet content types, please check our paper.

Metric	SRCC↑	PLCC↑	KRCC↑	PUSH_SRCC↑	PUSH_PLCC↑	PUSH_KRCC↑	PICK_SRCC↑	PICK_PLCC↑	PICK_KRCC↑
PSNR	0.1233	0.1356	0.0819	0.0811	0.0927	0.0539	0.0972	0.1138	0.0645
SSIM	0.0597	0.0635	0.0396	0.0633	0.0716	0.0417	0.0228	0.0275	0.0152
PieAPP	0.3616	0.3853	0.2466	0.1727	0.2023	0.1165	0.1604	0.1802	0.1061
CKDN	0.6971	0.6654	0.5062	0.2100	0.2186	0.1404	0.3372	0.3040	0.2266
IQT	0.5435	0.5416	0.3814	0.3918	0.3706	0.2650	0.5613	0.5664	0.3920
AHIQ	0.4199	0.4382	0.2888	0.2425	0.2707	0.1674	0.3061	0.3157	0.2074
DISTS	0.2113	0.2107	0.1428	0.1335	0.1608	0.0906	0.1135	0.1036	0.0985
TOPIQ-FR	0.1265	0.1232	0.0845	0.1615	0.1684	0.1113	0.1207	0.1185	0.0794
HyperIQA	0.3212	0.3289	0.2212	0.3099	0.2981	0.2098	0.4055	0.3927	0.2733
DBCNN	0.1921	0.2133	0.1307	0.0962	0.1039	0.0643	0.1882	0.1855	0.1251
MANIQA	0.5267	0.5603	0.3675	0.2475	0.2574	0.1661	0.5847	0.5859	0.4116
CLIPIQA	0.1821	0.2160	0.1218	0.0750	0.0893	0.0503	0.1464	0.1497	0.0978
TempQT	0.2340	0.1730	0.1450	0.1040	0.0980	0.0560	0.2210	0.1400	0.1500
TOPIQ-NR	0.1253	0.1193	0.0828	0.0995	0.1043	0.0667	0.0846	0.0602	0.0568

PUSH means push box subset, and PICK means pick box subset.

Contact

Please contact any of the first authors of this paper for queries.

Jianbo Zhang, sjtu5029101@sjtu.edu.cn@sjtu.edu.cn

Citation

If you find our work interesting, please feel free to cite our paper:

@misc{zhang2024embodiedimagequalityassessment,
      title={Embodied Image Quality Assessment for Robotic Intelligence}, 
      author={Jianbo Zhang and Chunyi Li and Liang Yuan and Guoquan Zheng and Jie Hao and Guangtao Zhai},
      year={2024},
      eprint={2412.18774},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2412.18774}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
datasets		datasets
figure		figure
tools		tools
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Release

EPD-Bench Construction

HVS & RVS are different

Evaluation

Contact

Citation

About

Releases

Packages

Languages

License

Jianbo-maker/EPD_benchmark

Folders and files

Latest commit

History

Repository files navigation

Release

EPD-Bench Construction

HVS & RVS are different

Evaluation

Contact

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages