This repository contains the code for the paper "LangNavBench: Evaluation of Natural Language Understanding in Semantic Navigation". This README provides instructions for running the experiments reported in the paper.
Recent progress in large vision–language models has driven improvements in language-based semantic navigation, where an embodied agent must reach a target object described in natural language. Despite these advances, we still lack a clear, language-focused benchmark for testing how well such agents ground the words in their instructions. We address this gap with LangNav, an open-set dataset specifically created to test an agent’s ability to locate objects described at different levels of detail, from broad category names to fine attributes and object–object relations. Every description in LangNav was manually checked, yielding a lower error rate than existing lifelong- and semantic-navigation datasets. On top of LangNav we build LangNavBench, a benchmark that measures how well current semantic-navigation methods understand and act on these descriptions while moving toward their targets. LangNavBench makes it possible to systematically compare models on their handling of attributes, spatial and relational cues, and category hierarchies, offering the first thorough, language-centred evaluation of embodied navigation systems. We also present the Multi-Layered Feature Map (MLFM), a method that builds a queryable multi-layered semantic map and is particularly effective when dealing with small objects or instructions involving spatial relations. MLFM outperforms state-of-the-art mapping-based navigation baselines on the LangNav dataset.
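Clone the repository: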
# https
git clone https://github.com/3dlg-hcvc/langmonmap.git
# or ssh
git clone git@github.com:3dlg-hcvc/langmonmap.git
Create a conda environment and install habitat-sim v0.2.5:
# create conda and install habitat
conda create -n langnav python=3.9 cmake=3.14.0 habitat-sim=0.2.5 headless -c conda-forge -c aihabitat
conda activate langnav
Install dependencies:
cd langmonmap
python -m pip install -r requirements.txt
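As an optional sanity check, confirm that habitat-sim imports from the new environment; this should print 0.2.5 if the install succeeded:
python -c "import habitat_sim; print(habitat_sim.__version__)"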
Clone the YOLOv7 repository:
git clone https://github.com/WongKinYiu/yolov7
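Depending on your detector config, you may also need pretrained YOLOv7 weights (an assumption; check your config first). The official checkpoint can be fetched from the YOLOv7 releases page, for example:
wget -P weights/ https://github.com/WongKinYiu/yolov7/releases/download/v0.1/yolov7.pt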
Build planning utilities:
python3 -m pip install ./planning_cpp/
mkdir -p weights/
Download the SED CLIP model weights from the OneMap repository and place them under `weights/`.
Follow the instructions for the Habitat Synthetic Scenes Dataset (HSSD) and download the scenes. Link them in `datasets/scene_datasets/fphab/`.
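For example, assuming the HSSD scenes were downloaded to /path/to/hssd-hab (a placeholder path), you can symlink them as:
mkdir -p datasets/scene_datasets
ln -s /path/to/hssd-hab datasets/scene_datasets/fphab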
Follow the HuggingFace LangNav dataset page to download the data splits and place them inside `datasets/langnav`.
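One way to fetch the splits is with huggingface-cli; the dataset ID below is an assumption, so substitute the ID shown on the dataset page:
huggingface-cli download 3dlg-hcvc/langnav --repo-type dataset --local-dir datasets/langnav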
You can run the evaluation on the test split with:
python eval_mlfm.py --config config/lnav/mlfm_conf.yaml
The evaluation run saves its results in the `results/` directory. You can summarize them with:
python read_results_mlfm.py --config config/lnav/mlfm_conf.yaml
You can find all the YAML files for the experiments reported in the paper under `config/lnav/paper`.
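For example, to reproduce one of the paper experiments (the file name is a placeholder; pick any YAML from that directory):
python eval_mlfm.py --config config/lnav/paper/<experiment>.yaml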
Our repository builds on the open-source OneMap repository. We use assets from HSSD to build our dataset.
If you use this code in your research, please cite our paper:
@misc{langnavbenchraychaudhuri,
title={LangNavBench: Evaluation of Natural Language Understanding in Semantic Navigation},
}