WD Tagger with Region Detection

Overview

This project is an image tagging tool that enhances tag accuracy by using region detection techniques. It's designed to improve tagging for machine learning models, based on Waifu Diffusion (WD) models, by addressing limitations in latent space resolution.

Key Features

Support for multiple pre-trained models:
- ViT
- SwinV2
- ConvNeXT
- EVA02 Large
Flexible region detection with YOLO models
Customizable tag thresholds
Ability to add or remove tags for specific regions
Batch processing
Optional text file output for tags
Solves the problem with wrong tags caused by detection (for example tags close-up, portrait when using face detection)

Installation

git clone https://github.com/MindB1ast/wdv3-timm.git
cd wdv3-timm

Install Dependencies

pip install -r requirements.txt

Usage

Usage example in BatchWork.ipynb

Basic Example

from Scripts import ScriptOptions, BatchTagging

params = ScriptOptions(
    ImageFolder="./TestPic/",
    model='big',  # Options: convnext, swinv2, big, vit
    gen_threshold=0.35,  # Confidence for general tags (confidence when tagging full picture, for threshold when tagging detected area change config)
    char_threshold=0.75,  
    batch=2, 
    recursive=False, 
    save_txt=True,   
    append_txt=True  
)

result = BatchTagging(params)

Configuration Options

ScriptOptions Parameters

ImageFolder: Path to the directory containing images
model: Tagging model to use (vit, swinv2, convnext, big)
gen_threshold: Confidence threshold for general tags (default: 0.35)
char_threshold: Confidence threshold for character tags (default: 0.75)
batch: Number of images to process simultaneously
recursive: Process images in subdirectories
save_txt: Save tags to text files
append_txt: Append tags to existing text files

Custom Models and Detectors

Place yours Yolo models in the models/ directory
Configure detectors in detectors.json

Detector Configuration Example

[
  {
    "name": "person_detector", 
    "model_path": "person_yolov8s-seg.pt",
    "confidence": 0.35,
    "classes": [0],
    "remove_tags_from_full": ["tag1", "tag2"],
    "remove_tags_from_region": [],
    "add_tags_to_region": {},
    "exclude_from_region": [],
    "region_gen_threshold": 0.25,
    "region_char_threshold": 0.8
  }
]

Advanced Usage

Visualization

You can use the view_image_results() function to visualize detection results:

from Scripts.visualization import view_image_results

view_image_results(result, image_index=0, visualize=True)

Сводка метрик (в %) для 12 изображений

Метод	Precision	Recall	F1-score
Объединенные теги(c yolo)	72.72	71.75	70.09
Полное изображение(без yolo)	76.23	60.31	62.89

Contributing

Fork the repository
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
Scripts		Scripts
TestTagPic		TestTagPic
.gitignore		.gitignore
Analitycs.ipynb		Analitycs.ipynb
ManagingTags.ipynb		ManagingTags.ipynb
README.md		README.md
detectors-Copy1.json		detectors-Copy1.json
detectors.json		detectors.json
requirements.txt		requirements.txt
updated_results.json		updated_results.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

WD Tagger with Region Detection

Overview

Key Features

Installation

Install Dependencies

Usage

Basic Example

Configuration Options

ScriptOptions Parameters

Custom Models and Detectors

Detector Configuration Example

Advanced Usage

Visualization

Contributing

About

Uh oh!

Releases

Packages

Languages

MindB1ast/wdv3-timm

Folders and files

Latest commit

History

Repository files navigation

WD Tagger with Region Detection

Overview

Key Features

Installation

Install Dependencies

Usage

Basic Example

Configuration Options

ScriptOptions Parameters

Custom Models and Detectors

Detector Configuration Example

Advanced Usage

Visualization

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages