8000 [Feature] Auto Labeling Implementation by Hironsan · Pull Request #1206 · doccano/doccano · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

[Feature] Auto Labeling Implementation #1206

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 42 commits into from
Feb 19, 2021
Merged

[Feature] Auto Labeling Implementation #1206

merged 42 commits into from
Feb 19, 2021

Conversation

Hironsan
Copy link
Member
@Hironsan Hironsan commented Feb 19, 2021

This PR allows users to label text automatically. I think this enables users to speed up annotation.

How it works

This feature enables automatic labeling by calling the Web API from doccano. Therefore, you can use any commercial service (e.g. Google Natural Language API, Amazon Comprehend, Watson, etc.) or your own server for labeling, as long as you can call the API from doccano. Notice that there is no learning function for now. This is an issue for the future.

How to use

Configuration

  1. Select "Settings" on the side menu.

image

  1. Select "Auto Labeling" tab and press "Create" button.

image

  1. Select a configuration template. Some tasks have predefined templates to simplify the configuration.

image

  1. Enter the parameters required to use the API.

image

  1. Write a mapping template to extract labels from API responses. If you select the predefined template, you can skip this process.

image

  1. Map the label fetched from the API to a label defined by you.

image

Turn on the feature

  1. Go to the annotation page.
  2. Select the settings button.
  3. Turn on the feature.

Notice that if you don't have any configuration, you can't use this feature.

Future works

  • Assigning a configuration to each user
  • Allow the admin to set throttling for each user
  • Allow merging of responses from multiple APIs
  • Implementing the training feature
  • Increase the number of the predefined templates(https://github.com/doccano/auto-labeling-pipeline)

close #191

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Feature request: Add Auto-labeling
1 participant
0