- 15 May 2023
- DarkLight
- PDF
Model Assisted Annotations
- Updated On 15 May 2023
- DarkLight
- PDF
ML/AI assisted annotations helps accelerate the human touch, reducing human time, effort, and cognition invested by up to 90%!
With Dataloop, you can integrate existing models or run our pre-trained models to automatically pre-label complete datasets.
Automation pipelines are the most efficient time and cost saving element enabling massive data reduction via smart pre and post processing flows.
Annotation Studio Level Processing
By using one of Dataloop's pre-configured models, or your own custom model, annotators can activate a prediction function on an item according to need.
Automation models are available in the Image and Video annotation studios only
Pre-Configured Models
When enabled from a task recipe, an annotator can select the model/function to run on the item currently active in the annotation studio.
To activate a pre-configured annotation model:
- Make sure that automations are enabled from recipe instructions
- Enter an annotation studio
- From the left-side tool-selection panel, select the Automation tool
- Select the model to run from the dropdown below
Run Your Automation Model
By installing your own model and deploying it to the respective UI-Slot, your model can be used by annotators in the same way as Dataloops' default models.
For more information, follow this tutorial to add your model as a FaaS package, and deploy it to a UI slot
Filtering Model Annotations By Confidence
When working with automated annotations in the annotation studio, Dataloop provides a filter that allows filtering annotations by their confidence score. The filter works with a minimum and maximum range, so you can filter out the lowest score annotations and focus on the highest ones, or filter out the high score ones to be able to select and delete all the lower-score annotations.
- In the annotation studio's right-side annotations panel, locate the Group-Mode button and switch it to "By Creator" - Annotations will be grouped by its creator and not by label, hence group by models.
- On every model's group. locate the filter dropdown to see the confidence filter range selection. Adjust the Min and Max values, and annotations will filter in/out of the studio.
Pre-Processing - Model Annotations
Dataloop enables you to preprocess all the data in advance as per your project needs (for example as part of Pipeline processes, or to simply avoid having annotators manually running it from the annotation studio).
Before you start:
- Install your model and run it as a service - follow this tutorial to learn how
- Deploy your model service to relevant UI slots - read here to learn how
Run Model On Data (Dataset Browser)
- Filter the items you intend to preprocess (read the Data Filtering tutorial)
- Ensure your model service is deployed to the Dataset-browser UI-Slot
- Select the Automation tool from the top actions bar, and select your model from the dropdown
Manually Trigger Data To Model Service
- From the Dataset browser, select 1 or more items
- From the right-side panel, select the relevant Function and click on EXECUTE
Preprocess With Data Pipelines
For more information about working with Pipelines, read the respective documentation section.
- From the left navigation menu, select the Pipelines menu item
- Start a new Pipeline by pressing on "CREATE PIPELINE", and either start a new empty one or select from an available template
- Drag & drop a FaaS node onto the pipeline canvas, and from node settings select your model function from the list, to et this node to run your model prediction function
- Send data to the Pipeline by triggering it or by connecting a dataset node before the FaaS one, and your items will go through the Pipeline and processed - model annotations will be saved on the item
- Finish your pipeline composition to achieve and other preprocessing steps and technical/business goals
Upload Model Annotations
Using Dataloops' SDK, you can upload annotations containing model information and annotation confidence score. The process is easy and straightforward, using the Builder for creating annotations in the Dataloop format.
Here's an example of creating and uploading a bounding-box annotation using the builder, with model information and confidence score:
builder.add( annotation_definition=dl.Box( top=startY, left=startX, right=endX, bottom=endY, label='person' ), model_info={ 'name': 'Caffe', 'confidence': confidence } )