Extract Frames¶

The Extract Frames tab allows you to select frames for labeling using various methods.

Upload videos and automatically extract random frames
Upload zipped files of frames
Automatically extract frames using a given model (active learning)

Upload videos and automatically extract random frames¶

Select the appropriate option from the list.

Drag and drop video file(s) using the provided interface. You will see an upload progress bar.

Choose number of frames to label per video - these frames will be automatically selected to maximize the diversity of poses from each video.

You can also select the portion of the video to extract frames from. If the beginning and/or end of your videos do not contain the animals or contain extra objects (e.g. experimenter hands) we recommend excluding these portions.

Click “Extract frames” once the video upload is complete, and another progress bar will appear.

Once all frames have been extracted you will see “Proceed to the next tab to label frames” in green.

Upload zipped files of frames¶

Select the appropiate option from the list.

Drag and drop zipped files(s) of frames using the provided interface. You will see an upload progress bar.

Warning

At the moment this feature of the app requires a strict file structure!

As an example, let’s say you have a video named subject023_session0.mp4 and you have extracted frames 143, 1156, and 34567, which you want to label.

You will need to create a single zip file named subject023_session0.zip. The zip file must contain png files, and they must follow the naming convention img%08.png, for example img00000143.png (such that there are 8 digits for the frame number, with leading zeros).

If you would like to fit context models, you must also include context frames for each labeled frame. Again using frame 143 as an example, you must include five files:

img00000141.png
img00000142.png
img00000143.png
img00000144.png
img00000145.png

Including context frames is recommended, though not required.

Finally, you must include a csv file named selected_frames.csv that is simply a list of the file names of the frames you wish to label (not the context frames), so that LabelStudio knows which frames to upload into its database. For the example above, the csv file should look like:

img00000143.png
img00001156.png
img00034567.png

Therefore, the final set of files that must be zipped into subject023_session0.zip for this example is:

img00000141.png
img00000142.png
img00000143.png
img00000144.png
img00000145.png
img00001154.png
img00001155.png
img00001156.png
img00001157.png
img00001158.png
img00034565.png
img00034566.png
img00034567.png
img00034568.png
img00034569.png
selected_frames.csv

If you would like to upload frames for multiple videos, make one zip file per video.

Click “Extract frames” once the zip file upload is complete.

Once all frames have been extracted you will see “Proceed to the next tab to label frames” in green.

Automatically extract frames using a given model (active learning)¶

This option allows you to choose frames to label that are “difficult” for a given model. Since there is no ground truth, frames are selected based on likelihood values and other metrics that are correlated with pixel error (large temporal jumps and PCA reprojection errors; see the original Lightning Pose paper for technical details).

Note

This option will not appear until at least one model has been trained.

First you will need to determine which videos you would like to extract frames from. Next, you will need to run inference on those videos with a given model in the “Train/Infer” tab; see the inference documentation.

After you have completed inference you can return to the “Extract Frames” tab and select the appropriate option from the list:

You will then be able to select which model you would like to use (which should be the same model you used to run inference in the “Train/Infer” tab). Once the model is selected you will see a list of all videos where inference has already been performed. Select one or more videos, and as before you may also enter the number of frames per video you would like to label, as well as use the slider to exclude frames from the beginning and/or end of the video.

Click “Extract frames”, and you will quickly see the green message informing you that your frames are ready for labeling.