How SDSC Processed 2.3M Surgical Video Frames in One Month with Lightly
Lightly enabled SDSC to process 2.3 million surgical video frames in one month and accelerate labeling by 10×, intelligently selecting only the most informative frames to train their YOLOv8 instrument detection model.
Margaux Masson-Forsythe
Director of Machine Learning
Overview
Lightly enabled SDSC to process 2.3 million surgical video frames in one month and accelerate labeling by 10×, intelligently selecting only the most informative frames to train their YOLOv8 instrument detection model.
Industry
Healthcare
Location
San Diego, CA, U.S
Employee
<100
Get Started with Lightly
Talk to Lightly’s computer vision team about your use case.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Improving YOLOv8 Instrument Detection Using Active Learning on Surgical Videos
Surgical Data Science Collective (SDSC) is a non-profit research organization committed to enhancing surgical outcomes by harnessing the potential of artificial intelligence. SDSC collaborates with surgeons to produce extensive metrics from surgical videos, offering them valuable insights to enhance their procedures. SDSC also assists surgeons in creating searchable libraries of their surgical videos, ensuring secure backups. By providing surgeons with their state-of-the-art technology, SDSC strives to enhance patient outcomes and advance the field of medicine.
Problem
Obtaining and working with high-quality data from surgical videos is a challenge. It is crucial for the success of SDSC projects to effectively curate this data and extract the necessary information for optimal training of the models. However, labeling every frame in hours-long videos with 30 frames per second is impossible. Moreover, this approach would result in a huge number of similar-looking images and an imbalance in the variety of surgical tools labeled. For instance, suction instruments are almost constantly present on the screen.
Testimonials
"Through this collaboration, SDSC and Lightly have combined their expertise to revolutionize the process of frame selection in surgical videos, making it more efficient and accurate than ever before to find the best subset of frames for labeling and model training."
Margaux Masson-Forsythe
Director of Machine Learning
Scalable and Efficient Data Curation using Lightly
SDSC has collaborated with Lightly to successfully implement an innovative and dynamic active machine learning solution. This cutting-edge solution is designed to intelligently and actively select the most optimal frames for labeling and retraining purposes. Through this collaboration, SDSC and Lightly have combined their expertise to revolutionize the process of frame selection in surgical videos, making it more efficient and accurate than ever before to find the best subset of frames for labeling and model training.
SDSC has implemented an active machine-automated learning pipeline using Lightly that:
Runs an inference on the selected videos with the selected pre-trained model (YOLOv8)
Creates all required Lightly files, organizes data for optimal processing in Lightly and uploads all data to an Amazon S3 bucket
Starts an EC2 instance (g4dn.2xlarge with 8 vCPUs, 32GB of system memory, one T4 GPU – image: Deep Learning AMI GPU PyTorch 1.13.1 (Amazon Linux 2)) configured to be a Lightly worker (configuration was done following this documentation)
Starts the Lightly worker and schedules the Lightly processing run in a Docker container on the EC2 instance
Waits for the Lightly process (active learning component) to be complete
Stops the EC2 instance
Sends images to Encord Annotate and creates an Annotation Project ready to be labeled and sent to labelers
Results
Using Lightly, SDSC was able to:
Process over 2.3 million frames within the first month of use
10x their labeling speed
Get Started with Lightly
Talk to Lightly’s computer vision team about your use case.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
No fluff—just results from teams using Lightly to move faster with better data and models.
"We had millions of images but no clear way to prioritize. Manual selection was slow and full of guesswork. With Lightly, we just feed in the data and get back what’s actually worth labeling."
Carlos Alvarez
Machine Learning Engineer
“Lightly enabled us to improve our ML data pipeline in all regards: Selection, Efficiency, and Functionality. This allowed us to cut customer onboarding time by 50% while achieving better model performance.”
Harishma Dayanidhi
Co-Founder/ VP of Engineering
“By integrating Lightly into our existing workflow, we achieved a 90% reduction in dataset size and doubled the efficiency of our deployment process. The tool’s seamless implementation significantly enhanced our data pipeline.”
Usman Khan
Sr. Data Scientist
“Lightly gave us transparency to a part of the ML development that is a black box, data. Furthermore, Lightly enabled us to do Active Learning at scale and helped us improve recall and F1-score of our object detector by 32% and 10% compared to our previous data selection method. We finally saw the light in our data using Lightly.”
Gonzalo Urquieta
Project Leader
"Lightly is hyper-focused on finding thousands of relevant images from millions of video frames to improve deep learning models. The Lightly platform enabled us to build models and deploy features more than 2x faster and unlock completely new development workflows."
Isura Ranatunga
Co-Founder and CTO
"I was truly amazed once we received the results of Lightly. We knew we had a lot of similar images due to our video feed but the results showed us how we can work more efficiently by selecting the right data"
Alejandro Garcia
CEO
Explore Lightly Products
Lightly One
Data Selection & Data Viewer
Get data insights and find the perfect selection strategy