In computer vision, a bounding box is a rectangular coordinate system used to localize and delineate an object within an image or video frame. It is typically defined by the coordinates of its top-left corner (x1, y1) and its bottom-right corner (x2, y2), or by its center coordinates, width, and height. Bounding boxes are a fundamental component in object detection tasks, where the goal of a model is not only to classify what objects are present in an image but also to accurately identify their precise locations. The prediction of accurate bounding boxes is a key metric for evaluating the performance of object detection models, forming the basis for metrics like Intersection Over Union (IOU), which measures the overlap between predicted and ground-truth bounding boxes.
Data Selection & Data Viewer
Get data insights and find the perfect selection strategy
Learn MoreSelf-Supervised Pretraining
Leverage self-supervised learning to pretrain models
Learn MoreSmart Data Capturing on Device
Find only the most valuable data directly on device
Learn More