FEATURE: Versatile `Video` class #1924

Ashp116 · 2025-07-30T19:29:49Z

Description

This PR allows the process_video function to include the audio stream from the source video in the final annotated output. Previously, the function only rendered video frames and discarded the audio, resulting in silent output videos. This change ensures that the output video maintains both visual and audio components, addressing issue #1923.

This PR requires the imageio-ffmpeg dependency, which enables audio stream handling during video writing. You can find it here: imageio-ffmpeg on PyPI

Type of change

Bug fix (non-breaking change which fixes an issue)

How has this change been tested, please provide a testcase or example of how you tested the change?

Please refer to #1923

Any specific deployment considerations

Ensure that imageio-ffmpeg is installed in the environment.

Docs

Docs updated? What were the changes

SkalskiP · 2025-07-31T15:32:54Z

Hi @Ashp116 👋🏻 Another great idea! Video processing is probably the oldest part of supervision, written over two years ago, and I’ve been wanting to update its API for a while. Would you be open to not only adding audio support but also helping me with the update?

Ashp116 · 2025-07-31T18:23:34Z

Hi @SkalskiP, yea, I would like to help update the API. I was thinking of changing how videos are written in process_video. The original compression is lost when annotations are added and the file is written to a target_path. But yea, I would like to help out with the update.

SkalskiP · 2025-08-01T09:54:27Z

Hi @Ashp116 I'm really glad you want to help me! Let's goooo! 🔥 🔥 🔥

I want the functionalities currently found in supervision.utils.video to be reorganized around a new Video class. Importantly, all features previously available in the old API must still be supported in the new one. Ideally, the new API should be more consistent and expressive.

get video info (works for files, RTSP, webcams)

import supervision as sv
 
# static video
sv.Video("source.mp4").info

# video stream
sv.Video("rtsp://...").info

# webcam
sv.Video(0).info

simple frame iteration (object is iterable)

import supervision as sv

video = sv.Video("source.mp4")
for frame in video:
    ...

advanced frame iteration (stride, sub-clip, on-the-fly resize)

import supervision as sv

for frame in sv.Video("source.mp4").frames(stride=5, start=100, end=500, resolution_wh=(1280, 720)):
    ...
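A minimal sketch of the selection semantics that `frames(stride=..., start=..., end=...)` suggests, written against a plain iterable rather than a real video. The helper name `select_frames` is hypothetical, and treating `end` as exclusive is an assumption; the proposal does not say either way.

```python
from itertools import islice
from typing import Iterable, Iterator, Optional, TypeVar

T = TypeVar("T")


def select_frames(
    frames: Iterable[T],
    stride: int = 1,
    start: int = 0,
    end: Optional[int] = None,
) -> Iterator[T]:
    """Return every `stride`-th item from index `start` up to, not including, `end`."""
    if stride < 1:
        raise ValueError("stride must be >= 1")
    # islice skips the first `start` items lazily, so no frame list is built.
    return islice(frames, start, end, stride)


# Integers stand in for decoded frames here.
picked = list(select_frames(range(1000), stride=5, start=100, end=500))
```

With a real decoder, skipped frames could be passed over with a cheap `grab` instead of a full decode, which is presumably why `grab` appears in the suggested protocol.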

process the video

import cv2
import supervision as sv

def blur(frame, i):
    return cv2.GaussianBlur(frame, (11, 11), 0)

sv.Video("source.mp4").save(
    "blurred.mp4",
    callback=blur,
    show_progress=True
)

overwrite target video parameters

import supervision as sv

sv.Video("source.mp4").save(
    "timelapse.mp4",
    fps=60,
    callback=lambda f, i: f,
    show_progress=True
)
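Overriding only `fps` changes how long the same frames take to play back. A small worked example of that arithmetic; the frame count and source rate below are made-up numbers, not taken from this PR, and `playback_seconds` is a hypothetical helper.

```python
def playback_seconds(frame_count: int, fps: float) -> float:
    """Duration of a clip when `frame_count` frames are played at `fps`."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return frame_count / fps


# Hypothetical source: 1800 frames recorded at 30 fps.
source_duration = playback_seconds(1800, 30)
# The same frames written with fps=60 play back in half the time.
target_duration = playback_seconds(1800, 60)
speedup = source_duration / target_duration
```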

complete manual control with explicit VideoInfo

import cv2
from supervision import Video, VideoInfo

source = Video("source.mp4")
target_info = VideoInfo(width=800, height=800, fps=24)

# write every frame, resized to the explicit target resolution
with source.sink("square.mp4", info=target_info) as sink:
    for f in source.frames():
        f = cv2.resize(f, target_info.resolution_wh)
        sink.write(f)

multi-backend support for decoding/encoding

import supervision as sv

video = sv.Video("source.mkv", backend="pyav")

video = sv.Video("source.mkv", backend="opencv")
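One way the `backend="..."` string could be resolved is a small name-to-factory registry. This is only a sketch: `register_backend`, `get_backend`, and `FakeOpenCVBackend` are hypothetical names, and nothing here reflects how the PR actually wires backends together.

```python
from typing import Callable, Dict

# Hypothetical registry; real entries would wrap OpenCV and PyAV.
_BACKENDS: Dict[str, Callable[[], object]] = {}


def register_backend(name: str, factory: Callable[[], object]) -> None:
    """Make a backend available under a case-insensitive name."""
    _BACKENDS[name.lower()] = factory


def get_backend(name: str) -> object:
    """Instantiate the backend registered under `name`, or fail with a clear message."""
    try:
        factory = _BACKENDS[name.lower()]
    except KeyError:
        known = ", ".join(sorted(_BACKENDS)) or "none"
        raise ValueError(f"Unknown backend {name!r}; available: {known}") from None
    return factory()


class FakeOpenCVBackend:
    """Placeholder standing in for an OpenCV-based backend."""


register_backend("opencv", FakeOpenCVBackend)
backend = get_backend("OpenCV")
```

Keeping the lookup in one place also means an optional backend can register itself only when its dependency is importable.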

suggested minimal protocol

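from __future__ import annotations

from typing import Any, Protocol

import numpy as np

from supervision import VideoInfo

class Backend(Protocol):
    # opening a source and reading its metadata
    def open(self, path: str) -> Any: ...
    def info(self, handle: Any) -> VideoInfo: ...

    # decoding: read returns (success, frame); grab advances without decoding
    def read(self, handle: Any) -> tuple[bool, np.ndarray]: ...
    def grab(self, handle: Any) -> bool: ...
    def seek(self, handle: Any, frame_idx: int) -> None: ...

    # encoding
    def writer(self, path: str, info: VideoInfo, codec: str) -> Writer: ...

class Writer(Protocol):
    def write(self, frame: np.ndarray) -> None: ...
    def close(self) -> None: ...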
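The `Writer` protocol above pairs naturally with the `sink(...)` context manager used in the manual-control example: the context manager only has to guarantee that `close()` runs. A minimal sketch under that assumption, with an in-memory `ListWriter` standing in for a real encoder; the `sink` function and `ListWriter` are hypothetical, and frames are typed as `Any` so the example does not need NumPy.

```python
from contextlib import contextmanager
from typing import Any, Iterator, List, Protocol


class Writer(Protocol):
    def write(self, frame: Any) -> None: ...
    def close(self) -> None: ...


class ListWriter:
    """In-memory stand-in for a real encoder; keeps frames in a list."""

    def __init__(self) -> None:
        self.frames: List[Any] = []
        self.closed = False

    def write(self, frame: Any) -> None:
        if self.closed:
            raise RuntimeError("writer is closed")
        self.frames.append(frame)

    def close(self) -> None:
        self.closed = True


@contextmanager
def sink(writer: Writer) -> Iterator[Writer]:
    """Yield the writer and close it on exit, even if the body raises."""
    try:
        yield writer
    finally:
        writer.close()


writer = ListWriter()
with sink(writer) as s:
    s.write("frame-0")
    s.write("frame-1")
```

Because `ListWriter` satisfies the protocol structurally, it needs no inheritance, which makes this kind of fake convenient for tests.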

Ashp116 · 2025-08-02T06:52:16Z

Hi @SkalskiP,

I’ve addressed most of the features you mentioned, but I have some thoughts on a few aspects of the implementation:

.save Functionality
How would you handle .save for a video feed coming from a webcam or an RTSP stream? Currently, only video files can be saved.
Writer and Backend Classes
This is just my personal opinion, but should these classes be moved to separate scripts/modules? If we add more writers and backends in the future, keeping everything inside the main video script might become cluttered.
“Complete manual control with explicit VideoInfo” Functionality
```
from supervision import Video, VideoInfo

source = Video("source.mp4")
target_info = VideoInfo(width=800, height=800, fps=24)

with src.sink("square.mp4", info=target_info) as sink:
    for f in src.frames():
        f = cv2.resize(f, target_info.resolution_wh)
        sink.write(f)
```
I’m not fully clear on what this feature is intended to do. In this snippet, the Video instance source is created but never used afterward. Is src supposed to be source? Also, is the goal to create sinks for each backend? Could you please clarify the purpose and expected usage here?
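On the `.save` question for webcams and RTSP streams: such sources never end on their own, so one possible approach is to require an explicit bound before writing. The sketch below is only an illustration of that idea, not something decided in this thread; `take_until`, `max_frames`, and `max_seconds` are hypothetical names.

```python
import time
from itertools import count
from typing import Callable, Iterable, Iterator, Optional, TypeVar

T = TypeVar("T")


def take_until(
    frames: Iterable[T],
    max_frames: Optional[int] = None,
    max_seconds: Optional[float] = None,
    clock: Callable[[], float] = time.monotonic,
) -> Iterator[T]:
    """Yield frames until either limit is reached.

    This is a generator, so the argument check runs on first iteration.
    """
    if max_frames is None and max_seconds is None:
        raise ValueError("an unbounded source needs max_frames or max_seconds")
    iterator = iter(frames)
    started = clock()
    yielded = 0
    while True:
        # Check limits before pulling, so no extra frame is consumed.
        if max_frames is not None and yielded >= max_frames:
            return
        if max_seconds is not None and clock() - started >= max_seconds:
            return
        try:
            frame = next(iterator)
        except StopIteration:
            return
        yield frame
        yielded += 1


endless = count()  # stands in for a webcam or RTSP feed
saved = list(take_until(endless, max_frames=3))
```

A file source would simply pass through unchanged when the limits are larger than the clip.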

SkalskiP · 2025-08-07T22:23:07Z

pyproject.toml

@@ -46,7 +46,8 @@ dependencies = [
    "pillow>=9.4",
    "requests>=2.26.0",
    "tqdm>=4.62.3",
-    "opencv-python>=4.5.5.64"
+    "opencv-python>=4.5.5.64",
+    "imageio-ffmpeg (>=0.6.0,<0.7.0)"


Let's not use imageio-ffmpeg and use PyAV instead.

When adding PyAV as a dependency, make sure to make it optional. We don't want to require everyone to install it.
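A common pattern for optional dependencies is to import them lazily and fail with an actionable message. The sketch below assumes a hypothetical helper `require_optional` and a hypothetical extra name; only the fact that PyAV is imported as `av` is certain.

```python
import importlib
from types import ModuleType


def require_optional(module_name: str, extra: str) -> ModuleType:
    """Import an optional dependency or raise an ImportError explaining how to install it."""
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        raise ImportError(
            f"'{module_name}' is required for this feature. "
            f"Install it with: pip install 'supervision[{extra}]'"
        ) from exc


# Inside a PyAV backend this might be called as (extra name is an assumption):
#     av = require_optional("av", extra="video")
```

Calling the helper inside the backend rather than at module import time keeps `import supervision` working for users who never touch PyAV.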

SkalskiP · 2025-08-07T22:26:51Z

supervision/utils/video.py

@@ -141,6 +673,7 @@ def _validate_and_setup_video(
    return video, start, end


+@DeprecationWarning


Let's use `deprecated` from `supervision.utils.internal`. It is currently used in the codebase to mark several deprecations.
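For context on why `@DeprecationWarning` does not work as a decorator: applying it calls `DeprecationWarning(func)`, which replaces the function with an exception instance that cannot be called. The sketch below contrasts that with a stand-in `deprecated(reason)` decorator; it is not the implementation in `supervision.utils.internal`, whose exact signature may differ, and `old_api` and `broken_api` are made-up names.

```python
import functools
import warnings


def deprecated(reason: str):
    """Stand-in decorator: warn on every call, then run the original function."""

    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            # stacklevel=2 points the warning at the caller, not this wrapper.
            warnings.warn(reason, category=DeprecationWarning, stacklevel=2)
            return func(*args, **kwargs)

        return wrapper

    return decorator


@deprecated("old_api is deprecated; use the new Video API instead.")
def old_api(x: int) -> int:
    return x + 1


@DeprecationWarning
def broken_api(x: int) -> int:
    return x + 1


# broken_api is now a DeprecationWarning instance, not a function,
# so calling broken_api(1) would raise TypeError.
```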

SkalskiP · 2025-08-07T22:26:54Z

supervision/utils/video.py

@@ -192,6 +725,7 @@ def get_video_frames_generator(
    video.release()


+@DeprecationWarning


Let's use `deprecated` from `supervision.utils.internal`. It is currently used in the codebase to mark several deprecations.

SkalskiP · 2025-08-07T22:27:01Z

supervision/utils/video.py

@@ -117,6 +648,7 @@ def __exit__(self, exc_type, exc_value, exc_traceback):
        self.__writer.release()


+@DeprecationWarning


No need to mark private classes / functions as deprecated. We do it only for classes / functions in the public supervision API.

SkalskiP · 2025-08-07T22:27:05Z

supervision/utils/video.py

+        )
+
+
+@DeprecationWarning


Let's use `deprecated` from `supervision.utils.internal`. It is currently used in the codebase to mark several deprecations.

The internals of this deprecated class should now be reimplemented using the new Video API.

SkalskiP · 2025-08-07T22:28:42Z

supervision/utils/video.py

@@ -141,6 +673,7 @@ def _validate_and_setup_video(
    return video, start, end


+@DeprecationWarning
 def get_video_frames_generator(


The internals of this deprecated function should now be reimplemented using the new Video API.

ADD: Added audio stream for process_video

7fba113

Ashp116 requested a review from SkalskiP as a code owner July 30, 2025 19:29

pre-commit-ci bot and others added 3 commits July 30, 2025 19:30

fix(pre_commit): 🎨 auto format pre-commit hooks

8947f77

REMOVE: Removed ffprobe

73b5836

Merge branch 'bug/process-video-audio' of https://github.com/Ashp116/…

e02d298

…supervision into bug/process-video-audio

Ashp116 changed the title ~~ADD: Added audio stream for process_video~~ BUG: Added audio stream for process_video Jul 30, 2025

Ashp116 and others added 16 commits August 1, 2025 22:51

UPDATE: Added a new Video class with OpenCV writer and backend

5e07794

Merge pull request #1 from Ashp116/update/video-core

46ec693

UPDATE: Added a new Video class with OpenCV writer and backend

fix(pre_commit): 🎨 auto format pre-commit hooks

b2096d0

Precommit

9fb7098

fix(pre_commit): 🎨 auto format pre-commit hooks

850a2c6

Precommit

46900f8

Merge branch 'bug/process-video-audio' of https://github.com/Ashp116/…

34cb9a1

…supervision into bug/process-video-audio

fix(pre_commit): 🎨 auto format pre-commit hooks

c700394

UPDATE: Fixed incomplete write closing

fce8ade

ADD: Docstrings

f86f4f2

fix(pre_commit): 🎨 auto format pre-commit hooks

2265977

UPDATE: Allow for ffmpeg error passthrough

bf67bfa

UPDATE: Writer and Backend abstract class

ec4bd01

Precommit

b9e7968

fix(pre_commit): 🎨 auto format pre-commit hooks

a96c3f0

Precommit

a6c91bc

Ashp116 changed the title ~~BUG: Added audio stream for process_video~~ FEATURE: Versatile Video class Aug 2, 2025

Ashp116 mentioned this pull request Aug 5, 2025

Reimplement video utils #1929

Open

Ashp116 added 2 commits August 6, 2025 16:21

UPDATE: Added manual control

d075e03

ADD: Added docstrings

7f078ff

pre-commit-ci bot and others added 6 commits August 6, 2025 20:28

fix(pre_commit): 🎨 auto format pre-commit hooks

af49e9a

UPDATE: Deprecate warning old Video API

320d817

FIX: Prototype resolution for roboflow#1687

cb8d2f8

fix(pre_commit): 🎨 auto format pre-commit hooks

a3a3a9e

FIX: Rounding type

ecbf5af

fix(pre_commit): 🎨 auto format pre-commit hooks

1da4466

SkalskiP requested changes Aug 7, 2025
