floss.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
For people who care about, support, and build Free, Libre, and Open Source Software (FLOSS).

Administered by:

Server stats:

685
active users

Robert Mader

For those of you interested in our recent video offloading / zero-copy playback work: I quickly put together some s to make it easy to test stuff already. Compositor offloading should work on all semi-recent Intel/AMD and a variety of ARM64 devices.

If you trust the sandbox you can get them here:
cloud.silentundo.org/s/r8733si

I expect quite a few people hitting driver bugs, so please help tracking those down :)

Things should generally work on , , and - not sure about other compositors.

I haven't tried myself yet as NV12 support was only added recently. But tomorrow there's a local release party where I hope to convince some people to try on their devices.

Note that offloading currently requires HW decoding - something I hope we can change next cycle. will display a little icon in the top-left if hardware decoding and thus offloading is not used.

Whenever HW decoding and offloading works I'd expect the player to be competitive to whatever other favorite player you have performance wise.

For those that are confused by the pictures in the first post because they know about the hardware limitations: yes, correct, both of them actually don't show zero-copy playback :P
One semi-surprising finding of the offloading work was that compositor offloading often pays off even when not hitting a full zero-copy path / hardware plane scanout.

In theory that gap shouldn't exist and we more or less have all APIs in place to let clients scale and pre-rotate content in a optimal way - i.e. so the compositor doesn't have to do a copy and we keep things to one single copy wherever zero-copy is impossible.

In practice there's lots of room for improvements in various stacks. With compositor offload we now have a baseline of the minimal required GPU work (assuming compositors are fully optimized).

For those using HW that still uses the stateful V4L2 API for decoding - such as the - I uploaded another build to the link in the first post that includes a patch that is *not* close to landing, but works well enough to make playback work.

With that I can play 1080p30fps videos (the decoder limit) on my big screen smoothly, which otherwise not possible (apart from reducing the resolution).

gitlab.freedesktop.org/gstream

@rmader That reminds me that this seems like a regression with the switch away from the proprietary Broadcom GL stack and the OpenMAX IL/MMAL-based decoders.

With those (even on the RPi 1/2/3!) it was possible to decode and render 1080p30 smoothly by getting EGLImages from the decoder and rendering them via GLES2.

With the new Mesa/VC4 GL stack and the v4l2-based stateful decoder this is only possible by directly rendering the dmabufs via KMS or Wayland instead of going via anything GL-based.

I didn't investigate this further.

@slomo I have to say that when I reduce the screen resolution to 1080p it probably work if it wasn't for gitlab.freedesktop.org/mesa/me (direct scanout broken for GL apps).

But for anything above I'd assume things to get problematic without hardware plane upscaling. Maybe it would still manage 30fps, but likely with an extra frame latency on a 60fps screen.

GitLabRaspberry Pi4 performance regression with Mesa 23.3. May effect other V3D based Pi devices. (#10306) · Issues · Mesa / mesa · GitLab System information OS: batocera.linux GPU: V3D Kernel version: 6.1.64-v8 Mesa version: 3.1 Mesa 23.3.1 Xserver...