Change the fb buffer as 16 bytes aligned #419

jason-mao · 2022-07-13T09:52:04Z

The 128 bit aligned buffer have good performance for esp_jpeg encoding. The aligned data to improve the memory load time, combined with esp32-S3 128bit SIMD instructions optimization, we got about 10 fps on OV3660 camera with YUV422 and VGA.

kxbin · 2022-07-19T02:52:45Z

Hello, may I ask if there is test data, how much fps has been improved?

igrr

Thank you for the PR @jason-mao!

Have left one note about the usage of heap_caps_aligned_alloc in older releases.

Also would you mind sharing, what is the observed performance improvement?
Perhaps you could add a test case or some other way to benchmark the improvement you have observed?

driver/cam_hal.c

dongmenhai · 2022-08-09T12:11:11Z

@jason-mao Hello，How can I test this feature to take effect? I use esp32s3 to run a demo of pic_server, call the frame2jpg function 30 times and calculate the time. By comparison, I found that the method of modifying cam_obj->frames[x].fb.buf to 128 bit alignment did not change the speed of jpeg encoding

jason-mao · 2022-08-22T11:04:25Z

Hello, may I ask if there is test data, how much fps has been improved?

This modification just to make the 128bit SIMD instructions to load data at theoretical performance, the unaligned data has extra time. This is only the first step. Most of the real improvement time is in code assembly optimization. I think I can add some code on esp-adf to test it.

jason-mao · 2022-08-22T11:05:02Z

Thank you for the PR @jason-mao!

Have left one note about the usage of heap_caps_aligned_alloc in older releases.

Also would you mind sharing, what is the observed performance improvement? Perhaps you could add a test case or some other way to benchmark the improvement you have observed?

I have update on #419 (comment)

jason-mao · 2022-08-22T11:07:28Z

@jason-mao Hello，How can I test this feature to take effect? I use esp32s3 to run a demo of pic_server, call the frame2jpg function 30 times and calculate the time. By comparison, I found that the method of modifying cam_obj->frames[x].fb.buf to 128 bit alignment did not change the speed of jpeg encoding

Sorry, maybe I caused some misunderstanding. I explanation here is clearly #419 (comment)

…rformance with 128 bit SIMD instructions

igrr

Latest version LGTM.

Given the typical framebuffer sizes, loosing a few bytes of heap due to alignment doesn't seem to be a big issue. So I'm okay with this change even without adding SIMD optimizations to the rest of this project.

@me-no-dev PTAL as well.

me-no-dev · 2022-08-23T12:53:10Z

Thanks @jason-mao

weilian1977 · 2023-03-28T09:43:21Z

The 128 bit aligned buffer have good performance for esp_jpeg encoding. The aligned data to improve the memory load time, combined with esp32-S3 128bit SIMD instructions optimization, we got about 10 fps on OV3660 camera with YUV422 and VGA.

can provide test code？

weilian1977 · 2023-03-29T13:28:56Z

Hello, may I ask if there is test data, how much fps has been improved?

This modification just to make the 128bit SIMD instructions to load data at theoretical performance, the unaligned data has extra time. This is only the first step. Most of the real improvement time is in code assembly optimization. I think I can add some code on esp-adf to test it.

https://github.com/espressif/esp-adf-libs/blob/master/esp_codec/include/codec/esp_jpeg_version.h？？

igrr requested changes Aug 4, 2022

View reviewed changes

driver/cam_hal.c Show resolved Hide resolved

jason-mao force-pushed the bugfix/change_fb_to_aligned_buffer branch from 846a3f4 to 8b7416a Compare August 23, 2022 07:38

Fix fb buffer as 128 bit aligned for improve the data transmission pe…

03b1eab

…rformance with 128 bit SIMD instructions

jason-mao force-pushed the bugfix/change_fb_to_aligned_buffer branch from 8b7416a to 03b1eab Compare August 23, 2022 07:45

igrr approved these changes Aug 23, 2022

View reviewed changes

me-no-dev merged commit 36121e1 into espressif:master Aug 23, 2022

weilian1977 mentioned this pull request Mar 27, 2023

esp32S3 jpg编码解码速度如何？ #510

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Change the fb buffer as 16 bytes aligned #419

Change the fb buffer as 16 bytes aligned #419

jason-mao commented Jul 13, 2022 •

edited

Loading

Uh oh!

kxbin commented Jul 19, 2022

Uh oh!

igrr left a comment

Uh oh!

Uh oh!

dongmenhai commented Aug 9, 2022

Uh oh!

jason-mao commented Aug 22, 2022

Uh oh!

jason-mao commented Aug 22, 2022

Uh oh!

jason-mao commented Aug 22, 2022

Uh oh!

igrr left a comment

Uh oh!

me-no-dev commented Aug 23, 2022

Uh oh!

weilian1977 commented Mar 28, 2023

Uh oh!

weilian1977 commented Mar 29, 2023

Uh oh!

Uh oh!

Change the fb buffer as 16 bytes aligned #419

Change the fb buffer as 16 bytes aligned #419

Conversation

jason-mao commented Jul 13, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kxbin commented Jul 19, 2022

Uh oh!

igrr left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dongmenhai commented Aug 9, 2022

Uh oh!

jason-mao commented Aug 22, 2022

Uh oh!

jason-mao commented Aug 22, 2022

Uh oh!

jason-mao commented Aug 22, 2022

Uh oh!

igrr left a comment

Choose a reason for hiding this comment

Uh oh!

me-no-dev commented Aug 23, 2022

Uh oh!

weilian1977 commented Mar 28, 2023

Uh oh!

weilian1977 commented Mar 29, 2023

Uh oh!

Uh oh!

jason-mao commented Jul 13, 2022 •

edited

Loading