634557 - Implement efficient scaling YUV-to-RGB conversions for ARM (using NEON).

The conversion routines provide the following: * Multiply-by-two. - Sampling using the nearest pixel. - Minimum source width of 16 pixels. - Number of rows must be divisible by 2. * Divide-by-two. - Sampling using a 2*2 average. - Minimum source width of 16 pixels. - Number of rows must be divisible by 2. * Divide-by-four. - Sampling using a 4*4 average. - Minimum source width of 32 pixels. - Number of rows must be divisible by 4. There are variants for both ARGB-8888 and RGB-565 format. Each routine expects to be able to decode a whole image in one shot. Row-by-row variants are feasible, but will be notably less efficient as the function call overhead here is significant simply because we're using so many registers which need to be stacked. There is room for improvement, both in flexibility and in performance, so we should consider having another look at these routines in the future. The test framework provided runs each routine, along with Siarhei's implementation (from yuv_convert_arm.cpp) and Steve's solution (which I'll attach in a minute). It saves the result of each routine as a BMP file and displays the time taken for each one.

Jacob Bramley [:jbramley]

Reporter

Comment 5

•

14 years ago

Attached file Steve's YUV-to-ARGB solution (with no scaling). (obsolete) — Details

This is the solution on which my scaling variants are based.

Conversion routines. 14 years ago Jacob Bramley [:jbramley] 50.28 KB, text/plain		Details
Test framework. 14 years ago Jacob Bramley [:jbramley] 17.23 KB, text/plain		Details
Steve's YUV-to-ARGB solution (with no scaling). 14 years ago Jacob Bramley [:jbramley] 17.96 KB, text/plain		Details
Half-complete generic scaler. 14 years ago Jacob Bramley [:jbramley] 13.98 KB, text/plain		Details
ScaleYCbCr42xToRGB565_BilinearY_Row_NEON 14 years ago Timothy B. Terriberry (:derf) 12.12 KB, text/plain		Details
Add ScaleYCbCrToRGB565 version 1. 14 years ago Timothy B. Terriberry (:derf) 61.26 KB, patch		Details \| Diff \| Splinter Review
Add ScaleYCbCrToRGB565 version 2. 14 years ago Timothy B. Terriberry (:derf) 72.42 KB, patch		Details \| Diff \| Splinter Review
Jacob's generic scaler (unoptimized). 14 years ago Jacob Bramley [:jbramley] 21.76 KB, text/plain		Details
Y'CbCr conversion testbench 14 years ago Timothy B. Terriberry (:derf) 25.04 KB, application/octet-stream		Details
Patch update which can be applied to the current trunk 14 years ago Joachim Herb 45.97 KB, patch		Details \| Diff \| Splinter Review
Test framework, with benchmark results. 14 years ago Jacob Bramley [:jbramley] 63.78 KB, application/gzip		Details
Test framework, with benchmark results 14 years ago Jacob Bramley [:jbramley] 60.58 KB, application/gzip		Details
Test framework with two-pass nearest and bilinear scalers added 14 years ago Siarhei Siamashka 120.43 KB, application/gzip		Details
ScaleYCbCrToRGB565: Reference C version 14 years ago Timothy B. Terriberry (:derf) 27.09 KB, patch	cajbir : review+	Details \| Diff \| Splinter Review
ScaleYCbCr42xToRGB565_BilinearY_Row_NEON 14 years ago Timothy B. Terriberry (:derf) 18.13 KB, patch	jbramley : review+	Details \| Diff \| Splinter Review
ScaleYCbCrToRGB565: Reference C version (for check-in) 14 years ago Timothy B. Terriberry (:derf) 27.34 KB, patch	derf : review+	Details \| Diff \| Splinter Review
Implement ScaleYCbCr42xToRGB565_BilinearY_Row_NEON (for check-in) 14 years ago Timothy B. Terriberry (:derf) 19.54 KB, patch	derf : review+	Details \| Diff \| Splinter Review
oprofile-logs-for-bug-634557.txt (before and after patches got applied) 14 years ago Siarhei Siamashka 11.60 KB, text/plain		Details