da7ddd8477dc802c8736c7ab860fc09f33689ce9 - fp2-dev/platform/frameworks/rs

commit	da7ddd8477dc802c8736c7ab860fc09f33689ce9	[log] [tgz]
author	Tobias Grosser <grosser@google.com>	Thu Jul 11 17:58:47 2013 -0700
committer	Tobias Grosser <grosser@google.com>	Mon Jul 15 14:07:20 2013 -0700
tree	aa2adccfb8659aeef55ae83ef2f1164699c4cb30
parent	574854bcb2eb25a85b9b52faf2fb3e743fa7aa14 [diff]

Simplify code of convolve3x3

Instead of first doing all multiplications and then adding the results in
a tree manner, we just repetitively perform a load/multiply/add patter.
With and without tuning for A15, this yields a 5% performance increase for N10.

This commit also exposes more instructions to be transformed into fused
multiply adds.

Change-Id: I1215d75da236e6b2d6b6aa48b3ab35606cdba7b8

java/tests/ImageProcessing/src/com/android/rs/image/convolve3x3.fs[diff]

1 file changed

tree: aa2adccfb8659aeef55ae83ef2f1164699c4cb30

cpp/
cpu_ref/
driver/
java/
scriptc/
server/
tests/
Android.mk
CleanSpec.mk
rs.h
rs.spec
rs_hal.h
rs_native.spec
rsAdapter.cpp
rsAdapter.h
rsAllocation.cpp
rsAllocation.h
rsAnimation.cpp
rsAnimation.h
rsComponent.cpp
rsComponent.h
rsContext.cpp
rsContext.h
rsCppUtils.cpp
rsCppUtils.h
rsDebugHelper.h
rsDefines.h
rsDevice.cpp
rsDevice.h
rsElement.cpp
rsElement.h
rsEnv.h
rsFBOCache.cpp
rsFBOCache.h
rsFifo.h
rsFifoSocket.cpp
rsFifoSocket.h
rsFileA3D.cpp
rsFileA3D.h
rsFont.cpp
rsFont.h
rsg.spec
rsg_generator.c
rsgApi.cpp.rsg
rsgApiFuncDecl.h.rsg
rsgApiReplay.cpp.rsg
rsgApiStructs.h.rsg
rsMatrix2x2.cpp
rsMatrix2x2.h
rsMatrix3x3.cpp
rsMatrix3x3.h
rsMatrix4x4.cpp
rsMatrix4x4.h
rsMesh.cpp
rsMesh.h
rsMutex.cpp
rsMutex.h
rsObjectBase.cpp
rsObjectBase.h
rsPath.cpp
rsPath.h
rsProgram.cpp
rsProgram.h
rsProgramBase.h
rsProgramFragment.cpp
rsProgramFragment.h
rsProgramRaster.cpp
rsProgramRaster.h
rsProgramStore.cpp
rsProgramStore.h
rsProgramVertex.cpp
rsProgramVertex.h
rsRuntime.h
rsSampler.cpp
rsSampler.h
rsScript.cpp
rsScript.h
rsScriptC.cpp
rsScriptC.h
rsScriptC_Lib.cpp
rsScriptC_LibGL.cpp
rsScriptGroup.cpp
rsScriptGroup.h
rsScriptIntrinsic.cpp
rsScriptIntrinsic.h
rsSignal.cpp
rsSignal.h
rsStream.cpp
rsStream.h
rsThreadIO.cpp
rsThreadIO.h
rsType.cpp
rsType.h
rsUtils.h
spec.h
spec.l