Tag:
Branch:
Tree:
afdb4ce61e
0.6.1-en
0.6.2-en
0.6.3-en
0.6.4-en
0.6.5-en
0.6.6-en
0.7.0-en
0.7.1-en
0.7.10
0.7.10-081-model
0.7.2-en
0.7.3-en
0.7.4-i18n
0.7.5-i18n
0.7.6-i18n
0.7.7-i18n
0.7.8-i18n
0.8.0
0.8.1
0.8.10
0.8.12
0.8.13
0.8.4
0.8.5
0.8.6
0.8.7
0.8.8
0.8.9
d2
d2-honda
d3
deprecated-beta2
deprecated-beta3
deprecated-release2
deprecated-release2_e2e
deprecated-release3
docs
master3-pedal
pre-build
pre-build-sdsu-tune
pre-build-tr7
r2
r2-v2022.07.29
r3-v2022.07.29
r3-v2022.08.14
v0.1
v0.2
v0.2.1
v0.2.2
v0.2.3
v0.2.4
v0.2.5
v0.2.6
v0.2.7
v0.2.8
v0.2.9
v0.3.0
v0.3.1
v0.3.2
v0.3.3
v0.3.4
v0.3.5
v0.4.0.2
v0.4.1
v0.4.2
v0.4.4
v0.4.5
v0.4.6
v0.4.7
v0.5
v0.5.1
v0.5.10
v0.5.11
v0.5.12
v0.5.13
v0.5.2
v0.5.3
v0.5.4
v0.5.5
v0.5.6
v0.5.7
v0.5.8
v0.5.9
v0.6.2-zhs
v0.6.2-zht
v0.6.3-zhs
v0.6.3-zht
v0.6.4-zhs
v0.6.4-zht
v0.6.5-zhs
v0.6.5-zht
v0.6.6-zhs
v0.6.6-zht
v0.7.0-zhs
v0.7.0-zht
v0.7.1-zhs
v0.7.1-zht
v0.7.2-zhs
v0.7.2-zht
v0.7.3-zhs
v0.7.3-zht
v0.7.4-i18n
v0.7.4-zhs
v0.7.4-zht
v0.7.5-i18n
v0.7.6-i18n
${ noResults }
2 Commits (afdb4ce61e2e4a209cf6d4a35b51bfafdbbf11e4)
Author | SHA1 | Message | Date |
---|---|---|---|
|
78a352a8ca
|
This isn't bigmodel, but there's a lot of good stuff here (#1532)
* bigmodel * more debug print * debugging bigmodel * remove the tanh, debugging * print images/buffers * disassemble the command queues * decompiler * dump the shaders * full disasm * support patching kernel and fixing convolution_horizontal_reduced_reads_1x1 * microbenchmark * 42 GFLOPS, 1 GB/s * gemm benchmark * 75 GFLOPS vs 42 GFLOPS * 115 GFLOPS * oops, never mind * gemm image is slow * this is pretty hopeless * gemm image gets 62 GFLOPS * this is addictive and still a waste of time * cleanup cleanup * that hook was dumb * tabbing * more tabbing Co-authored-by: Comma Device <device@comma.ai> |
5 years ago |
|
302d06ee70 |
thneed saves 45% of a core (#1512)
* thneed runs the model * thneed is doing the hooking * set kernel args * thneeding the bufferS * print the images well * thneeds with better buffers * includes * disasm adreno * parse packets * disasm works * disasm better * more thneeding * much thneeding * much more thneeding * thneed works i think * thneed is patient * thneed works * 7.7% * gpuobj sync * yay, it mallocs now * cleaning it up, Thneed * sync objs and set power * thneed needs inputs and outputs * thneed in modeld * special modeld runs * can't thneed the DSP * test is weird * thneed modeld uses 6.4% CPU * add thneed to release * move to debug * delete some junk from the pr * always track the timestamp * timestamp hacks in thneed * create a new command queue * fix timestamp * pretty much back to what we had, you can't use SNPE with thneed * improve thneed test * disable save log Co-authored-by: Comma Device <device@comma.ai> |
5 years ago |