* bigmodel
* more debug print
* debugging bigmodel
* remove the tanh, debugging
* print images/buffers
* disassemble the command queues
* decompiler
* dump the shaders
* full disasm
* support patching kernel and fixing convolution_horizontal_reduced_reads_1x1
* microbenchmark
* 42 GFLOPS, 1 GB/s
* gemm benchmark
* 75 GFLOPS vs 42 GFLOPS
* 115 GFLOPS
* oops, never mind
* gemm image is slow
* this is pretty hopeless
* gemm image gets 62 GFLOPS
* this is addictive and still a waste of time
* cleanup cleanup
* that hook was dumb
* tabbing
* more tabbing
Co-authored-by: Comma Device <device@comma.ai>
* thneed runs the model
* thneed is doing the hooking
* set kernel args
* thneeding the bufferS
* print the images well
* thneeds with better buffers
* includes
* disasm adreno
* parse packets
* disasm works
* disasm better
* more thneeding
* much thneeding
* much more thneeding
* thneed works i think
* thneed is patient
* thneed works
* 7.7%
* gpuobj sync
* yay, it mallocs now
* cleaning it up, Thneed
* sync objs and set power
* thneed needs inputs and outputs
* thneed in modeld
* special modeld runs
* can't thneed the DSP
* test is weird
* thneed modeld uses 6.4% CPU
* add thneed to release
* move to debug
* delete some junk from the pr
* always track the timestamp
* timestamp hacks in thneed
* create a new command queue
* fix timestamp
* pretty much back to what we had, you can't use SNPE with thneed
* improve thneed test
* disable save log
Co-authored-by: Comma Device <device@comma.ai>
* convert c-capnp to c++
change paramater type
* remove c-capnp header from commonmodel.c
* unsigned short -> uint16_t
* MSM_CAMERA_I2C_ADDR_TYPE_MAX->4
* lgtm in readme, and mac nui fixes
* call super inits in radars
* unused imports, dup lines
* more radars, more unused imports
* pass CP into RadarInterfaceBase
* more fixups
* unused imports
* delete unused lines
* ugh, new unused import
Co-authored-by: George Hotz <geohot@gmail.com>
* add traffic convention
* hope this work
* no comment
* latest and gratest
* big gru model
* 1af55c7d-ee15-414a-9e98-a0cb08c3441f/75
* much later in training
* wrong temporal size
* converged
* fix lane changes
* capnparm
* building in progress
* scons build works
* that script fixes opencl
* start new camera code
* includes and more camera scripts
* control c works now
* no device control yet
* phy too
* just one camera for now
* fix capnparm
* hmm, the inits are needed
* more cameras
* link stop start
* doesn't work yet
* fix ion on qcom2
* start poll ish
* 4 pictures and done
* no jpeg
* it works to picture
* destroy sync obj
* both work for now
* defined QCOM2
* fix fd leak
* run modeld
* 10 bit mode
* real frame stride
* needs digital gain
* dnew
* no color correcting on new
* that snpe doesn't work
* qcom2 gate
* cleanups
* oops, fix aarch64 detector
* update cereal
* modeld works with SNPE
* fix driver monitoring model
Co-authored-by: Tici <robbe@comma.ai>
* snpe version 1.35.0.698
* fix files common for new snpe
* efficient net with normal names
* compiling with 1.32 works
* actual snpe files
* fix monitoring, delete unneeded
* point the symlinks to the right places
* err, modeld needs the CPU
* benchmark for 50 its
* put back old model for now
Co-authored-by: Comma Device <device@comma.ai>
Co-authored-by: Harald Schafer <harald.the.engineer@gmail.com>
Co-authored-by: George Hotz <geohot@gmail.com>
* cleanup simulator files
* minor updates
* update readme
* keras runner builds
* hmm, still doesn't work
* keras runner works
* should work with python3 keras mod
* touchups