* delete unused stuff
* remove CL interceptor from thneed since we don't use SNPE anymore
* remove dead files from release
* that's removed
* oops, didn't save
* pc thneed prereqs
* ugh, out of date
* that can stay private
* memcpy here is fine in SNPE variant
* release files
* thneed docs don't work anymore. they didn't look too useful
Co-authored-by: Comma Device <device@comma.ai>
* get log
* simplify two nonsense
* not needed
* libyuv is a joke
* clean up
* try small
* fast but not bad
* working
* clean up driverview
* simplified
* thats mirrored
* smol
* tweak
* ref is screen
* w/ ee
* update camera model
* no if TICI
* start
* update pose thresh
* less cpu more dsp
* new libyuv
* new snpe
* add files
* test
* should be fast
* update out len
* trigger test
* use master snpe
* add cereal
* update cereal
* refactor parsing
* missing ;
* get
* wrong type
* test model
* use driver data
* 10829278-72fe-4283-a118-2cef959ce174/1550
* no pf
* adapt driverview
* ;
* rhd learner
* update libyuv buildi x64
* ad4337ea
* remove blink slack
* test
* no
* use toggle
* b16
* fix for nv12
* 5b02cff5 both
* update test
* update cereal
* update cereal
* update cereal
* v2 packets
* revert libyuv
* no /
* update snpemodel
* ;
* memcpy
* fix test
* use toggle in driverview
* update power
* update replay
* Revert "update replay"
This reverts commit 1d0979ca59.
* update model ref
* halve cpu
* fake 8bit onnx runner
* same thresh as report
* cereal master
Co-authored-by: Comma Device <device@comma.ai>
* update cereal
* run but not use
* log distraction type
* regression scaling
* clean up naming
* add calib buf
* add to header
* fake model
* no calib model
* adjust threshs
* 018a305f
* fix bn
* tweak1
* tweak2
* 0ff2/666
* tweak3
* t4
* t5
* fix out of bound
* skip when replaying old segments
* update ref
* fix onnxmodel
* get calib
* update model replay refs
* up ref
* Added wide cam vipc client and bigmodel transform logic
* Added wide_frame to ModelState, should still work normally
* Refactored image input into addImage method, should still work normally
* Updated thneed/compile.cc
* Bigmodel, untested: 44f83118-b375-4d4c-ae12-2017124f0cf4/200
* Have to initialize extra buffer in SNPEModel
* Default paramater value in the wrong place I think
* Move USE_EXTRA to SConscript
* New model: 6c34d59a-acc3-4877-84bd-904c10745ba6/250
* move use extra check to runtime, not on C2
* this is always true
* more C2 checks
* log if frames are out of sync
* more logging on no frame
* store in pointer
* print sof
* add sync logic
* log based on sof difference as well
* keep both models
* less assumptions
* define above thneed
* typo
* simplify
* no need for second client is main is already wide
* more comments update
* no optional reference
* more logging to debug lags
* add to release files
* both defines
* New model: 6831a77f-2574-4bfb-8077-79b0972a2771/950
* Path offset no longer relevant
* Remove duplicate execute
* Moved bigmodel back to big_supercombo.dlc
* add wide vipc stream
* Tici must be tici
* Needs state too
* add wide cam support to model replay
* handle syncing better
* ugh, c2
* print that
* handle ecam lag
* skip first one
* so close
* update refs
Co-authored-by: mitchellgoffpc <mitchellgoffpc@gmail.com>
Co-authored-by: Harald Schafer <harald.the.engineer@gmail.com>
Co-authored-by: Adeeb Shihadeh <adeebshihadeh@gmail.com>
Co-authored-by: Comma Device <device@comma.ai>
* add thneed optimizer
* local work group opt
* kernels and final mods
* release files
* build system touchups
* fix kernel path, rand inputs for self test
* broken since extra is gone
* update model replay ref
Co-authored-by: Comma Device <device@comma.ai>
* completely untested
* it builds now
* bug fixes, save 1ms
* using a kernel to copy works
* more sane API to loadyuv
Co-authored-by: Comma Device <device@comma.ai>
* use the actual file name
* use macro
* fix typo
* continue
* Apply suggestions from code review
Co-authored-by: Adeeb Shihadeh <adeebshihadeh@gmail.com>
* use cstring instead of string.h
* use cstdio instead of stdio.h
* remove inttypes.h
* use cstdlib instead of stdlib.h
* use cstdint instead of stdint.h
* #include <cstddef>
* cstdlib
* use cmath
* remove stddef.h
* use cassert
* use csignal
* use ctime
* use cerror
* rebase master
* no need to malloc one extra byte
* combine two read_file into a faster one
* cleanup #include
* use resize
* apply suggestions from review
* space
* rebase master
* initial commit
* CPU working in dokcer
* Update selfdrive/manager.py
Co-authored-by: Adeeb Shihadeh <adeebshihadeh@gmail.com>
* Update selfdrive/manager.py
* fixes
* fixing env var
* fixes
* building and running in docker
* camerad fix
* camerad fix
* this ACTUALLY works
* removing unused libraries
* updating to CARLA 0.9.11
* removing useless flags
* small changes
* forgot this
* sudo not needed for xhost
* final changes
Co-authored-by: Adeeb Shihadeh <adeebshihadeh@gmail.com>
* start thneed load/save
* compiling
* fix loading
* build thneed model in scons
* don't hardcode /data/openpilot
* release files
* those too
* support for loading/saving binary kernels
* save binaries out of json band
* make binary a command line flag to the compiler
* need include assert
* fix shadowed common in SConscript
* cleanup run.h
* hmm, the recurrent buffer wasn't 0ed
* ugh, unique ptr
* remove power constraint, refactor record
* Revert "remove power constraint, refactor record"
This reverts commit bb6fa52db6df59cd9d6420a6f630430e35af8a5e.
* print on thneed stop
* fingers crossed for this one
* recorded
* just curious
* okay okay, pass tests?
* cleanups
* refactor wait
Co-authored-by: Comma Device <device@comma.ai>
Co-authored-by: Adeeb Shihadeh <adeebshihadeh@gmail.com>
* refactor draw model
* rebase master
* correct valid_len
* rename function
* rename variables
* white space
* rebase to master
* e16c13ac-927d-455e-ae0a-81b482a2c787
* start rewriting
* save proress
* compiles!
* oops
* many fixes
* seems to work
* fix desires
* finally cleaned
* wrong std for ll
* dont pulse none
* compiles!
* ready to test
* WIP does not compile
* compiles
* various fixes
* does something!
* full 3d
* not needed
* draw up to 100m
* fix segfault
* wrong sign
* fix flicker
* add road edges
* finish v2 packet
* Added pytorch supercombo
* fix rebase
* no more keras
* Hacky solution to the NCHW/NHWC incompatibility between SNPE and our frame data
* dont break dmonitoringd, final model 229e3ce1-7259-412b-85e6-cc646d70f1d8/430
* fix hack
* Revert "fix hack"
This reverts commit 5550fc01a7881d065a5eddbbb42dac55ef7ec36c.
* Removed axis permutation hack
* Folded padding layers into conv layers
* Removed the last pad layer from the dlc
* Revert "Removed the last pad layer from the dlc"
This reverts commit b85f24b9e1d04abf64e85901a7ff49e00d82020a.
* Revert "Folded padding layers into conv layers"
This reverts commit b8d1773e4e76dea481acebbfad6a6235fbb58463.
* vision model: 5034ac8b-5703-4a49-948b-11c064d10880/780 temporal model: 229e3ce1-7259-412b-85e6-cc646d70f1d8/430 with permute + pool opt
* fix ui drawing with clips
* ./compile_torch.py 5034ac8b-5703-4a49-948b-11c064d10880/780 dfcd2375-81d8-49df-95bf-1d2d6ad86010/450 with variable history length
* std::clamp
* not sure how this compiled before
* 2895ace6-a296-47ac-86e6-17ea800a74e5/550
* db090195-8810-42de-ab38-bb835d775d87/601
* 5m is very little
* onnx runner
* add onnxruntime to pipfile
* run in real time without using the whole CPU
* bump cereal;
* add stds
* set road edge opacity based on stddev
* don't access the model packet in paint
* convert mat.h to a c++ header file (#2499)
* update tests
* safety first
Co-authored-by: deanlee <deanlee3@gmail.com>
Co-authored-by: mitchell <mitchell@comma.ai>
Co-authored-by: Comma Device <device@comma.ai>
Co-authored-by: George Hotz <george@comma.ai>
Co-authored-by: Adeeb Shihadeh <adeebshihadeh@gmail.com>
* enable Wunused, first pass
* unused stuff in snpe model
* these are used on phone
* handle sigint and sigterm in modeld
* fix phone build
* camera qcom
* QCOM build works
* delete unused camerad vars
Co-authored-by: Comma Device <device@comma.ai>
* Running pre-commit in CI
* fix dockerfile syntax
* dont run on submodule repo folders
* Fix some import errors in ci
* more stuff
* That should be the last one
* thneed runs the model
* thneed is doing the hooking
* set kernel args
* thneeding the bufferS
* print the images well
* thneeds with better buffers
* includes
* disasm adreno
* parse packets
* disasm works
* disasm better
* more thneeding
* much thneeding
* much more thneeding
* thneed works i think
* thneed is patient
* thneed works
* 7.7%
* gpuobj sync
* yay, it mallocs now
* cleaning it up, Thneed
* sync objs and set power
* thneed needs inputs and outputs
* thneed in modeld
* special modeld runs
* can't thneed the DSP
* test is weird
* thneed modeld uses 6.4% CPU
* add thneed to release
* move to debug
* delete some junk from the pr
* always track the timestamp
* timestamp hacks in thneed
* create a new command queue
* fix timestamp
* pretty much back to what we had, you can't use SNPE with thneed
* improve thneed test
* disable save log
Co-authored-by: Comma Device <device@comma.ai>
* add traffic convention
* hope this work
* no comment
* latest and gratest
* big gru model
* 1af55c7d-ee15-414a-9e98-a0cb08c3441f/75
* much later in training
* wrong temporal size
* converged
* fix lane changes
* cleanup simulator files
* minor updates
* update readme
* keras runner builds
* hmm, still doesn't work
* keras runner works
* should work with python3 keras mod
* touchups