dragonpilot

dragonpilot - an open-source driving assistance system based on openpilot


Neural networks in openpilot

To view the architecture of the ONNX networks, you can use netron.

image stream
- Two consecutive images (256 * 512 * 3 in RGB) recorded at 20 Hz : 393216 = 2 * 6 * 128 * 256
  - Each 256 * 512 image is represented in YUV420 with 6 channels : 6 * 128 * 256
    - Channels 0,1,2,3 represent the full-res Y channel and are represented in numpy as Y[::2, ::2], Y[::2, 1::2], Y[1::2, ::2], and Y[1::2, 1::2]
    - Channel 4 represents the half-res U channel
    - Channel 5 represents the half-res V channel
wide image stream
- Two consecutive images (256 * 512 * 3 in RGB) recorded at 20 Hz : 393216 = 2 * 6 * 128 * 256
  - Each 256 * 512 image is represented in YUV420 with 6 channels : 6 * 128 * 256
    - Channels 0,1,2,3 represent the full-res Y channel and are represented in numpy as Y[::2, ::2], Y[::2, 1::2], Y[1::2, ::2], and Y[1::2, 1::2]
    - Channel 4 represents the half-res U channel
    - Channel 5 represents the half-res V channel
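The channel layout described for both streams can be sketched in NumPy. This is a minimal illustration, not the code openpilot uses: the helper name `pack_yuv420`, the `float32` dtype, the zero-filled placeholder planes, and stacking the two frames along a new leading axis are all assumptions made here to reproduce the 2 * 6 * 128 * 256 shape.

```python
import numpy as np

H, W = 256, 512  # full-resolution frame size given above


def pack_yuv420(y, u, v):
    """Pack one YUV420 frame into a (6, H/2, W/2) array.

    Hypothetical helper: channels 0-3 are the four interleaved
    subsamplings of the full-res Y plane, channel 4 is U, channel 5 is V.
    """
    h, w = y.shape
    assert h % 2 == 0 and w % 2 == 0
    assert u.shape == (h // 2, w // 2)
    assert v.shape == (h // 2, w // 2)
    return np.stack([
        y[::2, ::2],    # channel 0: even rows, even columns
        y[::2, 1::2],   # channel 1: even rows, odd columns
        y[1::2, ::2],   # channel 2: odd rows, even columns
        y[1::2, 1::2],  # channel 3: odd rows, odd columns
        u,              # channel 4: half-res U
        v,              # channel 5: half-res V
    ])


# Two consecutive frames; placeholder data stands in for real camera planes.
frames = []
for _ in range(2):
    y = np.zeros((H, W), dtype=np.float32)
    u = np.zeros((H // 2, W // 2), dtype=np.float32)
    v = np.zeros((H // 2, W // 2), dtype=np.float32)
    frames.append(pack_yuv420(y, u, v))

# Assumption: the two frames are stacked along a new leading axis.
image_stream = np.stack(frames)
print(image_stream.shape, image_stream.size)
```

The element count matches the figure in the list: 2 * 6 * 128 * 256 = 393216 values per stream.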
desire
- One-hot encoded vector that commands the model to execute certain actions; the bit only needs to be sent for 1 frame : 8
traffic convention
- One-hot encoded vector that tells the model whether traffic is right-hand or left-hand : 2
recurrent state
- The recurrent state vector that is fed back into the GRU for temporal context : 512
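The three small inputs above can be sketched as follows. This is only an illustration under stated assumptions: the text does not say which index means which action or which traffic side, so index 0 is a placeholder, and `fake_model` is a hypothetical stand-in for the real network, used only to show the recurrent state being fed back from one frame to the next.

```python
import numpy as np

DESIRE_LEN = 8              # size given above
TRAFFIC_CONVENTION_LEN = 2  # size given above
RECURRENT_STATE_LEN = 512   # size given above


def one_hot(index, length):
    """Return a float32 vector of `length` zeros with a 1 at `index`."""
    vec = np.zeros(length, dtype=np.float32)
    vec[index] = 1.0
    return vec


# Placeholder indices: the meaning of each slot is not specified in the text.
desire = one_hot(0, DESIRE_LEN)
traffic_convention = one_hot(0, TRAFFIC_CONVENTION_LEN)

# Assumption: the recurrent state starts at zero on the first frame.
recurrent_state = np.zeros(RECURRENT_STATE_LEN, dtype=np.float32)


def fake_model(desire, traffic_convention, recurrent_state):
    """Hypothetical stand-in for the network; returns a new 512-vector."""
    bias = desire.sum() + traffic_convention.sum()
    return np.tanh(recurrent_state + bias).astype(np.float32)


# Each frame's output state becomes the next frame's input state.
for _ in range(3):
    recurrent_state = fake_model(desire, traffic_convention, recurrent_state)

print(desire.shape, traffic_convention.shape, recurrent_state.shape)
```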

Read here for more.