Martin Peterlin, Chief Technology Officer at Luxonis, demonstrates the company’s latest edge AI and vision technologies and products at the 2021 Embedded Vision Summit. Specifically, Peterlin demonstrates neural inference-controlled crop, zoom and H.265 encode.
Peterlin shows how to use a neural network to guide what portion of a high-resolution (12 Mpixel) image sensor is output to a 2 Mpixel (1920×1080) h.265 encoded video stream. This capability allows a neural network or computer vision algorithm (e.g., motion estimation) to guide where the action is in a given scene, and then 6x losslessly zoom into that action, h.265 encoding the resultant 1920×1080 region of interest.