Captures video frames and sends them to a server over gRPC.
Records synchronized audio and video: the audio is sent to Google Speech, and the video is used to extract features.
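
One way to keep the two streams aligned before sending is to stamp each video frame and audio chunk with a capture time from a shared clock and merge them in time order. The sketch below is only an illustration of that idea, not the project's actual code: the `Chunk` class and the `interleave` function are hypothetical names, and a real client would use the message classes generated from its own .proto file. It relies on the fact that gRPC Python client-streaming calls accept an iterator of request messages, so a generator like this could be passed to such a call.

```python
import heapq
from dataclasses import dataclass
from typing import Iterable, Iterator, Tuple


@dataclass
class Chunk:
    """Hypothetical message; a real client would use a protobuf-generated class."""
    kind: str          # "video" or "audio"
    timestamp_ms: int  # capture time on a clock shared by both recorders
    payload: bytes     # encoded frame or raw audio samples


def interleave(frames: Iterable[Tuple[int, bytes]],
               audio: Iterable[Tuple[int, bytes]]) -> Iterator[Chunk]:
    """Merge two timestamp-sorted streams into one stream ordered by capture time."""
    video_chunks = (Chunk("video", ts, data) for ts, data in frames)
    audio_chunks = (Chunk("audio", ts, data) for ts, data in audio)
    # heapq.merge is lazy, so chunks are produced as the inputs are consumed.
    yield from heapq.merge(video_chunks, audio_chunks,
                           key=lambda chunk: chunk.timestamp_ms)


if __name__ == "__main__":
    # Fake capture data: (timestamp in ms, payload).
    frames = [(0, b"f0"), (40, b"f1"), (80, b"f2")]
    audio = [(10, b"a0"), (50, b"a1")]
    for chunk in interleave(frames, audio):
        print(chunk.kind, chunk.timestamp_ms)
```

Because both inputs are assumed to be sorted by timestamp already, the merge never has to buffer a whole recording; the server can then route audio chunks to speech recognition and video chunks to feature extraction while keeping the timestamps for alignment.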