I want to detect both hands in an image at the same time. First, I would use an object detector to find the bounding boxes of the two hands. Then I would extract the depth information inside each bounding box and run the model on each crop separately. Is this feasible?
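
To make the cropping step concrete, here is a minimal sketch of what I have in mind, using NumPy. It assumes the depth map is a 2-D array aligned with the image and that the detector returns boxes as `(x_min, y_min, x_max, y_max)` in pixel coordinates. The function name `crop_depth_regions`, the box format, and the synthetic data are all assumptions for illustration; the detector and the per-hand model are not shown.

```python
import numpy as np


def crop_depth_regions(depth, boxes):
    """Return one depth crop per bounding box.

    depth: 2-D array of shape (H, W) holding depth values.
    boxes: iterable of (x_min, y_min, x_max, y_max) in pixels
           (assumed format; adapt it to the detector's actual output).
    """
    height, width = depth.shape
    crops = []
    for x_min, y_min, x_max, y_max in boxes:
        # Clamp the box to the image so a box that spills past the border
        # still yields a valid crop.
        x0, y0 = max(0, int(x_min)), max(0, int(y_min))
        x1, y1 = min(width, int(x_max)), min(height, int(y_max))
        if x1 <= x0 or y1 <= y0:
            raise ValueError(f"empty box after clamping: {(x_min, y_min, x_max, y_max)}")
        # Copy so later per-hand preprocessing does not modify the full depth map.
        crops.append(depth[y0:y1, x0:x1].copy())
    return crops


# Synthetic 6x8 depth map and two hypothetical hand boxes.
depth = np.arange(48, dtype=np.float32).reshape(6, 8)
hand_boxes = [(0, 1, 3, 4), (5, 2, 10, 6)]  # the second box extends past the right edge
crops = crop_depth_regions(depth, hand_boxes)
```

Each element of `crops` could then be resized or normalized and passed to the model independently, one call per hand.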