I hope you are all doing well.
I’m working on a project to develop a fully functional deep learning pipeline. My dataset consists of 2D data (images and masks) and 3D data (point clouds and 3D bounding boxes, bbox3d).
<code>Data format:
IMAGE Shape: (451, 706, 3)
MASK Shape: (451, 706, 9)
PC Shape: (3, 451, 706)
BBOX3D Shape: (9, 8, 3)
</code>
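To make the layout concrete, here is a minimal sketch of how one sample with these shapes could be wrapped in a PyTorch `Dataset`. It rests on assumptions that are my reading of the shapes rather than confirmed facts: the 9 mask channels and the 9 boxes are per-object entries, each box is 8 corners with XYZ coordinates, and the point cloud stores one XYZ point per image pixel. `MultiModalDataset` and `make_fake_sample` are hypothetical names, and the data is random.

```python
import torch
from torch.utils.data import Dataset, DataLoader

H, W, NUM_OBJECTS = 451, 706, 9  # taken from the shapes listed above


class MultiModalDataset(Dataset):
    """Hypothetical wrapper: `samples` is a list of dicts holding the four arrays."""

    def __init__(self, samples):
        self.samples = samples

    def __len__(self):
        return len(self.samples)

    def __getitem__(self, idx):
        s = self.samples[idx]
        # Image and mask are stored height-width-channel; PyTorch conv layers
        # expect channel-first, so move the last axis to the front.
        image = torch.as_tensor(s["image"], dtype=torch.float32).permute(2, 0, 1)
        mask = torch.as_tensor(s["mask"], dtype=torch.float32).permute(2, 0, 1)
        # The point cloud is already channel-first (assumed XYZ per pixel).
        pc = torch.as_tensor(s["pc"], dtype=torch.float32)
        # Assumed layout: (objects, 8 corners, XYZ).
        bbox3d = torch.as_tensor(s["bbox3d"], dtype=torch.float32)
        return {"image": image, "mask": mask, "pc": pc, "bbox3d": bbox3d}


def make_fake_sample():
    # Random stand-in data with the same shapes as the real samples.
    return {
        "image": torch.rand(H, W, 3),
        "mask": torch.randint(0, 2, (H, W, NUM_OBJECTS)),
        "pc": torch.rand(3, H, W),
        "bbox3d": torch.rand(NUM_OBJECTS, 8, 3),
    }


dataset = MultiModalDataset([make_fake_sample() for _ in range(4)])
loader = DataLoader(dataset, batch_size=2)
batch = next(iter(loader))

# One possible input: concatenate RGB and XYZ into a 6-channel image.
rgbxyz = torch.cat([batch["image"], batch["pc"]], dim=1)
print(rgbxyz.shape, batch["mask"].shape, batch["bbox3d"].shape)
```

The default batching only works here because every sample has the same number of boxes; if the object count varies between samples, a custom `collate_fn` would be needed.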
I am just starting with deep learning. I spent a couple of days learning PyTorch and doing some research to build an understanding of the subject. I am trying to find a model that can handle this data, but I looked through open-source models and couldn’t find any that can handle this dataset.
I would appreciate any suggestions you can provide. Thank you.