Publication:
Improving regression performance on monocular 3D object detection using bin-mixing and sparse voxel data

dc.contributor.authorBalatkan, Eren
dc.contributor.authorKıraç, Mustafa Furkan
dc.contributor.departmentComputer Science
dc.contributor.ozuauthorKIRAÇ, Mustafa Furkan
dc.contributor.ozugradstudentBalatkan, Eren
dc.date.accessioned2023-05-22T08:32:20Z
dc.date.available2023-05-22T08:32:20Z
dc.date.issued2021
dc.description.abstractAccurate and fast 3D object detection plays a role of paramount importance for safe and capable autonomous machines. LiDAR point cloud based methods have demonstrated impressive results, yet expensive LiDAR sensors make such approaches infeasible for wide-scale adaptation. Camera based methods on the other hand are performing sub-optimally given safety and accuracy requirements. Traditionally, camera based 3D object detection is performed by generating pseudo-LiDAR point clouds from RGB-D data and using point-cloud based models, however, irregular nature of point cloud data representation makes it challenging to exploit spatial local correlations on 3D space and point cloud based models generally suffer from this. To this end, we propose Sparse Voxel based 3D Object Detection, our approach differs from traditional approaches by converting point cloud information to sparse voxel grid and utilizing sub-manifold sparse convolutions to extract information instead of PointNet based models. Furthermore, we propose Bin-Mixing layers. Bin-Mixing replaces the output layer of a neural network and boosts performance by representing the problem of regression in a fashion that is easier for network to learn.en_US
dc.identifier.doi10.1109/UBMK52708.2021.9558880en_US
dc.identifier.endpage424en_US
dc.identifier.scopus2-s2.0-85125842056
dc.identifier.startpage419en_US
dc.identifier.urihttp://hdl.handle.net/10679/8309
dc.identifier.urihttps://doi.org/10.1109/UBMK52708.2021.9558880
dc.language.isoengen_US
dc.publicationstatusPublisheden_US
dc.publisherIEEEen_US
dc.relation.ispartof2021 6th International Conference on Computer Science and Engineering (UBMK)
dc.relation.publicationcategoryInternational
dc.rightsrestrictedAccess
dc.subject.keywords3D object detectionen_US
dc.subject.keywordsBin mixingen_US
dc.subject.keywordsSparse voxel griden_US
dc.subject.keywordsSub-manifold sparse convolutionen_US
dc.titleImproving regression performance on monocular 3D object detection using bin-mixing and sparse voxel dataen_US
dc.typeconferenceObjecten_US
dspace.entity.typePublication
relation.isOrgUnitOfPublication85662e71-2a61-492a-b407-df4d38ab90d7
relation.isOrgUnitOfPublication.latestForDiscovery85662e71-2a61-492a-b407-df4d38ab90d7

Files

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.45 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections