General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor

dc.contributor.authorYu Xianjia
dc.contributor.authorSalimpour Sahar
dc.contributor.authorPeña Queralta Jorge
dc.contributor.authorWesterlund Tomi
dc.contributor.organizationfi=robotiikka ja autonomiset järjestelmät|en=Robotics and Autonomous Systems|
dc.contributor.organization-code1.2.246.10.2458963.20.72785230805
dc.converis.publication-id179338592
dc.converis.urlhttps://research.utu.fi/converis/portal/Publication/179338592
dc.date.accessioned2026-01-21T12:26:20Z
dc.date.available2026-01-21T12:26:20Z
dc.description.abstractOver the last decade, robotic perception algorithms have significantly benefited from the rapid advances in deep learning (DL). Indeed, a significant amount of the autonomy stack of different commercial and research platforms relies on DL for situational awareness, especially vision sensors. This work explored the potential of general-purpose DL perception algorithms, specifically detection and segmentation neural networks, for processing image-like outputs of advanced lidar sensors. Rather than processing the three-dimensional point cloud data, this is, to the best of our knowledge, the first work to focus on low-resolution images with a 360 degrees field of view obtained with lidar sensors by encoding either depth, reflectivity, or near-infrared light in the image pixels. We showed that with adequate preprocessing, general-purpose DL models can process these images, opening the door to their usage in environmental conditions where vision sensors present inherent limitations. We provided both a qualitative and quantitative analysis of the performance of a variety of neural network architectures. We believe that using DL models built for visual cameras offers significant advantages due to their much wider availability and maturity compared to point cloud-based perception.
dc.identifier.jour-issn1424-8220
dc.identifier.olddbid212485
dc.identifier.oldhandle10024/195503
dc.identifier.urihttps://www.utupub.fi/handle/11111/52254
dc.identifier.urlhttps://www.mdpi.com/1424-8220/23/6/2936
dc.identifier.urnURN:NBN:fi-fe2023042739027
dc.language.isoen
dc.okm.affiliatedauthorYu, Xianjia
dc.okm.affiliatedauthorSalimpourkasebi, Sahar
dc.okm.affiliatedauthorPeña Queralta, Jorge
dc.okm.affiliatedauthorWesterlund, Tomi
dc.okm.discipline113 Computer and information sciencesen_GB
dc.okm.discipline213 Electronic, automation and communications engineering, electronicsen_GB
dc.okm.discipline113 Tietojenkäsittely ja informaatiotieteetfi_FI
dc.okm.discipline213 Sähkö-, automaatio- ja tietoliikennetekniikka, elektroniikkafi_FI
dc.okm.internationalcopublicationnot an international co-publication
dc.okm.internationalityInternational publication
dc.okm.typeA1 ScientificArticle
dc.publisherMDPI
dc.publisher.countrySwitzerlanden_GB
dc.publisher.countrySveitsifi_FI
dc.publisher.country-codeCH
dc.publisher.placeBasel
dc.relation.articlenumber2936
dc.relation.doi10.3390/s23062936
dc.relation.ispartofjournalSensors
dc.relation.issue6
dc.relation.volume23
dc.source.identifierhttps://www.utupub.fi/handle/10024/195503
dc.titleGeneral-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor
dc.year.issued2023

Tiedostot

Näytetään 1 - 1 / 1
Ladataan...
Name:
sensors-23-02936.pdf
Size:
5.24 MB
Format:
Adobe Portable Document Format