General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor

Yu Xianjia; Salimpour Sahar; Peña Queralta Jorge; Westerlund Tomi

General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor

dc.contributor.author	Yu Xianjia
dc.contributor.author	Salimpour Sahar
dc.contributor.author	Peña Queralta Jorge
dc.contributor.author	Westerlund Tomi
dc.contributor.organization	fi=robotiikka ja autonomiset järjestelmät\|en=Robotics and Autonomous Systems\|
dc.contributor.organization-code	1.2.246.10.2458963.20.72785230805
dc.converis.publication-id	179338592
dc.converis.url	https://research.utu.fi/converis/portal/Publication/179338592
dc.date.accessioned	2026-01-21T12:26:20Z
dc.date.available	2026-01-21T12:26:20Z
dc.description.abstract	Over the last decade, robotic perception algorithms have significantly benefited from the rapid advances in deep learning (DL). Indeed, a significant amount of the autonomy stack of different commercial and research platforms relies on DL for situational awareness, especially vision sensors. This work explored the potential of general-purpose DL perception algorithms, specifically detection and segmentation neural networks, for processing image-like outputs of advanced lidar sensors. Rather than processing the three-dimensional point cloud data, this is, to the best of our knowledge, the first work to focus on low-resolution images with a 360 degrees field of view obtained with lidar sensors by encoding either depth, reflectivity, or near-infrared light in the image pixels. We showed that with adequate preprocessing, general-purpose DL models can process these images, opening the door to their usage in environmental conditions where vision sensors present inherent limitations. We provided both a qualitative and quantitative analysis of the performance of a variety of neural network architectures. We believe that using DL models built for visual cameras offers significant advantages due to their much wider availability and maturity compared to point cloud-based perception.
dc.identifier.jour-issn	1424-8220
dc.identifier.olddbid	212485
dc.identifier.oldhandle	10024/195503
dc.identifier.uri	https://www.utupub.fi/handle/11111/52254
dc.identifier.url	https://www.mdpi.com/1424-8220/23/6/2936
dc.identifier.urn	URN:NBN:fi-fe2023042739027
dc.language.iso	en
dc.okm.affiliatedauthor	Yu, Xianjia
dc.okm.affiliatedauthor	Salimpourkasebi, Sahar
dc.okm.affiliatedauthor	Peña Queralta, Jorge
dc.okm.affiliatedauthor	Westerlund, Tomi
dc.okm.discipline	113 Computer and information sciences	en_GB
dc.okm.internationalcopublication	not an international co-publication
dc.okm.internationality	International publication
dc.okm.type	A1 ScientificArticle
dc.publisher	MDPI
dc.publisher.country	Switzerland	en_GB
dc.publisher.country	Sveitsi	fi_FI
dc.publisher.country-code	CH
dc.publisher.place	Basel
dc.relation.articlenumber	2936
dc.relation.doi	10.3390/s23062936
dc.relation.ispartofjournal	Sensors
dc.relation.issue	6
dc.relation.volume	23
dc.source.identifier	https://www.utupub.fi/handle/10024/195503
dc.title	General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor
dc.year.issued	2023

Tiedostot

Näytetään 1 - 1 / 1

Name:: sensors-23-02936.pdf
Size:: 5.24 MB
Format:: Adobe Portable Document Format

Lataa

Kokoelmat

Rinnakkaistallenteet