General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor
| dc.contributor.author | Yu Xianjia | |
| dc.contributor.author | Salimpour Sahar | |
| dc.contributor.author | Peña Queralta Jorge | |
| dc.contributor.author | Westerlund Tomi | |
| dc.contributor.organization | fi=robotiikka ja autonomiset järjestelmät|en=Robotics and Autonomous Systems| | |
| dc.contributor.organization-code | 1.2.246.10.2458963.20.72785230805 | |
| dc.converis.publication-id | 179338592 | |
| dc.converis.url | https://research.utu.fi/converis/portal/Publication/179338592 | |
| dc.date.accessioned | 2026-01-21T12:26:20Z | |
| dc.date.available | 2026-01-21T12:26:20Z | |
| dc.description.abstract | Over the last decade, robotic perception algorithms have significantly benefited from the rapid advances in deep learning (DL). Indeed, a significant amount of the autonomy stack of different commercial and research platforms relies on DL for situational awareness, especially vision sensors. This work explored the potential of general-purpose DL perception algorithms, specifically detection and segmentation neural networks, for processing image-like outputs of advanced lidar sensors. Rather than processing the three-dimensional point cloud data, this is, to the best of our knowledge, the first work to focus on low-resolution images with a 360 degrees field of view obtained with lidar sensors by encoding either depth, reflectivity, or near-infrared light in the image pixels. We showed that with adequate preprocessing, general-purpose DL models can process these images, opening the door to their usage in environmental conditions where vision sensors present inherent limitations. We provided both a qualitative and quantitative analysis of the performance of a variety of neural network architectures. We believe that using DL models built for visual cameras offers significant advantages due to their much wider availability and maturity compared to point cloud-based perception. | |
| dc.identifier.jour-issn | 1424-8220 | |
| dc.identifier.olddbid | 212485 | |
| dc.identifier.oldhandle | 10024/195503 | |
| dc.identifier.uri | https://www.utupub.fi/handle/11111/52254 | |
| dc.identifier.url | https://www.mdpi.com/1424-8220/23/6/2936 | |
| dc.identifier.urn | URN:NBN:fi-fe2023042739027 | |
| dc.language.iso | en | |
| dc.okm.affiliatedauthor | Yu, Xianjia | |
| dc.okm.affiliatedauthor | Salimpourkasebi, Sahar | |
| dc.okm.affiliatedauthor | Peña Queralta, Jorge | |
| dc.okm.affiliatedauthor | Westerlund, Tomi | |
| dc.okm.discipline | 113 Computer and information sciences | en_GB |
| dc.okm.discipline | 213 Electronic, automation and communications engineering, electronics | en_GB |
| dc.okm.discipline | 113 Tietojenkäsittely ja informaatiotieteet | fi_FI |
| dc.okm.discipline | 213 Sähkö-, automaatio- ja tietoliikennetekniikka, elektroniikka | fi_FI |
| dc.okm.internationalcopublication | not an international co-publication | |
| dc.okm.internationality | International publication | |
| dc.okm.type | A1 ScientificArticle | |
| dc.publisher | MDPI | |
| dc.publisher.country | Switzerland | en_GB |
| dc.publisher.country | Sveitsi | fi_FI |
| dc.publisher.country-code | CH | |
| dc.publisher.place | Basel | |
| dc.relation.articlenumber | 2936 | |
| dc.relation.doi | 10.3390/s23062936 | |
| dc.relation.ispartofjournal | Sensors | |
| dc.relation.issue | 6 | |
| dc.relation.volume | 23 | |
| dc.source.identifier | https://www.utupub.fi/handle/10024/195503 | |
| dc.title | General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor | |
| dc.year.issued | 2023 |
Tiedostot
1 - 1 / 1