주요 논문
5
*2026년 기준 최근 6년 이내 논문에 한해 Impact Factor가 표기됩니다.
1
article
|
·
인용수 4
·
2024Finite-Plane Simultaneous Localization and Mapping (FP-SLAM): A New RGB-D SLAM Exploiting Interfeature Relationship
Hae Min Cho, Euntai Kim
IF 5.9 (2024)
IEEE Transactions on Instrumentation and Measurement
This article introduces a novel feature-based visual simultaneous localization and mapping (v-SLAM), termed finite-plane (FP)-SLAM, using an RGB-D camera Specifically, the problem solved in this article is to extract sparse features effectively from RGB-D images and build a graph consisting of poses and sparse features as accurately as possible in real time on CPU. Motivated by surfel-point SLAM (SP-SLAM), FP-SLAM extracts points and surfels from RGB-D images as sparse features and optimizes a pose and feature graph jointly. Compared to SP-SLAM, however, two new residuals are proposed to optimize the graph and improved the accuracy in FP-SLAM. These new residuals exploit the relationships between surfel-surfel (SS) and between point-surfel (PS), capturing interfeature relations. Incorporating these new residuals with those from SP-SLAM, a total of four residuals are employed to jointly optimize points, surfels, and camera poses. In addition, to expedite processing, the Jacobian of the error function is directly implemented in the optimization instead of using the off-the-shelve derivative module to speed up the processing. Finally, the effectiveness of the proposed FP-SLAM is validated on benchmark datasets by comparing it against previous methods in terms of localization accuracy.
https://doi.org/10.1109/tim.2024.3418089
Simultaneous localization and mapping
Artificial intelligence
Computer vision
Feature (linguistics)
Computer science
RGB color model
Feature extraction
Plane (geometry)
Pattern recognition (psychology)
Mobile robot
2
article
|
·
인용수 3
·
2024Correlation Verification for Image Retrieval and Its Memory Footprint Optimization
Seongwon Lee, Hongje Seong, Suhyeon Lee, Euntai Kim
IF 18.6 (2024)
IEEE Transactions on Pattern Analysis and Machine Intelligence
In this paper, we propose a novel image retrieval network named Correlation Verification Network (CVNet) to replace the conventional geometric re-ranking with a 4D convolutional neural network that learns diverse geometric matching possibilities. To enable efficient cross-scale matching, we construct feature pyramids and establish cross-scale feature correlations in a single inference, thereby replacing the costly multi-scale inference. Additionally, we employ curriculum learning with the Hide-and-Seek strategy to handle challenging samples. Our proposed CVNet demonstrates state-of-the-art performance on several image retrieval benchmarks by a large margin. From an implementation perspective, however, CVNet has one drawback: it requires high memory usage because it needs to store dense features of all database images. This high memory requirement can be a significant limitation in practical applications. To address this issue, we introduce an extension of CVNet called Dense-to-Sparse CVNet (CVNet), which can significantly reduce memory usage by sparsifying the features of the database images. The sparsification module in CVNet learns to select the relevant parts of image features end-to-end using a Gumbel estimator. Since the sparsification is performed offline, CVNet does not increase online extraction and matching times. CVNet dramatically reduces the memory footprint while preserving performance levels nearly identical to CVNet.
https://doi.org/10.1109/tpami.2024.3504274
Computer science
Artificial intelligence
Footprint
Correlation
Memory footprint
Computer vision
Image retrieval
Pattern recognition (psychology)
Image (mathematics)
Image processing
3
article
|
·
인용수 1
·
2024Drivable Region Completion via a 3D LiDAR
Wonje Jang, Euntai Kim
IF 8.4 (2024)
IEEE Transactions on Intelligent Transportation Systems
Three-dimensional light detection and ranging (3D LiDAR) sensors are widely used in autonomous vehicles. Among various perception problems using a 3D LiDAR, the identification of drivable region (DR) is one of the most important problems. In this paper, the DR identification using a 3D LiDAR is reformulated as a completion problem. In other words, the problem is to generate the virtual points so that the virtual points cover the DR as densely and uniformly as possible, thereby representing the DR well enough. The problem considered herein is named the DR completion problem. To solve the DRC, a new network called the Drivable Region Completion Network (DRCN) is proposed. The proposed DRCN consists of an encoder part and a decoder part. The encoder part consists of two encoders. The first one is the Multi-Layer Perceptron (MLP) encoder, and it captures the global feature of the given LiDAR points. The second one is named the Point Pyramid Network (PPN) encoder, and it extracts local features between the points in the point cloud. The decoder part takes the coarse-to-fine structure, and it consists of the coarse and refinement decoders. The coarse decoder predicts seed points for the DR. The refinement decoder refines the seed points to predict the final dense DR points. Finally, the DRCN is applied to the SemanticKITTI dataset for evaluation. The proposed DRCN is validated by showing that our DRCN outperforms other DR detection methods in terms of accuracy.
https://doi.org/10.1109/tits.2024.3367130
Lidar
Computer science
Artificial intelligence
Computer vision
Remote sensing
Geography
4
article
|
·
인용수 17
·
2022Fallen person detection for autonomous driving
Suhyeon Lee, Sangyong Lee, Hongje Seong, Junhyuk Hyun, Euntai Kim
IF 8.5 (2022)
Expert Systems with Applications
https://doi.org/10.1016/j.eswa.2022.119242
Computer science
Artificial intelligence
Computer vision
Image (mathematics)
Set (abstract data type)
Domain (mathematical analysis)
Pixel
Mathematics
5
article
|
·
인용수 19
·
2022Video Object Segmentation Using Kernelized Memory Network With Multiple Kernels
Hongje Seong, Junhyuk Hyun, Euntai Kim
IF 23.6 (2022)
IEEE Transactions on Pattern Analysis and Machine Intelligence
on DAVIS 2016 validation set are 0.12 and 0.13 seconds per frame, respectively, and the two networks have similar computation times to STM.
https://doi.org/10.1109/tpami.2022.3163375
Computer science
Artificial intelligence
Kernel (algebra)
Segmentation
Matching (statistics)
Frame (networking)
Computer vision
Pattern recognition (psychology)
Set (abstract data type)
Object (grammar)