Robots operating in human-centered environments, such as retail stores, restaurants, and households, are often required to distinguish between similar objects in different contexts with a high degree of accuracy. However, fine-grained object recognition remains a challenge in robotics due to the high intra-category and low inter-category dissimilarities. In addition, the limited number of fine-grained 3D datasets poses a significant problem in addressing this issue effectively. In this paper, we propose a hybrid multi-modal Vision Transformer (ViT) and Convolutional Neural Networks (CNN) approach to improve the performance of fine-grained visual classification (FGVC). To address the shortage of FGVC 3D datasets, we generated two synthetic d...
Moving object recognition (MOR) is an important but challenging problem in the field of computer vis...
Computer vision has been revolutionised in recent years by increased research in convolutional neura...
Service robots, in general, have to work independently and adapt to the dynamic changes happening in...
Robots operating in human-centered environments, such as retail stores, restaurants, and households,...
Robots operating in human-centered environments, such as retail stores, restaurants, and households,...
Robots operating in human-centered environments, such as retail stores, restaurants, and households,...
A robot working in a human-centered environment is frequently confronted with fine-grained objects t...
State-of-the-art object recognition systems, and computer vision methods in general, are getting bet...
State-of-the-art object recognition systems, and computer vision methods in general, are getting bet...
Abweichender Titel nach Übersetzung der Verfasserin/des VerfassersObject recognition, or object clas...
Environmental perception is a vital feature for service robots when working in an indoor environment...
Environmental perception is a vital feature for service robots when working in an indoor environment...
3D visual recognition is a prerequisite for most autonomous robotic systems operating in the real wo...
Recognizing 3-D objects has a wide range of application areas from autonomous robots to self-driving...
For various computer vision tasks, finding suitable feature representations is fundamental. Fine-gra...
Moving object recognition (MOR) is an important but challenging problem in the field of computer vis...
Computer vision has been revolutionised in recent years by increased research in convolutional neura...
Service robots, in general, have to work independently and adapt to the dynamic changes happening in...
Robots operating in human-centered environments, such as retail stores, restaurants, and households,...
Robots operating in human-centered environments, such as retail stores, restaurants, and households,...
Robots operating in human-centered environments, such as retail stores, restaurants, and households,...
A robot working in a human-centered environment is frequently confronted with fine-grained objects t...
State-of-the-art object recognition systems, and computer vision methods in general, are getting bet...
State-of-the-art object recognition systems, and computer vision methods in general, are getting bet...
Abweichender Titel nach Übersetzung der Verfasserin/des VerfassersObject recognition, or object clas...
Environmental perception is a vital feature for service robots when working in an indoor environment...
Environmental perception is a vital feature for service robots when working in an indoor environment...
3D visual recognition is a prerequisite for most autonomous robotic systems operating in the real wo...
Recognizing 3-D objects has a wide range of application areas from autonomous robots to self-driving...
For various computer vision tasks, finding suitable feature representations is fundamental. Fine-gra...
Moving object recognition (MOR) is an important but challenging problem in the field of computer vis...
Computer vision has been revolutionised in recent years by increased research in convolutional neura...
Service robots, in general, have to work independently and adapt to the dynamic changes happening in...