Use this identifier to cite or link to this item:
https://repositorio.ufpe.br/handle/123456789/55203
Title: | A new approach to semantic mapping using reusable consolidated visual representations |
Author(s): | SOUSA, Ygor César Nogueira |
Keywords: | Computational intelligence; Topological semantic mapping; Mobile robotics |
Document date: | 28-Aug-2023 |
Publisher: | Universidade Federal de Pernambuco |
Citation: | SOUSA, Ygor César Nogueira. A new approach to semantic mapping using reusable consolidated visual representations. 2023. Thesis (Doctorate in Computer Science) – Universidade Federal de Pernambuco, Recife, 2023. |
Abstract: | The advancement of robotics may produce a positive impact on several aspects of our society. However, in order for robotic agents to assist humans in a variety of everyday activities, they need to possess representations of their environments that allow spatial and human-centered semantic understanding. Many works in the recent literature use Convolutional Neural Network (CNN) models to recognize semantic properties of images and incorporate the results into traditional metric or topological maps, a procedure known as semantic mapping. The types of semantic properties (e.g., room size, place category, and objects) and their semantic classes (e.g., kitchen and bedroom, for place category) are usually previously defined and restricted to the planned tasks. Thus, all the visual data acquired and processed during the construction of the maps is lost, and only the recognized semantic properties remain on the maps. In contrast, this research proposes using the visual data acquired during the mapping process to create reusable representations of regions by consolidating deep features extracted from the data. These consolidated representations would allow the recognition of new semantic information in a flexible way, and consequently, the adaptation of the semantics of the maps to new requirements of new tasks without the need for remapping. Such use of reusable consolidated representations for the generation of semantic maps is demonstrated in a topological mapping method that creates consolidated representations of deep visual features extracted from RGB images captured around each topological node. This is done using a process we denote as Topological Consolidation of Features by Moving Averages (TCMA). 
Experiments performed with real-world indoor datasets suggested that the proposed method is able to create consolidated representations that fairly preserve the visual features of the original images they consolidate and do not degrade in quality over time. Furthermore, the very promising results suggested that the consolidated representations produced are suitable for recognizing different semantic properties, indicating the topological location of images, and adapting previously created maps with new semantic information. The experiments included two different CNNs for deep feature extraction, classifiers trained on large-scale datasets from the literature, and more practical real-time scenarios. Different variations of the method were evaluated, including a derivation of the TCMA process that uses the arithmetic mean of multiple exponential moving averages. |
URI: | https://repositorio.ufpe.br/handle/123456789/55203 |
Appears in collections: | Doctoral Theses - Computer Science |
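The abstract describes consolidating deep features per topological node with moving averages, plus a variant that takes the arithmetic mean of several exponential moving averages. The sketch below is only an illustration of that general idea, not the thesis's TCMA implementation: the class names `EmaConsolidator` and `MultiEmaConsolidator`, the smoothing factors, and the choice to initialize each average with the first feature vector are all assumptions made here, and feature vectors are plain lists of floats rather than actual CNN outputs.

```python
from typing import List, Optional, Sequence


class EmaConsolidator:
    """Hypothetical helper: one exponential moving average of feature vectors."""

    def __init__(self, alpha: float) -> None:
        # alpha is an assumed smoothing factor; the abstract does not give values.
        if not 0.0 < alpha <= 1.0:
            raise ValueError("alpha must be in (0, 1]")
        self.alpha = alpha
        self.value: Optional[List[float]] = None

    def update(self, features: Sequence[float]) -> List[float]:
        """Fold one new feature vector into the running average."""
        if self.value is None:
            # Assumption: the first observation initializes the average.
            self.value = list(features)
        else:
            if len(features) != len(self.value):
                raise ValueError("feature vectors must have the same length")
            self.value = [
                self.alpha * f + (1.0 - self.alpha) * v
                for f, v in zip(features, self.value)
            ]
        return self.value


class MultiEmaConsolidator:
    """Hypothetical helper: arithmetic mean of several EMAs with different alphas."""

    def __init__(self, alphas: Sequence[float]) -> None:
        if not alphas:
            raise ValueError("at least one alpha is required")
        self.emas = [EmaConsolidator(a) for a in alphas]

    def update(self, features: Sequence[float]) -> List[float]:
        """Update every EMA, then average their states element-wise."""
        states = [ema.update(features) for ema in self.emas]
        count = len(states)
        return [sum(column) / count for column in zip(*states)]


# Usage: two toy "feature vectors" observed around a single node.
single = EmaConsolidator(alpha=0.5)
single.update([0.0, 2.0])
single_result = single.update([2.0, 0.0])

multi = MultiEmaConsolidator(alphas=[0.5, 1.0])
multi.update([0.0, 2.0])
multi_result = multi.update([2.0, 0.0])
```

Under these assumptions, a node keeps only a fixed-size consolidated vector instead of every image, which is what would let a new classifier be applied later without remapping.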
Files associated with this item:
File | Description | Size | Format | |
---|---|---|---|---|
TESE Ygor César Nogueira Sousa.pdf | | 14.64 MB | Adobe PDF | View/Open |
This file is protected by copyright.
This item is licensed under a Creative Commons License.