Please use this identifier to cite or link to this item:
https://repositorio.ufpe.br/handle/123456789/57554
Share on
Title: | ELODIN : naming concepts in embedding spaces |
Authors: | MELLO, Rodrigo Vitor Castro Alves de |
Keywords: | Inteligência computacional; Processamento de linguagem natural; Deep learning |
Issue Date: | 27-Sep-2023 |
Publisher: | Universidade Federal de Pernambuco |
Citation: | MELLO, Rodrigo Vitor Castro Alves de. ELODIN: naming concepts in embedding spaces. 2023. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de Pernambuco, Recife, 2023. |
Abstract: | Despite recent advancements, the field of text-to-image synthesis still suffers from the lack of fine-grained control. Using only text, it remains challenging to deal with issues such as concept coherence and concept cohesion. A method to enhance control by generating new words that can be reused throughout multiple images is proposed. Each new word, which I call “named concept”, can be mixed and matched freely with natural language, effectively expanding human vocabulary. Just as a painter combines pre-existing shades into personalized colors according to their needs, the proposed method enables combining e.g. “yellow” and “hawk” into a single word, that is, a single named concept. The new word, when present in subsequent text prompts, results in images that consistently contain the same yellow hawk. Unlike previous contributions, our method does not replicate visuals from input data. In some cases, it can generate visual concepts in a zero-shot manner, that is, without any visual input. A set of comparisons show our method to be a significant improvement over text prompts containing only natural language. Theoretical considerations on the foundations of Deep Learning are made throughout the text and Name Learning is proposed. |
URI: | https://repositorio.ufpe.br/handle/123456789/57554 |
Appears in Collections: | Dissertações de Mestrado - Ciência da Computação |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
DISSERTAÇÃO Rodrigo Vitor Castro Alves de Mello.pdf | 15,34 MB | Adobe PDF | ![]() View/Open |
This item is protected by original copyright |
This item is licensed under a Creative Commons License