Banca de QUALIFICAÇÃO: GABRIEL CALDAS BARROS E SÁ

Uma banca de QUALIFICAÇÃO de MESTRADO foi cadastrada pelo programa.
STUDENT : GABRIEL CALDAS BARROS E SÁ
DATE: 27/03/2024
TIME: 14:40
LOCAL: https://meet.google.com/pxh-ncyf-eje
TITLE:

Jaguar: A Hierarchical Multi-Agent Deep Reinforcement Learning approach with Transfer Learning for StarCraft II


KEY WORDS:

Deep Reinforcement Learning; StarCraft II; Multi-Agent; Transfer Learning; Hierarchical Architecture.


PAGES: 50
BIG AREA: Ciências Exatas e da Terra
AREA: Ciência da Computação
SUMMARY:

Real-time strategy games are environments that usually simulate real military situations and present a set of challenges for the field of Artificial Intelligence, such as the high complexity and large space of actions and states, partially observable maps and the fact that they deal with multiple agents at the same time, in addition to the tasks being able to be performed within the scope of micromanagement or macromanagement. In particular, Reinforcement Learning has been highlighted in the application and evolution of techniques capable to deal with these challenges, especially with the advent of Deep Reinforcement Learning. A systematic literature review was conducted with the goal of understanding and summarizing which environments, techniques, tools and architectures make up the state of the art of Deep Reinforcement Learning in Real-Time Strategy games. From the information obtained by the review, this work proposes to develop a Hierarchical Multi-Agent approach focused on the game StarCraft II, using Transfer Learning and action masking techniques to aid the agent training consume less resources and obtain satisfactory results even for more complex scenarios.


COMMITTEE MEMBERS:
Presidente - 2978747 - CHARLES ANDRYE GALVAO MADEIRA
Interno - 1669545 - DANIEL SABINO AMORIM DE ARAUJO
Externo ao Programa - 2885532 - IVANOVITCH MEDEIROS DANTAS DA SILVA - UFRN
Notícia cadastrada em: 02/04/2024 08:37
SIGAA | Superintendência de Tecnologia da Informação - (84) 3342 2210 | Copyright © 2006-2024 - UFRN - sigaa02-producao.info.ufrn.br.sigaa02-producao