URL: https://www.mdpi.com/2079-9292/12/13/2814
Authors: de Curtò, J / de Zarzà, I / Roig, Gemma / Cano, Juan / Manzoni, Pietro / Calafate, Carlos
Publication: Electronics
Place Published: MDPI
Volume / Number / Pagination: 12 / 13 / 2814
Palabras clave: multi-armed bandit, non-stationary environments, Large Language Models, AI strategy optimization, GPT-3.5-turbo, QLoRA