Q-Learning in Regularized Mean-field Games
In: Dynamic games and applications: DGA
ISSN: 2153-0793
The solution to a Multi-Objective Reinforcement Learning problem is a set of Pareto-optimal policies. MPQ-learning is a recent algorithm that approximates the set of all Pareto-optimal deterministic policies by directly generalizing Q-learning to the multiobjective setting. In this paper we present a modification of MPQ-learning that avoids useless cyclical policies and thus reduces the number of training steps required for convergence. ; Supported by: the Spanish Government, Agencia Estatal de Investigación (AEI) and European Union, Fondo Europeo de Desarrollo Regional (FEDER), grant TIN2016-80774-R (AEI/FEDER, UE); and Plan Propio de Investigación de la Universidad de Málaga - Campus de Excelencia Internacional Andalucía Tech.
BASE
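As a rough illustration of the Pareto-dominance notion that multiobjective Q-learning methods such as MPQ-learning build on (the function names and sample vectors below are illustrative, not taken from the paper), the core filtering step keeps only non-dominated value vectors:

```python
def dominates(u, v):
    """u Pareto-dominates v: at least as good in every objective, strictly better in one."""
    return all(a >= b for a, b in zip(u, v)) and any(a > b for a, b in zip(u, v))

def pareto_front(vectors):
    """Keep only the non-dominated value vectors (the Pareto-optimal set)."""
    return [u for u in vectors if not any(dominates(v, u) for v in vectors if v != u)]
```

In a vector-valued Q-learning scheme, a filter of this kind is applied to the sets of value vectors attached to each state-action pair, so that only candidates for Pareto-optimal policies are propagated.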
In: RAND Journal of Economics, Forthcoming
SSRN
Working paper
In: This is a pre-print of an article published in Communications in Nonlinear Science and Numerical Simulation (2021). The final authenticated version is available online at DOI: doi.org/10.1016/j.cnsns.2021.105805
SSRN
Working paper
In: The Rand journal of economics, Band 52, Heft 3, S. 538-558
ISSN: 1756-2171
Prices are increasingly set by algorithms. One concern is that intelligent algorithms may learn to collude on higher prices even in the absence of the kind of coordination necessary to establish an antitrust infringement. However, exactly how this may happen is an open question. I show how, in simulated sequential competition, competing reinforcement learning algorithms can indeed learn to converge to collusive equilibria when the set of discrete prices is limited. When this set increases, the algorithm considered increasingly converges to supra-competitive asymmetric cycles. I show that results are robust to various extensions and discuss practical limitations and policy implications.
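A toy version of such a simulation (the price grid, parameters, and winner-takes-all demand rule are illustrative assumptions, not the paper's actual setup) can be sketched as two independent epsilon-greedy Q-learners repeatedly posting prices from a small discrete set:

```python
import random

def price_duopoly(prices=(1.0, 1.5, 2.0), episodes=5000,
                  alpha=0.1, gamma=0.9, eps=0.1, seed=0):
    """Two independent epsilon-greedy Q-learners in a repeated pricing game.
    Each seller's state is the index of the rival's previous price."""
    rng = random.Random(seed)
    n = len(prices)
    Q = [[[0.0] * n for _ in range(n)] for _ in range(2)]  # [seller][state][action]
    state = [0, 0]
    for _ in range(episodes):
        acts = []
        for i in (0, 1):
            if rng.random() < eps:
                acts.append(rng.randrange(n))          # explore
            else:
                row = Q[i][state[i]]
                acts.append(row.index(max(row)))       # exploit
        p0, p1 = prices[acts[0]], prices[acts[1]]
        # Winner-takes-all demand: the cheaper seller serves the market, ties split it.
        if p0 < p1:
            rewards = (p0, 0.0)
        elif p1 < p0:
            rewards = (0.0, p1)
        else:
            rewards = (p0 / 2.0, p1 / 2.0)
        for i in (0, 1):
            nxt = acts[1 - i]  # the rival's current price index becomes the next state
            q = Q[i][state[i]][acts[i]]
            Q[i][state[i]][acts[i]] = q + alpha * (rewards[i] + gamma * max(Q[i][nxt]) - q)
            state[i] = nxt
    return Q
```

Inspecting the greedy policies induced by the learned Q-tables is how such studies check whether play settles on competitive prices, collusive prices, or price cycles.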
In: International journal of academic research, Band 4, Heft 4, S. 89-94
ISSN: 2075-7107
In: Amsterdam Law School Research Paper No. 2022-25
SSRN
The electricity markets restructuring process encouraged the use of computational tools to allow the study of different market mechanisms and the relationships between the participating entities. Automated negotiation plays a crucial role in decision support for energy transactions due to the constant need for players to engage in bilateral negotiations. This paper proposes a methodology to estimate bilateral contract prices, which is essential to support market players in their decisions, enabling adequate risk management of the negotiation process. The proposed approach uses an adaptation of the Q-learning reinforcement learning algorithm to choose the best from a set of possible contract price forecasts, determined using several methods such as artificial neural networks (ANN) and support vector machines (SVM). The learning process assesses the probability of success of each forecasting method by comparing the expected negotiation price with the historic contract data of competitor players. The negotiation scenario identified as the one the player will most probably face is the one with the highest expected utility value. This approach allows the supported player to prepare for the negotiation scenario most likely to be a reliable approximation of the actual negotiation environment. ; This work has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 703689 (project ADAPT) and No 641794 (project DREAM-GO); NetEfficity Project (P2020 − 18015); and UID/EEA/00760/2013 funded by FEDER Funds through the COMPETE program and by National Funds through FCT.
BASE
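In the same spirit, a stateless Q-learning scorer over competing forecasters can be sketched as follows (the method names, accuracy-based reward, and parameters are assumptions for illustration, not the paper's exact formulation):

```python
import random

def select_forecaster(forecasts, realized, alpha=0.2, eps=0.1, seed=1):
    """Learn which forecasting method to trust via a one-state Q-table.

    forecasts: dict mapping method name -> list of predicted contract prices
    realized:  list of observed contract prices (e.g. historic competitor data)
    """
    rng = random.Random(seed)
    methods = list(forecasts)
    q = {m: 0.0 for m in methods}
    for t, actual in enumerate(realized):
        # epsilon-greedy choice of which forecaster to trust this round
        m = rng.choice(methods) if rng.random() < eps else max(q, key=q.get)
        reward = 1.0 / (1.0 + abs(forecasts[m][t] - actual))  # accuracy as reward
        q[m] += alpha * (reward - q[m])                       # stateless Q-update
    return max(q, key=q.get), q
```

The Q-values then play the role of the success scores the abstract describes: the method with the highest learned value is the one whose forecasts have tracked realized contract prices most closely.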
Electricity markets are complex environments that have been undergoing continuous transformation due to the increase of renewable-based generation and the introduction of new players in the system. In this context, players are forced to re-think their behavior and learn how to act in this dynamic environment in order to get as much benefit as possible from market negotiations. This paper introduces a new learning model that enables players to identify the expected prices of future bilateral agreements, as a way to improve decision-making when choosing the opponent players to approach for actual negotiations. The proposed model introduces a contextual dimension into the well-known Q-learning algorithm, and includes a simulated annealing process to accelerate convergence. The model is integrated in a multi-agent decision support system for electricity market players' negotiations, enabling experimental validation using real data from the Iberian electricity market. ; This work has received funding from the European Union's Horizon 2020 research and innovation programme under project DOMINOES (grant agreement No 771066) and from FEDER Funds through the COMPETE program and from National Funds through FCT under the project UID/EEA/00760/2019.
BASE
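A compact sketch of combining a contextual Q-table with an annealing schedule (the contexts, opponent names, reward function, and cooling parameters below are all illustrative assumptions, not the paper's model):

```python
import math
import random

def contextual_q(contexts, opponents, episodes, reward_fn,
                 alpha=0.1, T0=1.0, decay=0.995, seed=0):
    """Contextual Q-table over (context, opponent) pairs, with a simulated-annealing
    temperature that cools Boltzmann exploration over time."""
    rng = random.Random(seed)
    Q = {(c, o): 0.0 for c in contexts for o in opponents}
    T = T0
    for _ in range(episodes):
        c = rng.choice(contexts)  # observed market context for this episode
        # Boltzmann (softmax) selection at temperature T (clamped to avoid overflow)
        ws = [math.exp(Q[(c, o)] / max(T, 0.05)) for o in opponents]
        r, acc, pick = rng.random() * sum(ws), 0.0, opponents[-1]
        for o, w in zip(opponents, ws):
            acc += w
            if r <= acc:
                pick = o
                break
        Q[(c, pick)] += alpha * (reward_fn(c, pick) - Q[(c, pick)])
        T *= decay  # annealing: exploration fades as the temperature cools
    return Q
```

As the temperature drops, selection sharpens toward the opponents whose negotiations have paid off in each context, which is the convergence-acceleration role the abstract attributes to the simulated annealing process.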
Underwater Wireless Sensor Networks (UWSNs) have recently aroused increasing interest among researchers in industry, the military, commerce, and academia. Due to the harsh underwater environment, energy efficiency is a significant concern for routing in UWSNs. Underwater positioning is also a particularly tricky task, given the high attenuation of radio-frequency signals underwater. In this paper, we propose an energy-efficient depth-based opportunistic routing algorithm with Q-learning (EDORQ) for UWSNs to guarantee energy-saving and reliable data transmission. It combines the respective advantages of the Q-learning technique and opportunistic routing (OR) without full-dimensional location information to improve network performance in terms of energy consumption, average network overhead, and packet delivery ratio. In EDORQ, the void detection factor, residual energy, and depth information of candidate nodes are jointly considered when defining the Q-value function, which contributes to proactively detecting void nodes in advance while reducing energy consumption. In addition, a simple and scalable void-node recovery mode is proposed for the selection of the candidate set, so as to rescue packets that get stuck at void nodes. Furthermore, we design a novel method to set the holding time for the scheduling of packet forwarding based on Q-values, so as to alleviate packet collisions and redundant transmissions. We conduct extensive simulations to evaluate the performance of the proposed algorithm and compare it with three other routing algorithms on the Aqua-Sim platform (NS2). The results show that the proposed algorithm significantly improves performance in terms of energy efficiency, packet delivery ratio, and average network overhead without sacrificing too much average packet delay.
BASE
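The Q-value-to-holding-time idea can be illustrated with a minimal monotone mapping (the linear form and the parameters are assumptions for illustration, not EDORQ's actual formula):

```python
def holding_time(q_value, q_max=1.0, t_max=2.0):
    """Map a candidate node's Q-value to a forwarding holding time (seconds):
    higher-Q candidates wait less, so the best relay forwards first and the
    other candidates overhear its transmission and cancel their own."""
    q = min(max(q_value, 0.0), q_max)  # clamp to [0, q_max]
    return t_max * (1.0 - q / q_max)
```

Any strictly decreasing mapping with this shape yields the coordination the abstract describes: staggered timers ordered by Q-value suppress duplicate forwarding and reduce packet collisions among candidates.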