Shohei Ohsawa

2papers

2 Papers

MLJun 6, 2021
Truthful Self-Play

Shohei Ohsawa

We present a general framework for evolutionary learning to emergent unbiased state representation without any supervision. Evolutionary frameworks such as self-play converge to bad local optima in case of multi-agent reinforcement learning in non-cooperative partially observable environments with communication due to information asymmetry. Our proposed framework is a simple modification of self-play inspired by mechanism design, also known as {\em reverse game theory}, to elicit truthful signals and make the agents cooperative. The key idea is to add imaginary rewards using the peer prediction method, i.e., a mechanism for evaluating the validity of information exchanged between agents in a decentralized environment. Numerical experiments with predator prey, traffic junction and StarCraft tasks demonstrate that the state-of-the-art performance of our framework.

CRJun 14, 2019
What is Stablecoin?: A Survey on Its Mechanism and Potential as Decentralized Payment Systems

Makiko Mita, Kensuke Ito, Shohei Ohsawa et al.

Our study provides a survey on how existing stablecoins-- cryptocurrencies aiming at price stabilization-- peg their value to other assets, from the perspective of Decentralized Payment Systems (DPSs). This attempt is important because there has been no preceding surveys focusing on the stablecoin as DPSs, i.e., the one aiming at not only price stabilization but also decentralization. Specifically, we first classified existing stablecoins into four types according to their collaterals (fiat, commodity, crypto, and non-collateralized) and pointed out the high potential of non-collateralized stablecoins as DPSs; then, we further classified existing non-collateralized stablecoins into two types according to their intervention layers (protocol, application) and confirmed details of their representative mechanisms. Utilizing concepts such as Quantity Theory of Money (QTM), Tobin tax, and speculative attack, our survey revealed the status quo where, despite the high potential of non-collateralized stablecoins, they have no standard mechanism to achieve the stablecoin for practical DPSs.