Reinforcement Learning (RL): a gentle introduction with a real application

Christian Hidber, bSquare

RL learns complex processes autonomously. No big data sets with the “right” answers are needed: the algorithms learn by experimenting. In 2017, RL algorithms beat the reigning Go World Champion.

The lecture intuitively shows “how” and “why” RL works. As a practical example, we apply RL to syphonic roof drainage: The choice of the “right” dimensions of trillions of possibilities is extremely difficult. Similar optimisation challenges appear in areas like dynamic pricing, game balancing, or player reward systems in non GamStop casinos. By the way, this prevents large buildings, such as airports or stadiums, from collapsing in heavy rain.

This talk shows how RL complements our existing machine learning solution, delivering a 56 per cent reduced fail rate.

Required audience experience

Basic familiarity with machine learning is helpful, but not required

Objective of the talk

• Develop an intuitive understanding of reinforcement learning
• Work with OpenAI gym
• Deploying a “bleeding edge” solution on 7,000 clients

Track 1
Location: Auditorium Date: October 15, 2018 Time: 5:25 pm – 6:10 pm Christian Hidber, bSquare Christian Hidber, bSquare