TY - JOUR
T1 - Understanding Musical Predictions With an Embodied Interface for Musical Machine Learning
AU - Martin, Charles Patrick
AU - Glette, Kyrre
AU - Nygaard, Tønnes Frostad
AU - Torresen, Jim
N1 - Publisher Copyright:
© Copyright © 2020 Martin, Glette, Nygaard and Torresen.
PY - 2020/3/3
Y1 - 2020/3/3
N2 - Machine-learning models of music often exist outside the worlds of musical performance practice and abstracted from the physical gestures of musicians. In this work, we consider how a recurrent neural network (RNN) model of simple music gestures may be integrated into a physical instrument so that predictions are sonically and physically entwined with the performer's actions. We introduce EMPI, an embodied musical prediction interface that simplifies musical interaction and prediction to just one dimension of continuous input and output. The predictive model is a mixture density RNN trained to estimate the performer's next physical input action and the time at which this will occur. Predictions are represented sonically through synthesized audio, and physically with a motorized output indicator. We use EMPI to investigate how performers understand and exploit different predictive models to make music through a controlled study of performances with different models and levels of physical feedback. We show that while performers often favor a model trained on human-sourced data, they find different musical affordances in models trained on synthetic, and even random, data. Physical representation of predictions seemed to affect the length of performances. This work contributes new understandings of how musicians use generative ML models in real-time performance backed up by experimental evidence. We argue that a constrained musical interface can expose the affordances of embodied predictive interactions.
AB - Machine-learning models of music often exist outside the worlds of musical performance practice and abstracted from the physical gestures of musicians. In this work, we consider how a recurrent neural network (RNN) model of simple music gestures may be integrated into a physical instrument so that predictions are sonically and physically entwined with the performer's actions. We introduce EMPI, an embodied musical prediction interface that simplifies musical interaction and prediction to just one dimension of continuous input and output. The predictive model is a mixture density RNN trained to estimate the performer's next physical input action and the time at which this will occur. Predictions are represented sonically through synthesized audio, and physically with a motorized output indicator. We use EMPI to investigate how performers understand and exploit different predictive models to make music through a controlled study of performances with different models and levels of physical feedback. We show that while performers often favor a model trained on human-sourced data, they find different musical affordances in models trained on synthetic, and even random, data. Physical representation of predictions seemed to affect the length of performances. This work contributes new understandings of how musicians use generative ML models in real-time performance backed up by experimental evidence. We argue that a constrained musical interface can expose the affordances of embodied predictive interactions.
KW - creativity
KW - embodied performance
KW - interface
KW - mixture density network (MDN)
KW - musical performance
KW - predictive interaction
KW - recurrent neural network (RNN)
UR - http://www.scopus.com/inward/record.url?scp=85109111743&partnerID=8YFLogxK
U2 - 10.3389/frai.2020.00006
DO - 10.3389/frai.2020.00006
M3 - Article
SN - 2624-8212
VL - 3
JO - Frontiers in Artificial Intelligence
JF - Frontiers in Artificial Intelligence
M1 - 6
ER -