Expressive Communication: A Commmon Framework for Evaluating Developments in Generative Models and Steering Interfaces

In the Expressive Communication framework, Alice the composer is tasked with composing a piece of music that expresses the imagery and words of a card. She creates two pieces of music, one for each of the different systems she is comparing, which can differ on their interfaces or their models. After completing her compositions, she provides self-report answers about her experience and the compositions made with both systems:

  1. Expression: I feel confident that my composition created with System X expresses the imagery/ideas of the card I chose.
  2. Communication: I feel confident that others will be able to rank the Card I chose high after listening to the composition created with System X.
  3. Musical Coherence: The composition made with System X feels musically coherent.
  4. Ownership: Using System X, I felt the composition created was mine.
  5. Control: I felt I had control creating the composition when using System X.
  6. Efficacy: Using System X, I could find several solutions to achieve my goals for the composition.

Afterwards, Bob the listener is provided the same card and listens to both music compositions made by Alice. Bob is asked to compare these compositions along two measures:

  1. Which better evokes the imagery and words of the card?
  2. Which sounds more musical?

Composer Results

Providing composers with more steerable interfaces to the generation helped them feel more ownership, control, and efficacy in finding multiple music options that achieved their goals

Providing composers with with the more expressive model helps them feel the composition has higher musical coherence, and helped them find more viable options in the generated music which ultimately contributed to a small but significant difference in ownership and control.

Listener Results

Composers made 100+ pieces of music. Afterwards, 20 listeners made 1000+ head-to-head comparisons for the different pieces of music.

Developments in Models and Interfaces are Complimentary for Creative Empowerment and Effectiveness

Steering can Help Overcome Model Biases

We further investigated the listener comparison ratings across the 5 cards (see below for all 5 cards).

When comparing model ratings across cards, we find that biases in pretrained models help to better evoke some feelings over others (see next Figure, left). When comparing interface ratings across cards, we find that steering interfaces are more helpful when expressing feeling that are misaligned with model biases (see next Figure, right).

Music Supplement

You can play the role of the listener by making comparisons between two music samples. Samples are organized by different cards.

happy
ParticipantPerformanceRNN RadioMusicTransformer RadioMusicTransformer Steered
P10
P15
P16
P19
P30
P8
sad
ParticipantPerformanceRNN RadioMusicTransformer RadioMusicTransformer Steered
P13
P14
P17
P18
P21
P23
P24
P27
P28
P7
curious
ParticipantPerformanceRNN RadioMusicTransformer RadioMusicTransformer Steered
P11
P12
P15
P16
P17
P18
P20
P23
P26
P27
P29
P30
P31
P6
conflict
ParticipantPerformanceRNN RadioMusicTransformer RadioMusicTransformer Steered
P10
P11
P12
P13
P19
P21
P22
P25
P26
P9
fear
ParticipantPerformanceRNN RadioMusicTransformer RadioMusicTransformer Steered
P14
P22
P24
P25
P28
P29
P31
P6
P7
P8
P9