Qualifier

Mitja Hmeljak

Underlying Theme: Music Visualization
Target exam time: 10:00 AM, Friday July 2, 2004
Place: LH101
Focus Topics
1. Auditory Perception
2. Music Cognition
3. Computer Graphics for Sci.Viz. and Virtual Reality
4. Sonification
Committee Members
- Prof. Robert Port (Cog.Sci.) - Topics 1, 2
- Prof. Andrew Hanson (CSCI) - Topics 3, 4
- Prof. Michael Gasser (CSCI) - Topics 1, 2
- Prof. Eric Isaacson (Music) - Topics 1, 2
Bibliography

Audition and Vision

A. What are the fundamental psychological differences and similarities between audition and vision?

Are there any unbridgeable differences?
Are those differences basically due to the information itself or to our neurocognitive constraints?

B. What sort of goals should a music visualization system try to achieve?

Is there a reason to expect that music, an essentially temporal and auditory phenomenon should be amenable to visualization?
What advantages could be expected if it could be done?
Might visualization of music have any consequences for listener/lookers once they are accustomed to such a transformation?

A: Features of auditory vs. visual perception

temporal characteristics, or changes over time

human hearing well designed to discriminate between periodic and aperiodic events and can detect small changes in the frequency of continuous signals -> advantage of auditory over visual displays:

fast-changing or transient data that might be blurred or completely missed by visual displays may be easily detectable in even a primitive, but well-designed auditory display

auditory perception advantages:

detection of complex temporal data
data embedded in other, more static, signals

visual vs. auditory spatial field

visual perception disadvantages:

visual attention may be diverted from a particular location: visual events may be missed by being outside of visual range -> typical use of auditory monitoring and alarm applications

visual perception advantages:

information can be visually 'stacked' in space instead of being 'stacked' in time (which relies on visual memory)
use of 'small multiples' for multivariant data:

once the viewer learns one presentation of data, the same presentation can be repeated for additional data: visual detection of differences in pattern
avoids recency bias (i.e. giving more relevance to the most recent sensory input)

visual information can be augmented by integrating image and explanation (e.g. words/numbers/markers) and multiple cues without masking effects - much more problematic with auditory information
quantitative visual display:

(with or without scales, grids, etc) direct quantitative comparison of complex data

B: music visualization:

music notation is music visualization

advantages of music visualization:

music occurs in time: to perceive its structure, transformation into spatial/visual
simultaneous presentation of past / future events
several aspects of a composition can be presented in a single (static) visual display

Techniques in Visualizing Music

Summarize techniques found in your readings for visualizing music in

(a) audio (i.e., derived from a performance) and
(b) symbolic (i.e., derived from the score) form.

What are the relative benefits of each?
What are their weak points?

Symbolic visualizations:

Brinkman/Mesiti:

graphic representations of musical structure
extracted from printed score or equivalent information
note plots

time,pitch -> x,y
utilization of acoustic space

part plots

include lines i.e. pitches connected in explicit parts
rhythmic structure visible

analytic abstractions

space plots: use of total pitch space
band plots: high+low+average of used pitches
aggregate plots: use of pitch classes

dynamic shape

-> audio

etc.

Malinovski: Music Animation Machine

Bach Trio Sonata IV, 3rd movement: color used to separate parts:

Beethoven Bagatelle, op.33, no.5 (video) : color used to demarcate different types of textural material
Brahms Capriccio, opus 76, no.2 (video): color used to demarcate closeness to tonality:

Hiraga/Fujishiro, Smith/Williams, etc:

as above, but in 3D:
structure?

Symbolic vs Audio signal visualization:

Self-similarity (Foote/Cooper)

short time scale:
long time scale:
demo

Visual Display of Rhythm

You are designing a system that will display visually the RHYTHM of an unfamiliar piece of music in real time.
The system should represent how a listener's sense of the rhythmic structure emerges and evolves in time.
You can base it on any theory of rhythm, for example, that of Lerdahl and Jackendoff (though they won't help with how rhythm emerges in real time).

Discuss both representation and implementation issues for the system.

rhytmic structure

non-overlapping

with exceptions: units at any level can overlap (possibly but not commonly including in the overlap entire units at lower levels) - but this is the only exception to an otherwise non-overlapping structure

recursive

allows similar grouping structures to exist at any hierarchical level

adjacency

nonadjacent units can not be grouped together at any level

metrical structure (as from GTTM)

phenomenal accent := any event at the musical surface causing emphasis or stress to a moment in the musical flow (attack points, local stresses, sudden changes in dynamics, timbre, leaps in pitch, etc...)
structural accents := any accent caused by melodic or harmonic points of gravity, i.e. cadence, tonal motion

metric accent := any beat relatively strong in its metrical context

metric accents:

created by phenomenal accents, which provide cues with their regularity, or create ambiguities or syncopation by an absence of regularity

mental contruct inferred from phenomenal accents on the musical surface

Interaction of grouping and metrical structure

(associational structure := "web" of motivic (timbral, etc) associations between potentially disjoint groups, not part of the rhytmic structure of a musical piece)

Beat spectrum visualization (from Foote)

non real-time example: rhythmic structure of a pop tune, used in a music video

Procedural Animation and Sonification

(1) give a detailed example of a non-trivial procedural animation, and then
(2) indicate what methods, algorithms, and heuristics you would use to produce an appropriate sonification of the animation.

Procedural Animation

particle set animations

generate new particles if necessary
assign new particles initial attributes

initial position, speed, direction, size, transparency
shape
lifetime

discard exhausted particles
move all current particles according to their scripts
render all current particles

example: sphere eversion by principle of energy minimization

computations for energy minimization:

only computed the second half of the eversio

starts with a surface with the special 4-fold symmetry of the halfway model

perturbation: following path towards lesser energy

creation/destruction of triangles:

define facet attribute flat integer; lfac:=.85; tfac:=.9;

flat_tri := {

   set facet ff flat max(ff.edge,dihedral)<.18;

   delete edges ee where length<.1/tfac and min(ee.face,flat);

   refine edges where length>.7*lfac and dihedral>.5;

};

trian := { n lfac; u; l lfac; u; g; w .001/tfac; t .05/tfac; flat_tri; u; g; V; u; };

parambience/sonification

mapping data to specific dimensions of sound:

parallel listening (ability to monitor and process multiple auditory data sets)
rapid detection (especially in high-stress environments)
affective response (ease of learning and high engagement qualities)
auditory gestalt formation (discerning relationships or trends in data streams)
real-time visualization+sonification (video)
user interaction (video)
non real-time rendering example (video)

Last updated: July 2, 2004

mitja (at) indiana . edu