Music alignment
[Image:MusicAlignment_BeethovenFifth.png|thumb|300px|right|First theme of Symphony No. 5 by Ludwig van Beethoven in a sheet music, audio,
and piano-roll representation. The red bidirectional arrows indicate the aligned time positions of corresponding note events in the different representations.]
Music can be described and represented in many different ways including sheet music, symbolic representations, and audio recordings. For each of these representations, there may exist different versions that correspond to the same musical work. The general goal of music alignment is to automatically link the various data streams, thus interrelating the multiple information sets related to a given musical work. More precisely, music alignment is taken to mean a procedure which, for a given position in one representation of a piece of music, determines the corresponding position within another representation. In the figure on the right, such an alignment is visualized by the red bidirectional arrows. Such synchronization results form the basis for novel interfaces that allow users to access, search, and browse musical content in a convenient way.
Basic procedure
Given two different music representations, typical music alignment approaches proceed in two steps. In the first step, the two representations are transformed into sequences of suitable features. In general, such feature representations need to find a compromise between two conflicting goals. On the one hand, features should show a large degree of robustness to variations that are to be left unconsidered for the task at hand. On the other hand, features should capture enough characteristic information to accomplish the given task. For music alignment, one often uses chroma-based features, which capture harmonic and melodic characteristics of music, while being robust to changes in timbre and instrumentation, are being used.In the second step, the derived feature sequences have to be brought into correspondence. To this end, techniques related to dynamic time warping (DTW) or hidden Markov models (HMMs) are used to compute an optimal alignment between two given feature sequences.