a musical score or an audio recording), the main idea of chroma features is to aggregate for a given local time window (e.g. Assuming the equal-tempered scale, one considers twelve chroma values represented by the setĬonsisting of all pitches separated by an integer number of octaves.
Based on this observation, a pitch can be separated into two components, which are referred to as tone height and chroma. The underlying observation is that humans perceive two musical pitches as similar in color if they differ by an octave.