Difference between revisions of "MIR workshop 2008 notes"

From CCRMA Wiki
(Audio Fingerprinting)
(Real-world applications)
Line 92:

= Real-world applications =
+ == Audio Segmentation ==
+ [http://www.ifs.tuwien.ac.at/mir/audiosegmentation.html Automatic Audio Segmentation: Segment Boundary and Structure Detection in Popular Music]
+
== Audio Fingerprinting ==
* [http://ieeexplore.ieee.org/iel5/8561/27103/01203274.pdf?tp=&isnumber=&arnumber=1203274 P. Cano, E. Batlle, T. Kalker, and J. Haitsma, “A review of algorithms for audio fingerprinting,” in IEEE International Workshop on Multimedia Signal Processing (MMSP), pp. 169–173, December 2002. 4 pages.]

Revision as of 15:28, 14 August 2008

This page supplements the class lecture material with extra tutorials, support, references for further reading, and demonstration code snippets for those interested in a given topic. Please contribute to this growing list of resources. Do you have a great explanation of how a technique works? Found a great Java applet that illustrates a concept? Discovered a great survey of the field for a particular area? Please add it for the benefit of future students. Thanks!

I encourage you to ADD links and sections - but please do not REMOVE headings or items from the page.

Timing and Segmentation

Onset Detection

Papers

Code

Beat Extraction

Papers

Code

Tempo Extraction

Papers

Code

Feature Extraction

Low Level Features

Temporal features (Zero Crossing, Temporal centroid, Log Attack time, Attack slope), Spectral features (Centroid, Flux, RMS, Rolloff, Flatness, Kurtosis, Brightness), Spectral bands, Log spectrogram
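
As a demonstration snippet for two of the features listed above, here is a minimal sketch of the zero-crossing rate and the spectral centroid for a single frame. It assumes a mono frame given as a plain list of floats and uses a naive DFT so that it needs only the standard library; the function names are illustrative and do not come from the workshop material.

```python
import cmath
import math

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)

def magnitude_spectrum(frame):
    """Naive O(N^2) DFT magnitudes for bins 0..N/2 (only for short frames)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]

def spectral_centroid(frame, sample_rate):
    """Magnitude-weighted mean frequency of the frame, in Hz."""
    mags = magnitude_spectrum(frame)
    total = sum(mags)
    if total == 0:
        return 0.0
    n = len(frame)
    return sum(k * sample_rate / n * m for k, m in enumerate(mags)) / total

# Toy frame: 8 full cycles of a sine in 64 samples (1000 Hz at 8 kHz).
# The half-sample phase offset avoids samples that land exactly on zero.
sample_rate = 8000
n = 64
frame = [math.sin(2 * math.pi * 8 * (t + 0.5) / n) for t in range(n)]
zcr = zero_crossing_rate(frame)
centroid = spectral_centroid(frame, sample_rate)
```

In practice a real FFT routine and a window function would replace the naive DFT; the sketch only shows what the two quantities measure.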

Chroma bins

MFCC

Auditory Toolbox (code and docs)

MPEG-7

Higher-level features

Key Estimation

Chord Estimation

Genre (genre, artist ID, similarity)

"Fingerprints"

Visualizing and Sonifying Feature data

Matt Hoffman's feature sonification work

Analysis / Decision Making

Classification

Heuristic Analysis

Distance measures (Euclidean, Manhattan, etc.)
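
A minimal sketch of the two distance measures named above, assuming feature vectors are equal-length sequences of numbers. The function names are illustrative.

```python
import math

def euclidean(a, b):
    """Straight-line (L2) distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    """City-block (L1) distance: sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))

d_euclid = euclidean([0, 0], [3, 4])
d_manhattan = manhattan([0, 0], [3, 4])
```

Both measures are sensitive to the scale of each feature, so features with large numeric ranges can dominate the result unless the data is scaled first.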

k-NN
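
One way to sketch a k-nearest-neighbour classifier, assuming Euclidean distance, a majority vote, and training data given as (feature vector, label) pairs. The labels and feature values below are made up for illustration.

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Label the query by majority vote among its k nearest training points.

    train: list of (feature_vector, label) pairs.
    """
    neighbors = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]

# Hypothetical two-feature training set.
train = [
    ((0.10, 0.20), "snare"),
    ((0.20, 0.10), "snare"),
    ((0.15, 0.25), "snare"),
    ((0.90, 0.80), "kick"),
    ((0.80, 0.90), "kick"),
    ((0.85, 0.95), "kick"),
]
prediction = knn_predict(train, (0.2, 0.2), k=3)
```

`math.dist` requires Python 3.8 or later. An odd `k` avoids ties in two-class problems.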

SVM / One-class SVM

Resources

Code

Clustering and probability density models

Density distance measures (centroid distance, EMD, KL-divergence, etc.)

k-Means
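
A minimal sketch of k-means using Lloyd's iteration (assign each point to its nearest centroid, then move each centroid to the mean of its members). Taking the first k points as the initial centroids is a simplifying assumption made here to keep the example deterministic; random or smarter initialization is more common.

```python
import math

def kmeans(points, k, iterations=20):
    """Return (centroids, assignments) after a fixed number of Lloyd iterations."""
    # Assumption: naive initialization from the first k points.
    centroids = [list(p) for p in points[:k]]
    assignments = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: index of the nearest centroid for every point.
        assignments = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return centroids, assignments

points = [(0, 0), (10, 10), (0, 1), (10, 11), (1, 0), (11, 10)]
centroids, assignments = kmeans(points, k=2)
```

The result depends on the initialization, so real use typically runs several restarts and keeps the best clustering.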

Clustering

GMM

HMM

Nested classifier / Anchor-space / template-based systems

Model / Data Preparation Techniques

Data Preparation

PCA / LDA

Scaling data
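
Two common ways to scale a single feature column, sketched with the standard library only: z-score standardization and min-max scaling. Which one the class material has in mind is not stated here, so treat both as illustrative.

```python
import statistics

def standardize(column):
    """Shift to zero mean and scale to unit (population) standard deviation."""
    mean = statistics.fmean(column)
    std = statistics.pstdev(column)
    if std == 0:
        return [0.0 for _ in column]
    return [(x - mean) / std for x in column]

def min_max(column):
    """Linearly rescale values into the range [0, 1]."""
    lo, hi = min(column), max(column)
    if hi == lo:
        return [0.0 for _ in column]
    return [(x - lo) / (hi - lo) for x in column]

z = standardize([2, 4, 4, 4, 5, 5, 7, 9])
scaled = min_max([2, 4, 6])
```

The scaling parameters (mean, standard deviation, minimum, maximum) should be computed on the training data only and then reused for test data, so that no information leaks from the test set.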

Model organization

  • concept, design, data set construction and organization

Evaluation Methodology

Feature selection

Cross Validation
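
A minimal sketch of k-fold cross validation as index splitting: every sample is used for testing exactly once and for training in the remaining folds. The interleaved fold assignment is an arbitrary choice for this example; shuffling or stratifying by class is common in practice.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) for each of k folds."""
    # Interleaved folds: sample i goes to fold i % k.
    folds = [list(range(i, n_samples, k)) for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = sorted(
            idx for j, fold in enumerate(folds) if j != i for idx in fold
        )
        yield train, test

splits = list(k_fold_indices(6, 3))
```

For music data, tracks from the same artist or album are often kept in the same fold so that the classifier is not rewarded for recognizing production characteristics.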

Information Retrieval metrics (precision, recall, F-Measure)
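
The three metrics named above can be sketched for a single query as follows, assuming the retrieved and relevant items are given as collections of hashable IDs. The F-measure here is the balanced F1 (harmonic mean of precision and recall); the track IDs are made up.

```python
def precision_recall_f(retrieved, relevant):
    """Return (precision, recall, F1) for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f_measure

# Hypothetical query: 4 tracks returned, 3 tracks actually relevant, 2 in common.
p, r, f = precision_recall_f(
    retrieved=["t1", "t2", "t3", "t4"],
    relevant=["t1", "t2", "t5"],
)
```

Precision asks how much of what was returned is relevant; recall asks how much of what is relevant was returned.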

Real-world applications

Audio Segmentation

Automatic Audio Segmentation: Segment Boundary and Structure Detection in Popular Music

Audio Fingerprinting

Drum Transcription

Audio Similarity

Music Recommendation / Playlisting

Getting Involved in the MIR Community