User Tools

Site Tools


deepunlearning

====== Differences ====== This shows you the differences between two versions of the page.

Link to this comparison view

deepunlearning [2013/06/30 16:23]
bmcfee [Implementation]
deepunlearning [2013/06/30 17:04] (current)
craffel
Line 1: Line 1:
-====== ​Idea ======+====== ​Deep Unlearning ​======
  
 MIR techniques rely upon accurate representations of acoustic content in order to produce high-quality results. ​ Over the past few decades, most research has operated on hand-crafted features, which work well up to a point, but may discard important information from the representation,​ thereby degrading performance. MIR techniques rely upon accurate representations of acoustic content in order to produce high-quality results. ​ Over the past few decades, most research has operated on hand-crafted features, which work well up to a point, but may discard important information from the representation,​ thereby degrading performance.
Line 17: Line 17:
  
 The model architecture is based upon the ''​convolutional_mlp.py''​ example from the DeepLearningTutorial,​ with the following modifications:​ The model architecture is based upon the ''​convolutional_mlp.py''​ example from the DeepLearningTutorial,​ with the following modifications:​
-  - The input layer operates on a short fragment of audio (~0.5s) represented as a %$64\times 40$-dimensional Mel power spectrum.+  - The input layer operates on a short fragment of audio (~0.5s) represented as a $64\times 40$-dimensional Mel power spectrum.
   - Layer 1 consists of a bank of 2-dimensional convolutional filters. ​ Each filter is convolved with the input layer, and the resulting filter responses are downsampled by spatial max-pooling.   - Layer 1 consists of a bank of 2-dimensional convolutional filters. ​ Each filter is convolved with the input layer, and the resulting filter responses are downsampled by spatial max-pooling.
   - Layer 2 consists of a linear transformation of the pooled filter responses, followed by a bank of rectified linear units   - Layer 2 consists of a linear transformation of the pooled filter responses, followed by a bank of rectified linear units
Line 33: Line 33:
   * [[https://​github.com/​Theano/​Theano|Theano]]   * [[https://​github.com/​Theano/​Theano|Theano]]
   * [[https://​github.com/​bmcfee/​librosa|LibROSA]]   * [[https://​github.com/​bmcfee/​librosa|LibROSA]]
 +
 +====== Authors ======
 +  * Brian McFee
 +  * Nicola Montecchio
 +
deepunlearning.1372623798.txt.gz ยท Last modified: 2013/06/30 16:23 by bmcfee