<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml"> <head> <meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> <title>4.3. Decomposing signals in components (matrix factorization problems) — scikits.learn v0.6.0 documentation</title> <link rel="stylesheet" href="../_static/nature.css" type="text/css" /> <link rel="stylesheet" href="../_static/pygments.css" type="text/css" /> <script type="text/javascript"> var DOCUMENTATION_OPTIONS = { URL_ROOT: '../', VERSION: '0.6.0', COLLAPSE_INDEX: false, FILE_SUFFIX: '.html', HAS_SOURCE: true }; </script> <script type="text/javascript" src="../_static/jquery.js"></script> <script type="text/javascript" src="../_static/underscore.js"></script> <script type="text/javascript" src="../_static/doctools.js"></script> <link rel="shortcut icon" href="../_static/favicon.ico"/> <link rel="author" title="About these documents" href="../about.html" /> <link rel="top" title="scikits.learn v0.6.0 documentation" href="../index.html" /> <link rel="up" title="4. Unsupervised learning" href="../unsupervised_learning.html" /> <link rel="next" title="5. Model Selection" href="../model_selection.html" /> <link rel="prev" title="4.2. Clustering" href="clustering.html" /> </head> <body> <div class="header-wrapper"> <div class="header"> <p class="logo"><a href="../index.html"> <img src="../_static/scikit-learn-logo-small.png" alt="Logo"/> </a> </p><div class="navbar"> <ul> <li><a href="../install.html">Download</a></li> <li><a href="../support.html">Support</a></li> <li><a href="../user_guide.html">User Guide</a></li> <li><a href="../auto_examples/index.html">Examples</a></li> <li><a href="../developers/index.html">Development</a></li> </ul> <div class="search_form"> <div id="cse" style="width: 100%;"></div> <script src="http://www.google.com/jsapi" type="text/javascript"></script> <script type="text/javascript"> google.load('search', '1', {language : 'en'}); google.setOnLoadCallback(function() { var customSearchControl = new google.search.CustomSearchControl('016639176250731907682:tjtqbvtvij0'); customSearchControl.setResultSetSize(google.search.Search.FILTERED_CSE_RESULTSET); var options = new google.search.DrawOptions(); options.setAutoComplete(true); customSearchControl.draw('cse', options); }, true); </script> </div> </div> <!-- end navbar --></div> </div> <div class="content-wrapper"> <!-- <div id="blue_tile"></div> --> <div class="sphinxsidebar"> <div class="rel"> <a href="clustering.html" title="4.2. Clustering" accesskey="P">previous</a> | <a href="../model_selection.html" title="5. Model Selection" accesskey="N">next</a> | <a href="../genindex.html" title="General Index" accesskey="I">index</a> </div> <h3>Contents</h3> <ul> <li><a class="reference internal" href="#">4.3. Decomposing signals in components (matrix factorization problems)</a><ul> <li><a class="reference internal" href="#principal-component-analysis-pca">4.3.1. Principal component analysis (PCA)</a></li> <li><a class="reference internal" href="#independent-component-analysis-ica">4.3.2. Independent component analysis (ICA)</a></li> </ul> </li> </ul> </div> <div class="content"> <div class="documentwrapper"> <div class="bodywrapper"> <div class="body"> <div class="section" id="decomposing-signals-in-components-matrix-factorization-problems"> <h1>4.3. Decomposing signals in components (matrix factorization problems)<a class="headerlink" href="#decomposing-signals-in-components-matrix-factorization-problems" title="Permalink to this headline">¶</a></h1> <div class="section" id="principal-component-analysis-pca"> <span id="pca"></span><h2>4.3.1. Principal component analysis (PCA)<a class="headerlink" href="#principal-component-analysis-pca" title="Permalink to this headline">¶</a></h2> <p>PCA is used to decompose a multivariate dataset in a set of successive orthogonal components that explain a maximum amount of the variance. In the scikit-learn, <tt class="xref py py-class docutils literal"><span class="pre">PCA</span></tt> is implemented as a <cite>transformer</cite> object that learns n components in its <cite>fit</cite> method, and can be used on new data to project it on these components.</p> <p>In addition, the <tt class="xref py py-class docutils literal"><span class="pre">ProbabilisticPCA</span></tt> object provides a probabilistic interpretation of the PCA that can give a likelihood of data based on the amount of variance it explains. As such it implements a <cite>score</cite> method that can be used in cross-validation.</p> <p>Below is an example of the iris dataset, which is comprised of 4 features, projected on the 2 dimensions that explain most variance:</p> <div class="figure align-center"> <a class="reference external image-reference" href="../auto_examples/plot_pca.html"><img alt="auto_examples/images/plot_pca.png" src="auto_examples/images/plot_pca.png" /></a> </div> <div class="topic"> <p class="topic-title first">Examples:</p> <ul class="simple"> <li><a class="reference internal" href="../auto_examples/plot_pca.html#example-plot-pca-py"><em>PCA 2d projection of of Iris dataset</em></a></li> </ul> </div> </div> <div class="section" id="independent-component-analysis-ica"> <span id="ica"></span><h2>4.3.2. Independent component analysis (ICA)<a class="headerlink" href="#independent-component-analysis-ica" title="Permalink to this headline">¶</a></h2> <p>ICA finds components that are maximally independent. It is classically used to separate mixed signals (a problem know as <em>blind source separation</em>), as in the example below:</p> <div class="figure align-center"> <a class="reference external image-reference" href="../auto_examples/plot_ica_blind_source_separation.html"><img alt="auto_examples/images/plot_ica_blind_source_separation.png" src="auto_examples/images/plot_ica_blind_source_separation.png" /></a> </div> <div class="topic"> <p class="topic-title first">Examples:</p> <ul class="simple"> <li><a class="reference internal" href="../auto_examples/plot_ica_blind_source_separation.html#example-plot-ica-blind-source-separation-py"><em>Blind source separation using FastICA</em></a></li> <li><a class="reference internal" href="../auto_examples/plot_ica_vs_pca.html#example-plot-ica-vs-pca-py"><em>FastICA on 2D point clouds</em></a></li> </ul> </div> </div> </div> </div> </div> </div> <div class="clearer"></div> </div> </div> <div class="footer"> <p style="text-align: center">This documentation is relative to scikits.learn version 0.6.0<p> © 2010, scikits.learn developers (BSD Lincense). Created using <a href="http://sphinx.pocoo.org/">Sphinx</a> 1.0.5. Design by <a href="http://webylimonada.com">Web y Limonada</a>. </div> </body> </html>