<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Mirror Image</title>
	<atom:link href="http://mirror2image.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://mirror2image.wordpress.com</link>
	<description>Mostly AR and Stuff</description>
	<lastBuildDate>Sat, 28 Jan 2012 09:50:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='mirror2image.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Mirror Image</title>
		<link>http://mirror2image.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://mirror2image.wordpress.com/osd.xml" title="Mirror Image" />
	<atom:link rel='hub' href='http://mirror2image.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Total Variation in Image Processing and classical Action</title>
		<link>http://mirror2image.wordpress.com/2012/01/14/total-variation-in-image-processing-and-classical-action/</link>
		<comments>http://mirror2image.wordpress.com/2012/01/14/total-variation-in-image-processing-and-classical-action/#comments</comments>
		<pubDate>Sat, 14 Jan 2012 10:00:28 +0000</pubDate>
		<dc:creator>mirror2image</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://mirror2image.wordpress.com/?p=960</guid>
		<description><![CDATA[This post is inspired by Extremal Principles in Classical, Statistical and Quantum Mechanics in Azimuth blog. Total Variation used a lot in image processing. Image denoising, optical flow, depth maps processing. The standard form of Total Variation f or norm is minimizing &#8220;energy&#8221; of the form (I&#8217;m talking about Total Variaton- for now, not ) [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=960&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This post is inspired by <a href="http://johncarlosbaez.wordpress.com/2012/01/13/extremal-principles-in-classical-statistical-and-quantum-mechanics/" />Extremal Principles in Classical, Statistical and Quantum Mechanics</a> in <a href="http://johncarlosbaez.wordpress.com/">Azimuth</a> blog.<br />
<a href="http://en.wikipedia.org/wiki/Total_variation">Total Variation</a> used a lot in image processing. <a href="http://en.wikipedia.org/wiki/Total_variation_denoising">Image denoising</a>, optical flow, depth maps processing. The standard form of Total Variation f or <img src='http://s0.wp.com/latex.php?latex=L_2&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='L_2' title='L_2' class='latex' /> norm is minimizing &#8220;energy&#8221; of the form<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cint_%5COmega+f%28u%28x%29%29+%2B+%28%5Cnabla+u%28x%29%29%5E2+dx&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;int_&#92;Omega f(u(x)) + (&#92;nabla u(x))^2 dx' title='&#92;int_&#92;Omega f(u(x)) + (&#92;nabla u(x))^2 dx' class='latex' /><br />
(I&#8217;m talking about Total Variaton-<img src='http://s0.wp.com/latex.php?latex=L_2&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='L_2' title='L_2' class='latex' /> for now, not <img src='http://s0.wp.com/latex.php?latex=L_1&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='L_1' title='L_1' class='latex' />) over all functions <img src='http://s0.wp.com/latex.php?latex=u%3A%5C+%5COmega+%5Crightarrow+R%5En&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='u:&#92; &#92;Omega &#92;rightarrow R^n' title='u:&#92; &#92;Omega &#92;rightarrow R^n' class='latex' /><br />
In case of image denoising it would be<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cint_%5COmega+%28v%28x%29-u%28x%29%29%5E2+%2B+%28%5Cmu%2F2%29%28%5Cnabla+u%28x%29%29%5E2+dx&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;int_&#92;Omega (v(x)-u(x))^2 + (&#92;mu/2)(&#92;nabla u(x))^2 dx' title='&#92;int_&#92;Omega (v(x)-u(x))^2 + (&#92;mu/2)(&#92;nabla u(x))^2 dx' class='latex' /><br />
where <img src='http://s0.wp.com/latex.php?latex=v&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='v' title='v' class='latex' />  is original image and <img src='http://s0.wp.com/latex.php?latex=u&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='u' title='u' class='latex' /> is denoised image<br />
Part <img src='http://s0.wp.com/latex.php?latex=f%28u%28x%29%29&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='f(u(x))' title='f(u(x))' class='latex' /> is called &#8220;fidelity term&#8221; and <img src='http://s0.wp.com/latex.php?latex=%28%5Cnabla+u%28x%29%29%5E2&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='(&#92;nabla u(x))^2' title='(&#92;nabla u(x))^2' class='latex' /> is &#8220;regularizer&#8221;<br />
Regularizer part is to provide smoothness of solution and fidelity term is to force smooth solution to resemble original image (that is in case of image denoising)<br />
Now if we return to classical <a href="http://en.wikipedia.org/wiki/Action_%28physics%29">Action</a>, movement of the point is defined by the minimum of functional<br />
<img src='http://s0.wp.com/latex.php?latex=A%28u%29+%3D+%5Cint+K%28u%28t%29%29+-+V%28u%28t%29%29+dt&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='A(u) = &#92;int K(u(t)) - V(u(t)) dt' title='A(u) = &#92;int K(u(t)) - V(u(t)) dt' class='latex' />, over trajectories <img src='http://s0.wp.com/latex.php?latex=u%28t%29&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='u(t)' title='u(t)' class='latex' /> where <img src='http://s0.wp.com/latex.php?latex=K&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='K' title='K' class='latex' /> is kinetic energy and <img src='http://s0.wp.com/latex.php?latex=V&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='V' title='V' class='latex' /> is potential energy, or<br />
<img src='http://s0.wp.com/latex.php?latex=A%28u%29+%3D+%5Cint+f%28u%28t%29%29+%2B+%28m%2F2%29%28%5Cnabla+u%28t%29%29%5E2+dt%2C+%5C+f%3D-V&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='A(u) = &#92;int f(u(t)) + (m/2)(&#92;nabla u(t))^2 dt, &#92; f=-V' title='A(u) = &#92;int f(u(t)) + (m/2)(&#92;nabla u(t))^2 dt, &#92; f=-V' class='latex' /><br />
<i>One-dimensional <img src='http://s0.wp.com/latex.php?latex=L_2&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='L_2' title='L_2' class='latex' /> total variation for image denoising is the same as classical mechanics of the particle, with potential energy defined by iamge and smoothness of denoised image as kinetic energy!</i> For optical flow potential energy is differences between tranformed first image and the second<br />
<img src='http://s0.wp.com/latex.php?latex=%28I_0%28u%28x%29%29+-+I_1%28x%29%29%5E2&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='(I_0(u(x)) - I_1(x))^2' title='(I_0(u(x)) - I_1(x))^2' class='latex' /> and kinetic energy is the smoothness of the optical flow.<br />
Of cause the strict equality hold only for one-dimentional image and <img src='http://s0.wp.com/latex.php?latex=L_2&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='L_2' title='L_2' class='latex' />, and potential energy is quite strange &#8211; it depend not on coordinate but on velocity, like some kind of friction.<br />
While it hold some practical meaning, most of practical task have two or more dimensional image and <img src='http://s0.wp.com/latex.php?latex=L_1&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='L_1' title='L_1' class='latex' /> or <img src='http://s0.wp.com/latex.php?latex=L_p&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='L_p' title='L_p' class='latex' /> regulariser. So in term of classical mechanics we have movement in multidimensional time with non-classical kinetic energy<br />
<img src='http://s0.wp.com/latex.php?latex=K+%3D+%5Cmu%2F2+%5Csqrt%7Bv%5E2%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='K = &#92;mu/2 &#92;sqrt{v^2}' title='K = &#92;mu/2 &#92;sqrt{v^2}' class='latex' /><br />
which has uncanny resemblance to <a href="http://en.wikipedia.org/wiki/Action_%28physics%29#Action_principle_for_single_relativistic_particle">Lagrangian of relativistic particle</a><br />
 <img src='http://s0.wp.com/latex.php?latex=-m+c%5E2%5Csqrt%7B1-v%5E2%2Fc%5E2%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='-m c^2&#92;sqrt{1-v^2/c^2}' title='-m c^2&#92;sqrt{1-v^2/c^2}' class='latex' /><br />
<i>So total variation in image processing is equivalent to physics of non-classical movement with multidimensional time, in the field with potential energy defined by image.</i> I have no idea what does it signify, but it sounds cool <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  . <a href="http://en.wikipedia.org/wiki/Holographic_principle">Holographic principle</a>? May be crowd from <a href="http://johncarlosbaez.wordpress.com/">Azimuth</a> or <a href="http://golem.ph.utexas.edu/category/" />n-category cafe</a> will give some explanation eventually&#8230;<br />
And another, related question: regularizer in Total Variation. There is inherent connection between regularizers and Bayesian priors. What TV-L1 regularizer mean from Bayesian statistics point of view?</p>
<p>PS I&#8217;m posting mostly on <a href="https://plus.google.com/u/1/?tab=wX#117183349381733551173/posts">my google plus</a> now, so this blog is a small part of my posts.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mirror2image.wordpress.com/960/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mirror2image.wordpress.com/960/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mirror2image.wordpress.com/960/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mirror2image.wordpress.com/960/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mirror2image.wordpress.com/960/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mirror2image.wordpress.com/960/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mirror2image.wordpress.com/960/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mirror2image.wordpress.com/960/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mirror2image.wordpress.com/960/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mirror2image.wordpress.com/960/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mirror2image.wordpress.com/960/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mirror2image.wordpress.com/960/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mirror2image.wordpress.com/960/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mirror2image.wordpress.com/960/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=960&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mirror2image.wordpress.com/2012/01/14/total-variation-in-image-processing-and-classical-action/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0e52b3782501acf1b61c02132fe8275e?s=96&#38;d=identicon" medium="image">
			<media:title type="html">mirror2image</media:title>
		</media:content>
	</item>
		<item>
		<title>Samsung SARI 1.5 Augmented Reality SDK is out in the wild</title>
		<link>http://mirror2image.wordpress.com/2011/11/14/samsung-sari-1-5-augmented-reality-sdk-is-out-in-the-wild/</link>
		<comments>http://mirror2image.wordpress.com/2011/11/14/samsung-sari-1-5-augmented-reality-sdk-is-out-in-the-wild/#comments</comments>
		<pubDate>Mon, 14 Nov 2011 17:14:11 +0000</pubDate>
		<dc:creator>mirror2image</dc:creator>
				<category><![CDATA[Augmented Reality]]></category>
		<category><![CDATA[Coding AR]]></category>
		<category><![CDATA[computer vision]]></category>
		<category><![CDATA[Demo]]></category>
		<category><![CDATA[Games]]></category>
		<category><![CDATA[mobile games]]></category>
		<category><![CDATA[AR Edibear]]></category>
		<category><![CDATA[feature detection]]></category>
		<category><![CDATA[fun]]></category>
		<category><![CDATA[Markerless]]></category>
		<category><![CDATA[Markerless tracking]]></category>
		<category><![CDATA[Mobile]]></category>
		<category><![CDATA[pose estimation]]></category>

		<guid isPermaLink="false">http://mirror2image.wordpress.com/?p=956</guid>
		<description><![CDATA[Something I did for Samsung (kernel of tracker). Biggest improvement in SARI 1.5 is the sensors fusion, which allow for a lot more robust tracking. Here is example of run-time localization and mapping with SARI 1.5: This is the AR EdiBear game (free in Samsung apps store) Contact Eduard Oks if u Android dev and [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=956&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Something I did for Samsung (kernel of tracker). Biggest improvement in SARI 1.5 is the sensors fusion, which allow for a lot more robust tracking.<br />
Here is example of run-time localization and mapping with SARI 1.5:</p>
<p><object width="500" height="375"><param name="movie" value="http://www.youtube.com/v/3vyooE24Mh4?version=3&#038;feature=oembed"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/3vyooE24Mh4?version=3&#038;feature=oembed" type="application/x-shockwave-flash" width="500" height="375" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>This is the AR EdiBear game (<a href="http://www.samsungapps.com/topApps/topAppsDetail.as?productId=G00002052886">free in Samsung apps store</a>)</p>
<p>Contact Eduard Oks if u Android dev and want to test SARI 1.5 . You have to understand matrices OpenGL to use it. </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mirror2image.wordpress.com/956/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mirror2image.wordpress.com/956/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mirror2image.wordpress.com/956/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mirror2image.wordpress.com/956/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mirror2image.wordpress.com/956/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mirror2image.wordpress.com/956/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mirror2image.wordpress.com/956/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mirror2image.wordpress.com/956/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mirror2image.wordpress.com/956/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mirror2image.wordpress.com/956/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mirror2image.wordpress.com/956/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mirror2image.wordpress.com/956/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mirror2image.wordpress.com/956/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mirror2image.wordpress.com/956/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=956&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mirror2image.wordpress.com/2011/11/14/samsung-sari-1-5-augmented-reality-sdk-is-out-in-the-wild/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0e52b3782501acf1b61c02132fe8275e?s=96&#38;d=identicon" medium="image">
			<media:title type="html">mirror2image</media:title>
		</media:content>
	</item>
		<item>
		<title>Interior point method &#8211; if all you have is a hammer</title>
		<link>http://mirror2image.wordpress.com/2011/09/04/interior-point-method-if-all-you-have-is-a-hammer/</link>
		<comments>http://mirror2image.wordpress.com/2011/09/04/interior-point-method-if-all-you-have-is-a-hammer/#comments</comments>
		<pubDate>Sun, 04 Sep 2011 06:05:04 +0000</pubDate>
		<dc:creator>mirror2image</dc:creator>
				<category><![CDATA[computer vision]]></category>
		<category><![CDATA[sci]]></category>
		<category><![CDATA[Interior point method]]></category>
		<category><![CDATA[Math]]></category>
		<category><![CDATA[nonlinear optimization]]></category>

		<guid isPermaLink="false">http://mirror2image.wordpress.com/?p=920</guid>
		<description><![CDATA[Interior point method for nonlinear optimization often considered as complex, or highly nontrivial etc. The fact is, that for &#8220;simple&#8221; nonlinear optimization it&#8217;s quite simple, manageable and can even be explained in #3tweets. For those not familiar with it there is a simple introduction to it in wikipedia, which in turn follows an excellent paper [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=920&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Interior point method for nonlinear optimization often considered as complex, or highly nontrivial etc. The fact is, that for &#8220;simple&#8221; nonlinear optimization it&#8217;s quite simple, manageable and can even be explained in #3tweets. For those not familiar with it there is <a href="http://en.wikipedia.org/wiki/Interior_point_method#Primal-dual_interior_point_method_for_nonlinear_optimization">a simple introduction to it in wikipedia</a>, which in turn follows an <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.97.8349">excellent paper by Margaret H. Wright</a>.<br />
Now about &#8220;if all you have is a hammer, everything looks like a nail&#8221;. Some of applications of interior point method could be quite unexpected.<br />
 Everyone who worked with <a href="http://en.wikipedia.org/wiki/Levenberg-Marquardt_algorithm">Levenberg-Marquardt minimization algorithm</a> know how much pain is the choice of the small parameter <img src='http://s0.wp.com/latex.php?latex=%5Clambda+&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;lambda ' title='&#92;lambda ' class='latex' /> . Levenberg-Marquardt can also be seen as modification of <a href="http://en.wikipedia.org/wiki/Gauss-Newton">Gauss-newton</a> with a <a href="http://en.wikipedia.org/wiki/Trust_region">trust region</a>. The <img src='http://s0.wp.com/latex.php?latex=%5Clambda+&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;lambda ' title='&#92;lambda ' class='latex' /> of  Levenberg-Marquardt do correspond to the trust region radius, but that dependence is highly complex and is difficult to estimate. You want trust region of the radius <em>r</em>, but what should be avlue of <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' />? There is no easy answer to that question; there are some complex methods, or there is a testing with subdivision, which is what the original Levenberg-Marquardt implement.<br />
Interior point can help here.<br />
If we choose shape of trust region for Gauss-Newton as hypercube or simplex or like, we can formulate it as set of <a href="http://en.wikipedia.org/wiki/L1_norm">L1 norm</a> inequality constrains. And that is the domain of interior point method! For hypercube <img src='http://s0.wp.com/latex.php?latex=%7C%7C%5CDelta+x%7C%7C_1+%5Cleq+%5Cepsilon&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='||&#92;Delta x||_1 &#92;leq &#92;epsilon' title='||&#92;Delta x||_1 &#92;leq &#92;epsilon' class='latex' /> the resulting equations looks especially nice<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cbegin%7Bpmatrix%7D+W+%26+-I+%26+I+%5C%5C+-I+%26+diag+%26+0+%5C%5C+I+%26+0+%26+diag+%5Cend%7Bpmatrix%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;begin{pmatrix} W &amp; -I &amp; I &#92;&#92; -I &amp; diag &amp; 0 &#92;&#92; I &amp; 0 &amp; diag &#92;end{pmatrix}' title='&#92;begin{pmatrix} W &amp; -I &amp; I &#92;&#92; -I &amp; diag &amp; 0 &#92;&#92; I &amp; 0 &amp; diag &#92;end{pmatrix}' class='latex' /><br />
<em>W</em> &#8211; hessian, <em>I</em> &#8211; identity, <em>diag</em> &#8211; diagonal<br />
That is a <em>banded arrowhead matrix</em>, and for it <a href="http://en.wikipedia.org/wiki/Cholesky_decomposition">Cholesky decomposition</a> cost insignificantly more than decomposition of original <em>W</em>. The matrix is not positive definite &#8211; Cholesky without square root should be used.<br />
Now there is a temptation to use single constrain <img src='http://s0.wp.com/latex.php?latex=%7C%7C%5CDelta+x%7C%7C_2+%5Cleq+%5Cepsilon&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='||&#92;Delta x||_2 &#92;leq &#92;epsilon' title='||&#92;Delta x||_2 &#92;leq &#92;epsilon' class='latex' /> instead of set of constrain <img src='http://s0.wp.com/latex.php?latex=%7C%7C%5CDelta+x%7C%7C_1+%5Cleq+%5Cepsilon&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='||&#92;Delta x||_1 &#92;leq &#92;epsilon' title='||&#92;Delta x||_1 &#92;leq &#92;epsilon' class='latex' />, but that will not work. <img src='http://s0.wp.com/latex.php?latex=%7C%7C%5CDelta+x%7C%7C_2+%5Cleq+%5Cepsilon&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='||&#92;Delta x||_2 &#92;leq &#92;epsilon' title='||&#92;Delta x||_2 &#92;leq &#92;epsilon' class='latex' /> should have to be linearized to be tractable, but it&#8217;s a second order condition &#8211; it&#8217;s linear part is zero, so linearization doesn&#8217;t constrain anything.<br />
The same method could be used whenever we have to put constrain on the  value of Gauss-Newton update, and shape of constrain in not important (or polygonal)<br />
Now last touch &#8211; Interior point method has small parameter of it&#8217;s own. It&#8217;s called usually <img src='http://s0.wp.com/latex.php?latex=%5Cmu&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;mu' title='&#92;mu' class='latex' /> . In the &#8220;normal&#8221; method there is a nice rule for it update &#8211; take it as <img src='http://s0.wp.com/latex.php?latex=%5Cmu+%3D+%5Clambda+C+&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;mu = &#92;lambda C ' title='&#92;mu = &#92;lambda C ' class='latex' /> (in the notation from  <a href="http://en.wikipedia.org/wiki/Interior_point_method#Primal-dual_interior_point_method_for_nonlinear_optimization">wikipedia article</a> &#8211; <img src='http://s0.wp.com/latex.php?latex=C&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='C' title='C' class='latex' /> is a value of constraint, <img src='http://s0.wp.com/latex.php?latex=%5Clambda&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;lambda' title='&#92;lambda' class='latex' /> is a value of slack variable at the previous iteration) That rule usually explicitly stated in the articles about Interior Point Method(IPM) for Linear Programming, but omitted (as obvious probably) in the papers about IPM for nonlinear optimization<br />
In our case (IPM for trust region) we don&#8217;t need update <img src='http://s0.wp.com/latex.php?latex=%5Cmu&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;mu' title='&#92;mu' class='latex' /> at all &#8211; we move boundary of the region with each iteration, so each <img src='http://s0.wp.com/latex.php?latex=%5Cmu&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;mu' title='&#92;mu' class='latex' /> is an initial value. Have to remember, <img src='http://s0.wp.com/latex.php?latex=%5Cmu&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;mu' title='&#92;mu' class='latex' /> is not a size of trust region, but strength of it&#8217;s enforcement.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mirror2image.wordpress.com/920/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mirror2image.wordpress.com/920/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mirror2image.wordpress.com/920/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mirror2image.wordpress.com/920/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mirror2image.wordpress.com/920/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mirror2image.wordpress.com/920/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mirror2image.wordpress.com/920/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mirror2image.wordpress.com/920/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mirror2image.wordpress.com/920/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mirror2image.wordpress.com/920/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mirror2image.wordpress.com/920/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mirror2image.wordpress.com/920/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mirror2image.wordpress.com/920/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mirror2image.wordpress.com/920/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=920&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mirror2image.wordpress.com/2011/09/04/interior-point-method-if-all-you-have-is-a-hammer/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0e52b3782501acf1b61c02132fe8275e?s=96&#38;d=identicon" medium="image">
			<media:title type="html">mirror2image</media:title>
		</media:content>
	</item>
		<item>
		<title>Effectivness of compressed sensing in image processing and other stuff</title>
		<link>http://mirror2image.wordpress.com/2011/08/06/effectivness-of-compressed-sensing-in-image-processing-and-other-staff/</link>
		<comments>http://mirror2image.wordpress.com/2011/08/06/effectivness-of-compressed-sensing-in-image-processing-and-other-staff/#comments</comments>
		<pubDate>Sat, 06 Aug 2011 11:14:54 +0000</pubDate>
		<dc:creator>mirror2image</dc:creator>
				<category><![CDATA[computer vision]]></category>
		<category><![CDATA[3d imaging]]></category>
		<category><![CDATA[compressed sensing]]></category>
		<category><![CDATA[image processing]]></category>
		<category><![CDATA[Math]]></category>

		<guid isPermaLink="false">http://mirror2image.wordpress.com/?p=909</guid>
		<description><![CDATA[I seldom post in the this blog now, mostly because I&#8217;m positing on twitter and G+ a lot lately. I still haven&#8217;t figured out which post should go where &#8211; blog, G+ or twitter, so it&#8217;s kind of chaotic for now. What of interest is going on: There are two paper on the CVPR11 which [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=909&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I seldom post in the this blog now, mostly because I&#8217;m positing on <a href="http://twitter.com/sergecell" title="twitter">twitter</a> and <a href="https://plus.google.com/u/1/?hl=en&amp;tab=wX#117183349381733551173/posts" title="G+">G+</a> a lot lately. I still haven&#8217;t figured out which post should go where &#8211; blog, G+ or twitter, so it&#8217;s kind of chaotic for now.<br />
What of  interest is going on: There are two paper on the <a href="http://www.cvpapers.com/cvpr2011.html" title="CVPR'11">CVPR11</a> which claim that <a href="http://en.wikipedia.org/wiki/Sparse_recovery">compressed sensing(sparse recovery)</a> is not applicable to some of the most important computer vision tasks:<br />
<a href="http://users.cecs.anu.edu.au/~qshi/pub/face_cvpr11.pdf">Is face recognition really a Compressive Sensing problem?</a><br />
<a href="http://cvlab.epfl.ch/%7Elepetit/papers/rigamonti_cvpr11.pdf">Are Sparse Representations Really Relevant for Image Classification?</a><br />
Both paper claim that space of the natural images(or their important subsets) are not really sparse.<br />
Those claims however dont&#8217;t square with claim of high effectiveness of <a href="cvlab.epfl.ch/~calonder/CalonderLFKMB09.pdf">compact signature of Random Ferns</a>.<br />
Could both of those be true? In my opinion &#8211; yes. Difference of two approaches is that first two paper assumed <em>explicit</em> sparsity &#8211; that is they enforced sparsity on the feature vector. Compressed signature approach used <em>implicit</em> sparsity  &#8211; feature vector underling the signature is assumed sparse but is not explicitly reconstructed.  Why compressed signature is working while explicit approach didn&#8217;t? That could be the case if image space is sparse in the different coordinate system &#8211; that is here one is dealing with the <em>union of subspaces</em>. Assumption not of the simple sparsity, but of the union of subspaces is called <a href="http://arxiv.org/abs/1002.2586">blind compressed sensing</a>.<br />
Now if we look at the space of the natural images it&#8217;s easy to see why it is low dimensional. Natural image is the image of some scene, an that scene has limited number of moving object. So dimension of space images of the scene is approximately the sum of degree of freedom of the objects(and camera) of the scene, plus effects of occlusions, illumination and noise. Now  if the add strong enough random error to the scene, the image is stop to be the natural image(that is image of any scene). That mean manifold of the images of the scene is isolated &#8211; there is no natural images in it&#8217;s neighborhood. That hint that up to some error the space of the natural images is at least is <em>the union of isolated low-dimensional manifolds</em>. The union of mainfolds is obviously is more complex structure than the union of subspace, but methods of blind compressed sensing could be applicable to it too. Of cause to think about union of manifolds could be necessary only if the space of images is not union of subspace, which is obviously preferable case </p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mirror2image.wordpress.com/909/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mirror2image.wordpress.com/909/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mirror2image.wordpress.com/909/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mirror2image.wordpress.com/909/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mirror2image.wordpress.com/909/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mirror2image.wordpress.com/909/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mirror2image.wordpress.com/909/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mirror2image.wordpress.com/909/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mirror2image.wordpress.com/909/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mirror2image.wordpress.com/909/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mirror2image.wordpress.com/909/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mirror2image.wordpress.com/909/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mirror2image.wordpress.com/909/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mirror2image.wordpress.com/909/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=909&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mirror2image.wordpress.com/2011/08/06/effectivness-of-compressed-sensing-in-image-processing-and-other-staff/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0e52b3782501acf1b61c02132fe8275e?s=96&#38;d=identicon" medium="image">
			<media:title type="html">mirror2image</media:title>
		</media:content>
	</item>
		<item>
		<title>Stuff and AR</title>
		<link>http://mirror2image.wordpress.com/2011/07/04/stuff-and-ar/</link>
		<comments>http://mirror2image.wordpress.com/2011/07/04/stuff-and-ar/#comments</comments>
		<pubDate>Mon, 04 Jul 2011 14:08:00 +0000</pubDate>
		<dc:creator>mirror2image</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Augmented Reality]]></category>
		<category><![CDATA[fun]]></category>
		<category><![CDATA[Games]]></category>
		<category><![CDATA[Markerless]]></category>
		<category><![CDATA[SLAM]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://mirror2image.wordpress.com/?p=894</guid>
		<description><![CDATA[I once asked, what&#8217;s 3d registration/reconstruction/pose estimation is about &#8211; optimization or statistics? The more I think about it, the more I convinced it&#8217;s at least 80% statistics. Often specifically optimization tricks like Tikhonov regularization have statistical underpinning. Stability of optimization is robust statistics(Yes I know, I repeat it way too often). Cost function formulation [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=894&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I once asked, what&#8217;s 3d registration/reconstruction/pose estimation is about &#8211; optimization or statistics? The more I think about it, the more I convinced it&#8217;s at least 80% statistics. Often specifically optimization tricks like <a href="http://en.wikipedia.org/wiki/Tikhonov_regularization">Tikhonov regularization</a> have statistical underpinning. Stability of optimization is robust statistics(Yes I know, I repeat it way too often). Cost function formulation is a formulation for error distribution and define convergence speed.</p>
<p>Now unrelated(almost) AR stuff:<br />
I already mentioned on Twitter that version of markerless tracker for which I did a lot of work is part of Samsung AR SDK (SARI) for Android and Bada. It was <a href="http://s-a-m-m-i.blogspot.com/2011/06/some-presentations-from-ap2011-event.html">was shown at AP2011</a>(Presentaion and also include nice <a href="http://en.wikipedia.org/wiki/Bada_OS">Bada</a> code). AR SDK presentation is <a href="http://www.slideshare.net/mbortens/sari-ap2011">here</a>.<br />
Some videos form presentation &#8211; <a href="http://www.youtube.com/watch?v=JmC0VUYWMhE">Edi Bear game demo</a> with non-reference tracking at the end of the video and  less trivial <a href="http://www.youtube.com/watch?v=98AT8DJqNOc">elements of SLAM tracking</a>. Other application of SARI SDK &#8211; <a href="http://www.youtube.com/watch?v=ST5POqI1fGY">PBI</a> (This one seems use earlier version).</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mirror2image.wordpress.com/894/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mirror2image.wordpress.com/894/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mirror2image.wordpress.com/894/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mirror2image.wordpress.com/894/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mirror2image.wordpress.com/894/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mirror2image.wordpress.com/894/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mirror2image.wordpress.com/894/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mirror2image.wordpress.com/894/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mirror2image.wordpress.com/894/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mirror2image.wordpress.com/894/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mirror2image.wordpress.com/894/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mirror2image.wordpress.com/894/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mirror2image.wordpress.com/894/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mirror2image.wordpress.com/894/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=894&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mirror2image.wordpress.com/2011/07/04/stuff-and-ar/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0e52b3782501acf1b61c02132fe8275e?s=96&#38;d=identicon" medium="image">
			<media:title type="html">mirror2image</media:title>
		</media:content>
	</item>
		<item>
		<title>XKCD turtles</title>
		<link>http://mirror2image.wordpress.com/2011/05/07/xkcd-turtles/</link>
		<comments>http://mirror2image.wordpress.com/2011/05/07/xkcd-turtles/#comments</comments>
		<pubDate>Sat, 07 May 2011 06:48:39 +0000</pubDate>
		<dc:creator>mirror2image</dc:creator>
				<category><![CDATA[sci]]></category>
		<category><![CDATA[fun]]></category>
		<category><![CDATA[xkcd]]></category>

		<guid isPermaLink="false">http://mirror2image.wordpress.com/?p=886</guid>
		<description><![CDATA[I&#8217;m Achilles! I&#8217;m a turtle I&#8217;m Spartacus! I&#8217;m a turtle I think therefore I am! I&#8217;m a turtle I&#8217;m ClearCase! I&#8217;m a turtle I am the alpha and the omega! I&#8217;m a turtle via xkcd<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=886&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><img src="http://imgs.xkcd.com/comics/turtles.png" alt="turtles" /><br />
I&#8217;m Achilles!<br />
I&#8217;m a turtle<br />
I&#8217;m Spartacus!<br />
I&#8217;m a turtle<br />
I think therefore I am!<br />
I&#8217;m a turtle<br />
I&#8217;m ClearCase!<br />
I&#8217;m a turtle<br />
I am the alpha and the omega!<br />
I&#8217;m a turtle</p>
<p>via <a href="http://xkcd.com">xkcd</a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mirror2image.wordpress.com/886/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mirror2image.wordpress.com/886/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mirror2image.wordpress.com/886/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mirror2image.wordpress.com/886/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mirror2image.wordpress.com/886/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mirror2image.wordpress.com/886/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mirror2image.wordpress.com/886/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mirror2image.wordpress.com/886/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mirror2image.wordpress.com/886/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mirror2image.wordpress.com/886/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mirror2image.wordpress.com/886/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mirror2image.wordpress.com/886/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mirror2image.wordpress.com/886/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mirror2image.wordpress.com/886/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=886&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mirror2image.wordpress.com/2011/05/07/xkcd-turtles/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0e52b3782501acf1b61c02132fe8275e?s=96&#38;d=identicon" medium="image">
			<media:title type="html">mirror2image</media:title>
		</media:content>

		<media:content url="http://imgs.xkcd.com/comics/turtles.png" medium="image">
			<media:title type="html">turtles</media:title>
		</media:content>
	</item>
		<item>
		<title>Robust estimators III: Into the deep</title>
		<link>http://mirror2image.wordpress.com/2011/05/02/robust-estimators-iii-into-the-deep/</link>
		<comments>http://mirror2image.wordpress.com/2011/05/02/robust-estimators-iii-into-the-deep/#comments</comments>
		<pubDate>Mon, 02 May 2011 18:17:15 +0000</pubDate>
		<dc:creator>mirror2image</dc:creator>
				<category><![CDATA[computer vision]]></category>
		<category><![CDATA[sci]]></category>
		<category><![CDATA[Math]]></category>
		<category><![CDATA[pose estimation]]></category>
		<category><![CDATA[robust statistics]]></category>
		<category><![CDATA[Science]]></category>

		<guid isPermaLink="false">http://mirror2image.wordpress.com/?p=837</guid>
		<description><![CDATA[Cauchy estimator have some nice properties (Gonzales et al &#8220;Statistically-Efficient Filtering in Impulsive Environments: Weighted Myriad Filter&#8221; 2002): By tuning in it can approximate either least squares (big ), or mode &#8211; maximum of histogram &#8211; of sample set (small ). For small estimator behave the same way as power law distribution estimator with small [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=837&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Cauchy estimator have some nice properties (Gonzales et al &#8220;Statistically-Efficient Filtering in Impulsive Environments: Weighted Myriad Filter&#8221; 2002):<br />
By tuning <img src='http://s0.wp.com/latex.php?latex=%5Cgamma&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;gamma' title='&#92;gamma' class='latex' /> in<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cpsi%28x%29+%3D+%5Cfrac%7Bx%7D%7B%5Cgamma%5E2%2B+x%5E2%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;psi(x) = &#92;frac{x}{&#92;gamma^2+ x^2}' title='&#92;psi(x) = &#92;frac{x}{&#92;gamma^2+ x^2}' class='latex' /><br />
it can approximate either least squares (big <img src='http://s0.wp.com/latex.php?latex=%5Cgamma&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;gamma' title='&#92;gamma' class='latex' />), or <a href="http://en.wikipedia.org/wiki/Mode_%28statistics%29">mode</a> &#8211; maximum of histogram &#8211; of sample set (small <img src='http://s0.wp.com/latex.php?latex=%5Cgamma&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;gamma' title='&#92;gamma' class='latex' />). For small <img src='http://s0.wp.com/latex.php?latex=%5Cgamma&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;gamma' title='&#92;gamma' class='latex' />  estimator behave the same way as <a href="http://en.wikipedia.org/wiki/Power_law">power law distribution estimator</a> with small <img src='http://s0.wp.com/latex.php?latex=%5Calpha&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;alpha' title='&#92;alpha' class='latex' />.<br />
Another property is that for several measurements with different scales  <img src='http://s0.wp.com/latex.php?latex=%5Cgamma_i&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;gamma_i' title='&#92;gamma_i' class='latex' /> estimator of their sum will be simple<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cpsi%28x%29+%3D+%5Cfrac%7Bx%7D%7B%28%5Csum+%5Cgamma_i%29%5E2%2B+x%5E2%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;psi(x) = &#92;frac{x}{(&#92;sum &#92;gamma_i)^2+ x^2}' title='&#92;psi(x) = &#92;frac{x}{(&#92;sum &#92;gamma_i)^2+ x^2}' class='latex' /><br />
which is convenient for estimation of random walks</p>
<p>I heard convulsion in the sky,<br />
And flight of angel hosts on high,<br />
And monsters moving in the deep</p>
<p>Those verses from <a href="http://www.google.com/search?source=ig&amp;hl=en&amp;rlz=&amp;=&amp;q=Prophet+pushkin">The Prophet</a> by <a href="http://en.wikipedia.org/wiki/Pushkin">A.Pushkin</a> could be seen as metaphor of profound mathematical insight, encompassing <a href="http://en.wikipedia.org/wiki/Bifurcation_theory">bifurcations</a>, <a href="http://en.wikipedia.org/wiki/Higher_dimensional_algebra">higher dimensional algebra</a> and  murky depths of statistics.<br />
I now intend to dive deeper into of statistics &#8211; toward &#8220;data depth&#8221;. Data depth is a generalization of <a href="http://en.wikipedia.org/wiki/Median">median</a> concept to multidimensional data. Remind you that median can be seen either as order parameter &#8211; value dividing  the higher half of measurements from lower, or geometrically, as the minimum of <img src='http://s0.wp.com/latex.php?latex=L_1&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='L_1' title='L_1' class='latex' /> norm. Second approach lead to <a href="http://en.wikipedia.org/wiki/Geometric_median">geometric median</a>, about which I already <a href="http://mirror2image.wordpress.com/2011/04/10/minimum-sum-of-distance-vs-l1-and-geometric-median/" />talked about</a>.<br />
 First approach to generalizations of median is to try to apply order statistics to multidimensional vectors.The idea is to make some kind of partial order for n-dimensional points &#8211; &#8220;depth&#8221; of points, and to choose as the analog of median the point of maximum depth.<br />
Basically all <em>data depth</em> concepts define &#8220;depth&#8221; as some characterization of how deep points are reside inside the point cloud.<br />
Historically first and easiest to understand was convex hull approach &#8211; make convex hull of data set, assign points in the hull depth 1, remove it, get convex hull of points remained inside, assign new hull depth 2, remove etc.; repeat until there is no point inside last convex hull.<br />
Later Tukey introduce similar &#8220;halfspace depth&#8221; concept &#8211; for each point X find the minimum number of points which could be cut from the dataset by plane through the point X. That number count as depth(see the nice overview of those and other geometrical definition of depth at <a href="http://cgm.cs.mcgill.ca/~athens/Geometric-Estimators/halfspace.html">Greg Aloupis page</a>)<br />
In 2002 Mizera introduced &#8220;global depth&#8221;, which is less geometric and more statistical. It start with assumption of some loss function (&#8220;criterial function&#8221; in Mizera definition) <img src='http://s0.wp.com/latex.php?latex=F%28x_i%2C+%5Ctheta%29&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='F(x_i, &#92;theta)' title='F(x_i, &#92;theta)' class='latex' /> of measurement set <img src='http://s0.wp.com/latex.php?latex=x_i&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='x_i' title='x_i' class='latex' />. This function could be(but not necessary) cumulative probability distribution. Now for two parameters <img src='http://s0.wp.com/latex.php?latex=%5Ctheta_1&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;theta_1' title='&#92;theta_1' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=%5Ctheta_2&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;theta_2' title='&#92;theta_2' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%5Ctheta_1&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;theta_1' title='&#92;theta_1' class='latex' /> is <em>more fit</em>  with respect <img src='http://s0.wp.com/latex.php?latex=A+%5Csubset+N&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='A &#92;subset N' title='A &#92;subset N' class='latex' /> if for all  <img src='http://s0.wp.com/latex.php?latex=i+%5Cin+A&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='i &#92;in A' title='i &#92;in A' class='latex' /> <img src='http://s0.wp.com/latex.php?latex=F%28x_i%2C+%5Ctheta_1%29+%3E+F%28x_i%2C+%5Ctheta_2%29&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='F(x_i, &#92;theta_1) &gt; F(x_i, &#92;theta_2)' title='F(x_i, &#92;theta_1) &gt; F(x_i, &#92;theta_2)' class='latex' />. <img src='http://s0.wp.com/latex.php?latex=%5Chat%7B%5Ctheta%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;hat{&#92;theta}' title='&#92;hat{&#92;theta}' class='latex' /> is <em>weakly optimal</em> with respect to  <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> if there is nor better fit parameter with respect to  <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' />. At last <em>global depth</em> of  <img src='http://s0.wp.com/latex.php?latex=%5Ctheta&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;theta' title='&#92;theta' class='latex' /> is the minimum possible size of <img src='http://s0.wp.com/latex.php?latex=A&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='A' title='A' class='latex' /> such that <img src='http://s0.wp.com/latex.php?latex=%5Ctheta&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;theta' title='&#92;theta' class='latex' /> is <em>not</em> weakly optimal with respect to  <img src='http://s0.wp.com/latex.php?latex=N+%5Csetminus+A&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='N &#92;setminus A' title='N &#92;setminus A' class='latex' /> &#8211; reminder of measurements. In other words <em>global depth</em> is minimum number of measurements  which should be removed for  <img src='http://s0.wp.com/latex.php?latex=%5Ctheta&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;theta' title='&#92;theta' class='latex' /> stop being weakly optimal. Global depth is not easy to calculate or visualize, so Mizera introduce more simple concept &#8211; <em>tangent depth</em>.<br />
Tangent depth defined as <img src='http://s0.wp.com/latex.php?latex=min_%7B%5Cparallel+u%5Cparallel%3D1%7D%5Cmid+%5C%7B+i%3A+u%5ET+%5Cbigtriangledown_%7B%5Ctheta%7D+F%28x_i%29+%5Cgeq+0+%5C%7D%5Cmid&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='min_{&#92;parallel u&#92;parallel=1}&#92;mid &#92;{ i: u^T &#92;bigtriangledown_{&#92;theta} F(x_i) &#92;geq 0 &#92;}&#92;mid' title='min_{&#92;parallel u&#92;parallel=1}&#92;mid &#92;{ i: u^T &#92;bigtriangledown_{&#92;theta} F(x_i) &#92;geq 0 &#92;}&#92;mid' class='latex' />. What does it mean? Tangent depth is minimum number of &#8220;bad&#8221; points &#8211; such points that for specific direction loss function for themis growing.<br />
Those definitions of &#8220;data depth&#8221; allow for another type of estimator, based not on likelihood, but on <a href="http://en.wikipedia.org/wiki/Order_statistics">order statistics</a> -<em>maximum depth estimators</em>. The advantage of those estimators is robustness(<a href="http://en.wikipedia.org/wiki/Breakdown_point#Breakdown_point">breakdown point</a> ~25%-33%) and disadvantage &#8211; low precision (<a href="http://en.wikipedia.org/wiki/Bias_of_an_estimator">high bias</a>). So I wouldn&#8217;t use them for precise estimation, but for sanity check or initial approximation. In some cases they could be computationally  more cheap than M-estimators. As useful side effect they also give some insight into structure of dataset(it seems originally maximum depth estimators was seen as data visualization tool). Depth could be good criterion for outliers rejection.<br />
Disclaimer: while I had very positive experience with Cauchy estimator, data depth is a new thing for me.I have yet to see how useful it could be for computer vision related problems.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mirror2image.wordpress.com/837/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mirror2image.wordpress.com/837/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mirror2image.wordpress.com/837/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mirror2image.wordpress.com/837/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mirror2image.wordpress.com/837/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mirror2image.wordpress.com/837/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mirror2image.wordpress.com/837/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mirror2image.wordpress.com/837/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mirror2image.wordpress.com/837/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mirror2image.wordpress.com/837/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mirror2image.wordpress.com/837/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mirror2image.wordpress.com/837/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mirror2image.wordpress.com/837/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mirror2image.wordpress.com/837/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=837&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mirror2image.wordpress.com/2011/05/02/robust-estimators-iii-into-the-deep/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0e52b3782501acf1b61c02132fe8275e?s=96&#38;d=identicon" medium="image">
			<media:title type="html">mirror2image</media:title>
		</media:content>
	</item>
		<item>
		<title>Robust estimators II</title>
		<link>http://mirror2image.wordpress.com/2011/04/19/robust-estimators-ii/</link>
		<comments>http://mirror2image.wordpress.com/2011/04/19/robust-estimators-ii/#comments</comments>
		<pubDate>Tue, 19 Apr 2011 06:57:02 +0000</pubDate>
		<dc:creator>mirror2image</dc:creator>
				<category><![CDATA[computer vision]]></category>
		<category><![CDATA[sci]]></category>
		<category><![CDATA[bundle adjustment]]></category>
		<category><![CDATA[Math]]></category>
		<category><![CDATA[pose estimation]]></category>
		<category><![CDATA[robust statistics]]></category>
		<category><![CDATA[Science]]></category>

		<guid isPermaLink="false">http://mirror2image.wordpress.com/?p=805</guid>
		<description><![CDATA[In this post I was complaining that I don&#8217;t know what breakdown point for redescending M-estimators is. Now I found out that upper bound for breakdown point of redescending of M-estimators was given by Mueller in 1995, for linear regression (that is statisticians word for simple estimation of p-dimensional hyperplane): &#8211; number of measurements and [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=805&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>In  <a href="http://mirror2image.wordpress.com/2011/04/02/l1-robust-statisrics-and-compressed-sensing/" />this post</a> I was complaining that I don&#8217;t know what <a href="http://en.wikipedia.org/wiki/Breakdown_point#Breakdown_point">breakdown point</a> for <a href="http://en.wikipedia.org/wiki/Redescending_M-estimator">redescending M-estimators</a> is. Now I found out that upper bound for breakdown point of redescending of M-estimators was given by Mueller in 1995, for <a href="http://en.wikipedia.org/wiki/Linear_regression">linear regression</a> (that is statisticians word for simple estimation of p-dimensional hyperplane):<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cfrac%7B1%7D%7BN%7D%28%5Cfrac%7BN+-+%5Cmathcal%7BN%7D%28x%29+%2B+1%29%7D%7B2%7D%29&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;frac{1}{N}(&#92;frac{N - &#92;mathcal{N}(x) + 1)}{2})' title='&#92;frac{1}{N}(&#92;frac{N - &#92;mathcal{N}(x) + 1)}{2})' class='latex' /><br />
<img src='http://s0.wp.com/latex.php?latex=N&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='N' title='N' class='latex' /> &#8211; number of measurements and <img src='http://s0.wp.com/latex.php?latex=%5Cmathcal%7BN%7D%28x%29&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;mathcal{N}(x)' title='&#92;mathcal{N}(x)' class='latex' /> is little tricky: it is a maximum number of measurement vectors X  lying in the same p-dimensional hyperplane. If number of measurements N &gt;&gt; p that mean breakdown point is near 50% &#8211; You can have half measurement results completely out of the blue and estimator will still work.<br />
That only work if the error present only in results of measurements, which is reasonable condition &#8211; in most cases we can move random error from x part to y part.<br />
Now which M-estimators attain this upper bound?<br />
The condition is &#8220;slow variation&#8221;(Mizera and Mueller 1999)<br />
<img src='http://s0.wp.com/latex.php?latex=%5Clim_%7Bt%5Cto+%5Cinfty%7D+%5Cfrac%7B%5Crho%28t+x%29%7D%7B%5Crho%28t%29%7D+%3D+1&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;lim_{t&#92;to &#92;infty} &#92;frac{&#92;rho(t x)}{&#92;rho(t)} = 1' title='&#92;lim_{t&#92;to &#92;infty} &#92;frac{&#92;rho(t x)}{&#92;rho(t)} = 1' class='latex' /><br />
Mentioned in previous post Cauchy estimator is satisfy that condition:<br />
<img src='http://s0.wp.com/latex.php?latex=%5Crho%28x%29+%3D+-%5Cln%281+%2B%28%5Cfrac%7Bx%7D%7B%5Cgamma%7D%29%5E2%29&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;rho(x) = -&#92;ln(1 +(&#92;frac{x}{&#92;gamma})^2)' title='&#92;rho(x) = -&#92;ln(1 +(&#92;frac{x}{&#92;gamma})^2)' class='latex' /> and its derivative <img src='http://s0.wp.com/latex.php?latex=%5Cpsi%28x%29+%3D+%5Cfrac%7Bx%7D%7B%5Cgamma%5E2%2B+x%5E2%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;psi(x) = &#92;frac{x}{&#92;gamma^2+ x^2}' title='&#92;psi(x) = &#92;frac{x}{&#92;gamma^2+ x^2}' class='latex' /><br />
In practice we always work with <img src='http://s0.wp.com/latex.php?latex=%5Cpsi&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;psi' title='&#92;psi' class='latex' />, not <img src='http://s0.wp.com/latex.php?latex=%5Crho&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;rho' title='&#92;rho' class='latex' /> so Cauchy estimator is easy to calculate.<br />
Rule of the thumb: if you don&#8217;t know which robust estimator to use, use Cauchy: It&#8217;s fast(which is important in real time apps), its easy to understand, it&#8217;s differentiable, and it is as robust as possible (that is for redescending M-estimator)</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mirror2image.wordpress.com/805/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mirror2image.wordpress.com/805/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mirror2image.wordpress.com/805/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mirror2image.wordpress.com/805/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mirror2image.wordpress.com/805/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mirror2image.wordpress.com/805/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mirror2image.wordpress.com/805/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mirror2image.wordpress.com/805/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mirror2image.wordpress.com/805/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mirror2image.wordpress.com/805/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mirror2image.wordpress.com/805/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mirror2image.wordpress.com/805/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mirror2image.wordpress.com/805/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mirror2image.wordpress.com/805/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=805&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mirror2image.wordpress.com/2011/04/19/robust-estimators-ii/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0e52b3782501acf1b61c02132fe8275e?s=96&#38;d=identicon" medium="image">
			<media:title type="html">mirror2image</media:title>
		</media:content>
	</item>
		<item>
		<title>Robust estimators &#8211; understand or die&#8230; err&#8230; be bored trying</title>
		<link>http://mirror2image.wordpress.com/2011/04/15/robust-estimators-understand-or-die-err-be-bored-trying/</link>
		<comments>http://mirror2image.wordpress.com/2011/04/15/robust-estimators-understand-or-die-err-be-bored-trying/#comments</comments>
		<pubDate>Fri, 15 Apr 2011 16:28:04 +0000</pubDate>
		<dc:creator>mirror2image</dc:creator>
				<category><![CDATA[sci]]></category>
		<category><![CDATA[bundle adjustment]]></category>
		<category><![CDATA[Math]]></category>
		<category><![CDATA[nonlinear optimization]]></category>
		<category><![CDATA[pose estimation]]></category>
		<category><![CDATA[robust statistics]]></category>
		<category><![CDATA[Science]]></category>

		<guid isPermaLink="false">http://mirror2image.wordpress.com/?p=744</guid>
		<description><![CDATA[This is continuation of my attempt to understand internal mechanics of robust statistics. First I want to say that robust statistics &#8220;just works&#8221;. It&#8217;s not necessary to have deep understanding of it to use it and even to use it creatively. However without that deeper understanding I feel myself kind of blind. I can modify [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=744&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This is continuation of my attempt to understand internal mechanics of <a href="http://en.wikipedia.org/wiki/Robust_statistics">robust statistics</a>. First I want to say that robust statistics &#8220;just works&#8221;. It&#8217;s not necessary to have deep understanding of it to use it and even to use it creatively. However without that deeper understanding I feel myself kind of blind. I can modify or invent robust estimators empirically, but I can not see clearly the reasons, why use this and not that modification.<br />
Now about robust estimators. They could be divided into two groups: <a href="http://en.wikipedia.org/wiki/Robust_estimator#M-estimators">maximum likelihood estimators</a>(M-estimators), which in case of robust statistics usually, but not always are <a href="http://en.wikipedia.org/wiki/Redescending_M-estimator">redescending estimators</a> (notable <strong><em>not</em></strong> redescending estimator is <img src='http://s0.wp.com/latex.php?latex=L_1&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='L_1' title='L_1' class='latex' /> norm), and all the rest of estimators.<br />
This second &#8220;all the rest&#8221; group include subset of <a href="http://en.wikipedia.org/wiki/L-estimator">L-estimators</a>(think of median, which is also M-estimator with <img src='http://s0.wp.com/latex.php?latex=L_1&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='L_1' title='L_1' class='latex' /> norm.Yea, it&#8217;s kind of messy), S-estimators (use global scale estimation for all the measurements) R-estimators, which like L-estimator use <a href="http://en.wikipedia.org/wiki/Order_statistics">order statistics</a> but use it for weights. There may be some others too, but I don&#8217;t know much about this second group.<br />
It&#8217;s easy to understand what M-estimators do: just find the value of parameter which give maximum probability of given set of measurements.<br />
<img src='http://s0.wp.com/latex.php?latex=argmax_%7B%5Ctheta%7D++%5Cprod_%7Bi%3D1%7D%5E%7Bn%7Dp+%28+x_i%5Cmid%5Ctheta%29+&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='argmax_{&#92;theta}  &#92;prod_{i=1}^{n}p ( x_i&#92;mid&#92;theta) ' title='argmax_{&#92;theta}  &#92;prod_{i=1}^{n}p ( x_i&#92;mid&#92;theta) ' class='latex' /><br />
or<br />
<img src='http://s0.wp.com/latex.php?latex=argmin_%7B%5Ctheta%7D%5Csum_%7Bi%3D1%7D%5E%7Bn%7D++-ln%28p%28x_i%7C%5Ctheta%29%29&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='argmin_{&#92;theta}&#92;sum_{i=1}^{n}  -ln(p(x_i|&#92;theta))' title='argmin_{&#92;theta}&#92;sum_{i=1}^{n}  -ln(p(x_i|&#92;theta))' class='latex' /><br />
which give us traditional M-estimator form<br />
<img src='http://s0.wp.com/latex.php?latex=argmin_%7B%5Ctheta%7D%5Csum_%7Bi%3D1%7D%5En+%5Crho%28x_i%2C+%5Ctheta%29&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='argmin_{&#92;theta}&#92;sum_{i=1}^n &#92;rho(x_i, &#92;theta)' title='argmin_{&#92;theta}&#92;sum_{i=1}^n &#92;rho(x_i, &#92;theta)' class='latex' /><br />
or<br />
<img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bi%3D1%7D%5En+%5Cpsi%28x_i%2C+%5Ctheta%29+%3D+0&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;sum_{i=1}^n &#92;psi(x_i, &#92;theta) = 0' title='&#92;sum_{i=1}^n &#92;psi(x_i, &#92;theta) = 0' class='latex' />, <img src='http://s0.wp.com/latex.php?latex=%5Cpsi+%3D+%5Cfrac%7B%5Cpartial+%5Crho%7D%7B%5Cpartial+%5Ctheta%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;psi = &#92;frac{&#92;partial &#92;rho}{&#92;partial &#92;theta}' title='&#92;psi = &#92;frac{&#92;partial &#92;rho}{&#92;partial &#92;theta}' class='latex' /><br />
Practically we are usually work not with measurements per se, but with some distribution of cost function <img src='http://s0.wp.com/latex.php?latex=F%28x%2C%5Ctheta%29&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='F(x,&#92;theta)' title='F(x,&#92;theta)' class='latex' /> of the measurements <img src='http://s0.wp.com/latex.php?latex=%5Crho%28x%2C+%5Ctheta%29+%3D+p%28F%28x%2C+%5Ctheta%29%29&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;rho(x, &#92;theta) = p(F(x, &#92;theta))' title='&#92;rho(x, &#92;theta) = p(F(x, &#92;theta))' class='latex' />, so it become<br />
<img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bi%3D1%7D%5En+%5Cpsi%28x_i%2C+%5Ctheta%29%5Cfrac%7B%5Cpartial+F%28x_i%2C%5Ctheta%29%7D%7B%5Cpartial+%5Ctheta%7D+%3D+0&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;sum_{i=1}^n &#92;psi(x_i, &#92;theta)&#92;frac{&#92;partial F(x_i,&#92;theta)}{&#92;partial &#92;theta} = 0' title='&#92;sum_{i=1}^n &#92;psi(x_i, &#92;theta)&#92;frac{&#92;partial F(x_i,&#92;theta)}{&#92;partial &#92;theta} = 0' class='latex' /><br />
it&#8217;s the same as the previous equation just <img src='http://s0.wp.com/latex.php?latex=%5Cpsi&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;psi' title='&#92;psi' class='latex' /> defined in such a way as  to separate statistical part from cost function part.<br />
Now if we make a set of weights <img src='http://s0.wp.com/latex.php?latex=w_i+%3D+%5Cfrac%7B%5Cpsi_i%7D%7BF_i%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='w_i = &#92;frac{&#92;psi_i}{F_i}' title='w_i = &#92;frac{&#92;psi_i}{F_i}' class='latex' /> it become<br />
<img src='http://s0.wp.com/latex.php?latex=%5Csum_%7Bi%3D1%7D%5En+w_i%28x_i%2C+%5Ctheta%29+F%28x_i%2C+%5Ctheta%29+%5Cfrac%7B%5Cpartial+F%28x_i%2C%5Ctheta%29%7D%7B%5Cpartial+%5Ctheta%7D+%3D+0&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;sum_{i=1}^n w_i(x_i, &#92;theta) F(x_i, &#92;theta) &#92;frac{&#92;partial F(x_i,&#92;theta)}{&#92;partial &#92;theta} = 0' title='&#92;sum_{i=1}^n w_i(x_i, &#92;theta) F(x_i, &#92;theta) &#92;frac{&#92;partial F(x_i,&#92;theta)}{&#92;partial &#92;theta} = 0' class='latex' /><br />
We see that it could be considered as &#8220;nonlinear least squares&#8221;, which could be solved with <a href="http://en.wikipedia.org/wiki/IRLS">iteratively reweighted least squares</a><br />
Now for second group of estimators we have probability of <a href="http://en.wikipedia.org/wiki/Joint_distribution">joint distribution</a><br />
<img src='http://s0.wp.com/latex.php?latex=argmax_%7B%5Ctheta%7D++%5Cprod_%7Bi%3D1%7D%5E%7Bn%7Dp+%28+x_i%5Cmid+x_%7Bj%2C+j%5Cneq+i%7D%2C+%5Ctheta%29+&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='argmax_{&#92;theta}  &#92;prod_{i=1}^{n}p ( x_i&#92;mid x_{j, j&#92;neq i}, &#92;theta) ' title='argmax_{&#92;theta}  &#92;prod_{i=1}^{n}p ( x_i&#92;mid x_{j, j&#92;neq i}, &#92;theta) ' class='latex' /><br />
All the global factors &#8211; sort order, global scale etc. are incorporated into measurements dependence.<br />
It seems the difference between this formulation of second group of estimators and M-estimator is that <a href="http://en.wikipedia.org/wiki/Conditional_independence">conditional independence</a> assumption about measurements is dropped.<br />
Another interesting thing is that if some of measurements are not dependent on others, this formulation can get us <a href="http://en.wikipedia.org/wiki/Bayesian_network">bayesian network</a></p>
<p>Now lets return to M-estimators. M-estimator is defined by assumption about <a href="http://en.wikipedia.org/wiki/Probability_distribution">probability distribution</a> of the measurements.<br />
So M-estimator and <em>probabilistic distribution</em> through which it is defined are essentially the same. Least squares, for example, is produced by <a href="http://en.wikipedia.org/wiki/Normal_distribution">normal</a>(gausssian) distribution. Just take sum of logarithms of gaussian and you get least squares estimator.<br />
If we are talking about normal (pun intended), non-robust <a href="http://en.wikipedia.org/wiki/Estimator">estimator</a>, their defining feature is finite <a href="http://en.wikipedia.org/wiki/Variance">variance</a> of distribution.<br />
We have <a href="http://en.wikipedia.org/wiki/Central_limit_theorem">central limit theorem</a> which saying that for any distribution mean value of samples will have approximately <a href="http://en.wikipedia.org/wiki/Normal_distribution">normal</a>(or Gaussian) distribution.<br />
From this follow <a href="http://en.wikipedia.org/wiki/Estimator#Behavioural_properties">property of asymptotic normality</a> &#8211; for estimator with finite variance its distribution around true value of parameter <img src='http://s0.wp.com/latex.php?latex=%5Ctheta&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;theta' title='&#92;theta' class='latex' /> approximate normal distribution.<br />
We are discussing robust estimators, which are stable to error and have &#8220;thick-tailed&#8221; distribution, so we <em>can not</em> assume finite variance of distribution.<br />
Nevertheless to have &#8220;true&#8221; result we want some form of probabilistic <a href="http://en.wikipedia.org/wiki/Convergence_of_random_variables#Convergence_in_distribution">convergence</a> of measurements to true value. As it happens such class of distribution with infinite variance exists. It&#8217;s called <a href="http://en.wikipedia.org/wiki/Stable_distribution">alpha-stable distributions</a>.<br />
Alpha stable distribution are those distributions for which linear combination of random variables have the same distribution, up to scale factor. From this follow <a href="http://en.wikipedia.org/wiki/Infinitely_divisible_distribution#Limit_theorem">analog of central limit theorem for stable distribution</a>.<br />
The most well known alpha-stable distribution is <a href="http://en.wikipedia.org/wiki/Cauchy_distribution">Cauchy distribution</a>, which correspond to widely used redescending estimator<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cpsi%28x%29+%3D+%5Cfrac+%7Bx%7D+%7B%5Cvarepsilon+%2B+x%5E2%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;psi(x) = &#92;frac {x} {&#92;varepsilon + x^2}' title='&#92;psi(x) = &#92;frac {x} {&#92;varepsilon + x^2}' class='latex' /><br />
Cauchy distribution can be generalized in several way, including recent GCD &#8211; generalized Cauchy distribution(<a href="http://www.ece.udel.edu/~carrillo/Papers/jasp2010.pdf">Carrillo et al</a>), with density function<br />
<img src='http://s0.wp.com/latex.php?latex=p%5CGamma%28p%2F2%29%2F2%5CGamma%281%2Fp%29%5E2%28%5Csigma%5Ep+%2B+x%5Ep%29%5E%7B-2%2Fp%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='p&#92;Gamma(p/2)/2&#92;Gamma(1/p)^2(&#92;sigma^p + x^p)^{-2/p}' title='p&#92;Gamma(p/2)/2&#92;Gamma(1/p)^2(&#92;sigma^p + x^p)^{-2/p}' class='latex' /><br />
and estimator<br />
<img src='http://s0.wp.com/latex.php?latex=%5Cpsi%28x%29%3D%5Cfrac%7Bp%7Cx%7C%5E%7Bp-1%7Dsgn%28x%29%7D%7B%5Csigma%5Ep+%2B+x%5Ep%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='&#92;psi(x)=&#92;frac{p|x|^{p-1}sgn(x)}{&#92;sigma^p + x^p}' title='&#92;psi(x)=&#92;frac{p|x|^{p-1}sgn(x)}{&#92;sigma^p + x^p}' class='latex' /><br />
Carrillo also introduce Cauchy distribution-based &#8220;norm&#8221; (it&#8217;s not a real norm obviously) which he called &#8220;Lorentzian norm&#8221;<br />
<img src='http://s0.wp.com/latex.php?latex=%7C%7Cu%7C%7C_%7BLL_p%7D+%3D+%5Csum+ln%281+%2B+%5Cfrac%7B%7Cu_i%7C%5Ep%7D%7B%5Csigma%5Ep%7D%29&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='||u||_{LL_p} = &#92;sum ln(1 + &#92;frac{|u_i|^p}{&#92;sigma^p})' title='||u||_{LL_p} = &#92;sum ln(1 + &#92;frac{|u_i|^p}{&#92;sigma^p})' class='latex' /><br />
<img src='http://s0.wp.com/latex.php?latex=%7BLL_2%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='{LL_2}' title='{LL_2}' class='latex' /> is correspond classical Cauchy distribution<br />
He successfully applied Lorentzian norm <img src='http://s0.wp.com/latex.php?latex=%7BLL_2%7D&amp;bg=e6e6e6&amp;fg=333333&amp;s=0' alt='{LL_2}' title='{LL_2}' class='latex' /> based <a href="http://en.wikipedia.org/wiki/Basis_pursuit">basis pursuit</a> to <a href="http://en.wikipedia.org/wiki/Compressed_sensing">compressed sensing</a> problem, which support idea that compressed sensing and robust statistics are dual each other.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mirror2image.wordpress.com/744/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mirror2image.wordpress.com/744/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mirror2image.wordpress.com/744/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mirror2image.wordpress.com/744/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mirror2image.wordpress.com/744/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mirror2image.wordpress.com/744/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mirror2image.wordpress.com/744/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mirror2image.wordpress.com/744/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mirror2image.wordpress.com/744/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mirror2image.wordpress.com/744/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mirror2image.wordpress.com/744/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mirror2image.wordpress.com/744/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mirror2image.wordpress.com/744/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mirror2image.wordpress.com/744/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=744&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mirror2image.wordpress.com/2011/04/15/robust-estimators-understand-or-die-err-be-bored-trying/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0e52b3782501acf1b61c02132fe8275e?s=96&#38;d=identicon" medium="image">
			<media:title type="html">mirror2image</media:title>
		</media:content>
	</item>
		<item>
		<title>Is Robust Statistics have formal mathematical foundation?</title>
		<link>http://mirror2image.wordpress.com/2011/04/11/is-robust-statistics-have-formal-mathematical-foundation/</link>
		<comments>http://mirror2image.wordpress.com/2011/04/11/is-robust-statistics-have-formal-mathematical-foundation/#comments</comments>
		<pubDate>Mon, 11 Apr 2011 08:14:23 +0000</pubDate>
		<dc:creator>mirror2image</dc:creator>
				<category><![CDATA[sci]]></category>
		<category><![CDATA[bundle adjustment]]></category>
		<category><![CDATA[Math]]></category>
		<category><![CDATA[nonlinear optimization]]></category>
		<category><![CDATA[robust statistics]]></category>
		<category><![CDATA[Science]]></category>

		<guid isPermaLink="false">http://mirror2image.wordpress.com/?p=736</guid>
		<description><![CDATA[As I have already written I have a trouble understanding what robust estimators actually estimate from probabilistic or other formal point of view. I mean estimators which are not maximum likelihood estimators. There is a formal definition which doesn&#8217;t explain a lot to me. It looks like estimator estimate some quantity, and we know how [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=736&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>As I have already <a href="http://mirror2image.wordpress.com/2011/04/02/l1-robust-statisrics-and-compressed-sensing/" />written</a> I have  a trouble understanding what <a href="http://en.wikipedia.org/wiki/Robust_estimator">robust estimators</a> actually estimate from probabilistic or other formal point of view. I mean estimators which are <em>not</em> maximum likelihood estimators. There is a formal <a href="http://en.wikipedia.org/wiki/Extremum_estimator#Definition">definition</a> which doesn&#8217;t explain a lot to me. It looks like estimator estimate some quantity, and we know how good we are at estimating it, but how we know what we are actually estimate? Or does this question even make sense? But that is actually a minor bummer. A problem with understanding outliers is a lot worse for me.  A <a href="http://en.wikipedia.org/wiki/Robust_statistics#Breakdown_point">breakdown point</a> is a fundamental concept in robust statistics. And breakdown point is defined as a relative number of outliers in the sample set. The problem is, it seems  there is no formal definition of <a href="http://en.wikipedia.org/wiki/Outliers">outlier</a> in statistics or probability theory. We can talk about mixture models, and tail distributions but those concepts are not quite consistent with breakdown point. Breakdown point looks like it belong to area of optimization/topology, not statistics. Could it be that outliers could be defined consistently only if we have some additional structural information/constraints beside statistical (distribution)?   That inability to reconcile statistics and optimization is a problem which causing cognitive headache for me.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/mirror2image.wordpress.com/736/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/mirror2image.wordpress.com/736/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/mirror2image.wordpress.com/736/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/mirror2image.wordpress.com/736/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/mirror2image.wordpress.com/736/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/mirror2image.wordpress.com/736/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/mirror2image.wordpress.com/736/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/mirror2image.wordpress.com/736/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/mirror2image.wordpress.com/736/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/mirror2image.wordpress.com/736/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/mirror2image.wordpress.com/736/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/mirror2image.wordpress.com/736/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/mirror2image.wordpress.com/736/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/mirror2image.wordpress.com/736/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=mirror2image.wordpress.com&amp;blog=5263770&amp;post=736&amp;subd=mirror2image&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://mirror2image.wordpress.com/2011/04/11/is-robust-statistics-have-formal-mathematical-foundation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/0e52b3782501acf1b61c02132fe8275e?s=96&#38;d=identicon" medium="image">
			<media:title type="html">mirror2image</media:title>
		</media:content>
	</item>
	</channel>
</rss>
