<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>
<channel>
	<title>Visual-Experiments.com &#187; structure from motion</title>
	<atom:link href="http://www.visual-experiments.com/tag/structure-from-motion/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.visual-experiments.com</link>
	<description>ASTRE Henri experiments with Ogre3D and web stuff</description>
	<lastBuildDate>Mon, 16 Jan 2017 18:59:35 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.2</generator>
		<item>
		<title>Structure from motion projects</title>
		<link>http://www.visual-experiments.com/2010/12/20/structure-from-motion-projects/</link>
		<comments>http://www.visual-experiments.com/2010/12/20/structure-from-motion-projects/#comments</comments>
		<pubDate>Mon, 20 Dec 2010 18:57:29 +0000</pubDate>
		<dc:creator>Henri</dc:creator>
				<category><![CDATA[photogrammetry]]></category>
		<category><![CDATA[bundler]]></category>
		<category><![CDATA[photosynth]]></category>
		<category><![CDATA[structure from motion]]></category>
		<guid isPermaLink="false">http://www.visual-experiments.com/?p=1029</guid>
		<description><![CDATA[I&#8217;ve introduced my tracking algorithm in the previous post. One of the issues I have is that the point cloud generated by my SFMToolkit (using Bundler) is not always accurate. Here is a list of alternative structure from motion projects I&#8217;m interested in: Building Rome in a Day: Project home is using Bundler (GPL) Building [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve introduced my tracking algorithm in <a href="http://www.visual-experiments.com/2010/12/13/augmented-reality-outdoor-tracking-becoming-reality/">the previous post</a>. One of the issues I have is that the point cloud generated by my <a href="http://www.visual-experiments.com/2010/11/05/structure-from-motion-toolkit-released/">SFMToolkit</a> (using <a href="http://phototour.cs.washington.edu/bundler/">Bundler</a>) is not always accurate. Here is a list of alternative structure from motion projects I&#8217;m interested in:</p>
<h3>Building Rome in a Day:</h3>
<p><a href="http://grail.cs.washington.edu/projects/rome/">Project home</a> is using <a href="http://phototour.cs.washington.edu/bundler/">Bundler</a> (GPL)</p>
<p><object width="500" height="400"><param name="movie" value="http://www.youtube.com/v/kxtQqYLRaSQ?version=3"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/kxtQqYLRaSQ?version=3" type="application/x-shockwave-flash" width="500" height="400" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<h3>Building Rome on a Cloudless Day:</h3>
<p><a href="http://www.cs.unc.edu/~jmf/rome_on_a_cloudless_day/">Project home</a> | <a href="http://www.cs.unc.edu/~tjohnson/research.html">Source code</a> (non-profit license; I&#8217;ve ported their source to Windows)</p>
<p><object width="500" height="400"><param name="movie" value="http://www.youtube.com/v/PySBQ8Q_R8k?version=3"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/PySBQ8Q_R8k?version=3" type="application/x-shockwave-flash" width="500" height="400" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<h3>Samantha:</h3>
<p><a href="http://profs.sci.univr.it/~fusiello/demo/samantha/">Project home</a> (I contacted them but got no response; they did say that they were going to release the source code: <a href="http://videolectures.net/cvpr2010_spotlights5/">check at 28:50</a>)</p>
<table>
<tbody style="background-color: white;">
<tr>
<td colspan="2"><img src="http://www.visual-experiments.com/blog/wp-content/uploads/2010/11/samantha.jpg" alt="" title="samantha" width="551" height="199" class="alignnone size-full wp-image-1030" /></td>
</tr>
<tr>
<td style="text-align: center;">Samantha</td>
<td style="text-align: center;">Bundler</td>
</tr>
</tbody>
</table>
<h3>PhotoSynth:</h3>
<p><a href="http://photosynth.net/">Website</a> &#8211; Microsoft closed-source SFM application: <a href="http://www.visual-experiments.com/2010/11/09/photosynth-toolkit-updated/">check out my PhotoSynthToolkit</a></p>
<table>
<tbody style="background-color: white;">
<tr>
<td colspan="2"><img src="http://www.visual-experiments.com/blog/wp-content/uploads/2010/11/photosynth.jpg" alt="" title="photosynth" width="551" height="373" class="alignnone size-full wp-image-1035" /></td>
</tr>
<tr>
<td style="text-align: center;">PhotoSynth</td>
<td style="text-align: center;">Bundler</td>
</tr>
</tbody>
</table>
<h3>ETH-V3D Structure-and-Motion software:</h3>
<p><a href="http://www.inf.ethz.ch/personal/chzach/opensource.html">Project home</a> with source code (GPL; <a href="http://www.visual-experiments.com/2010/12/20/structure-from-motion-projects/comment-page-1/#comment-1824">I&#8217;ve partially ported</a> it to Windows)</p>
<h3>Simple Sparse Bundle Adjustment:</h3>
<p><a href="http://www.cs.unc.edu/~cmzach/opensource.html">Project home</a> with source code (LGPL; I&#8217;ve ported it to Windows)</p>
<h3>A multi-stage linear approach to structure from motion:</h3>
<p><a href="http://research.microsoft.com/en-us/um/redmond/groups/ivm/LinearSfm/">Project home</a> | <a href="http://research.microsoft.com/en-us/um/redmond/groups/ivm/LinearSfm/sinhaRMLE10_linearSfm.pdf">paper</a><br />
<img src="http://www.visual-experiments.com/blog/wp-content/uploads/2010/12/linearsfm.jpg" alt="" title="linearsfm" width="606" height="404" class="alignnone size-full wp-image-1124" /><br />
Results from the paper of <a href="http://research.microsoft.com/en-us/um/redmond/groups/ivm/LinearSfm/">LinearSFM</a> (Microsoft Research)</p>
<p>This list is not exhaustive; I&#8217;ve seen other projects (<a href="http://cvlab.epfl.ch/research/surface/emvs/">Efficient Large Scale Multi-View Stereo for Ultra High Resolution Image Sets</a>: I&#8217;m not sure how it relates to the ETH-V3D Structure-and-Motion software).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.visual-experiments.com/2010/12/20/structure-from-motion-projects/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Pose Estimation using SfM point cloud</title>
		<link>http://www.visual-experiments.com/2010/07/12/pose-estimation-using-sfm-point-cloud/</link>
		<comments>http://www.visual-experiments.com/2010/07/12/pose-estimation-using-sfm-point-cloud/#comments</comments>
		<pubDate>Mon, 12 Jul 2010 08:42:14 +0000</pubDate>
		<dc:creator>Henri</dc:creator>
				<category><![CDATA[augmented reality]]></category>
		<category><![CDATA[ogre3d]]></category>
		<category><![CDATA[bundler]]></category>
		<category><![CDATA[gpusurf]]></category>
		<category><![CDATA[pose estimation]]></category>
		<category><![CDATA[sift]]></category>
		<category><![CDATA[structure from motion]]></category>
		<guid isPermaLink="false">http://www.visual-experiments.com/?p=600</guid>
		<description><![CDATA[The idea of this pose estimator is based on PTAM (Parallel Tracking and Mapping). PTAM is capable of tracking in an unknown environment thanks to the mapping done in parallel. But in fact if you want to augment reality, it&#8217;s generally because you already know what you are looking at. So, being able to have [...]]]></description>
			<content:encoded><![CDATA[<p>The idea of this pose estimator is based on <a href="http://www.robots.ox.ac.uk/~gk/PTAM/">PTAM</a> <em>(Parallel Tracking and Mapping)</em>. PTAM is capable of tracking in an unknown environment thanks to the mapping done in parallel. But in fact, if you want to augment reality, it&#8217;s generally because you already know what you are looking at. So tracking that works in an unknown environment is not always needed. My idea was simple: <strong>instead of mapping in parallel, why not use SFM as a pre-processing step?</strong></p>
<table>
<tbody style="background-color: white">
<tr>
<td colspan="2"><img src="http://www.visual-experiments.com/blog/wp-content/uploads/2010/07/sfm.pose_.estimation.png" alt="" title="sfm.pose.estimation" width="571" height="258" class="alignnone size-full wp-image-621" /></td>
</tr>
<tr>
<td>input: point cloud + camera shot</td>
<td>output: position and orientation of the camera</td>
</tr>
</tbody>
</table>
<div style="height: 10px"></div>
<p>So my outdoor tracking algorithm will eventually work like this:</p>
<ul style="margin-left: 20px">
<li>pre-processing step
<ul style="margin-left: 20px">
<li>generate a point cloud of the outdoor scene you want to track using Bundler</li>
<li>create a binary file with a descriptor <em>(Sift/Surf)</em> per vertex of the point cloud</li>
</ul>
</li>
<li>in real-time, for each frame N:
<ul style="margin-left: 20px">
<li>extract features using <a href="http://mi.eng.cam.ac.uk/~er258/work/fast.html">FAST</a></li>
<li>match features against frame N-1 using 2D patches</li>
<li>compute the <strong>&#8220;relative pose&#8221;</strong> between frames N and N-1</li>
</ul>
</li>
<li>in almost real-time, for each &#8220;key frame&#8221;:
<ul style="margin-left: 20px">
<li>extract features and descriptors</li>
<li>match the descriptors against those of the point cloud</li>
<li>generate 2D/3D correspondences from the matches</li>
<li>compute the <strong>&#8220;absolute pose&#8221;</strong> using a PnP solver <em>(<a href="http://cvlab.epfl.ch/software/EPnP/">EPnP</a>)</em></li>
</ul>
</li>
</ul>
<p>The tricky part is that one absolute pose computation can take as long as several &#8220;relative pose&#8221; estimations. So once you&#8217;ve got the absolute pose, you&#8217;ll have to compensate for the delay by accumulating the relative poses computed in the meantime&#8230;</p>
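<p>A minimal sketch of this delay compensation, not the tracker&#8217;s actual code. It assumes that every pose is a 4x4 homogeneous matrix, that the absolute pose maps world coordinates to camera coordinates, and that each relative pose maps the camera coordinates of frame N-1 to those of frame N. The names <code>compose</code>, <code>translation</code> and <code>catch_up</code> are hypothetical.</p>

```python
# Sketch: bring a late absolute pose up to date with buffered relative poses.
# Assumption: every pose is a 4x4 row-major homogeneous matrix (list of lists).

def compose(a, b):
    """Return the matrix product a * b of two 4x4 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def translation(tx, ty, tz):
    """Build a pure-translation pose (only used by the usage example)."""
    return [[1.0, 0.0, 0.0, tx],
            [0.0, 1.0, 0.0, ty],
            [0.0, 0.0, 1.0, tz],
            [0.0, 0.0, 0.0, 1.0]]

def catch_up(absolute_pose_at_key, relative_poses_since_key):
    """Apply the relative poses (oldest first) that were measured while the
    absolute pose of the key frame was still being computed."""
    pose = absolute_pose_at_key
    for rel in relative_poses_since_key:
        # rel maps camera(N-1) to camera(N), so it is applied on the left
        pose = compose(rel, pose)
    return pose

# The key frame was grabbed 3 frames ago and the camera kept moving meanwhile.
absolute = translation(1.0, 0.0, 0.0)
buffered = [translation(0.0, 0.5, 0.0)] * 3
current = catch_up(absolute, buffered)
print([row[3] for row in current[:3]])  # prints [1.0, 1.5, 0.0]
```

<p>In a real tracker the relative poses would need to be buffered together with their frame index, so that only those newer than the key frame are applied.</p>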
<p>This is what I&#8217;ve got so far:</p>
<ul style="margin-left: 20px">
<li><strong>pre-processing step:</strong> binary file generated using SiftGPU (planning to move to my GPUSurf implementation) and Bundler (planning to move to <a href="http://insight3d.sourceforge.net/">Insight3D</a> or implement it myself using <a href="http://www.ics.forth.gr/~lourakis/sba/index.html">sba</a>)</li>
<li><strong>relative pose:</strong> I don&#8217;t have an implementation of the relative pose estimator</li>
<li><strong>absolute pose:</strong> it&#8217;s basically working but needs some improvements:
<ul style="margin-left: 20px">
<li>switch feature extraction/matching from Sift to Surf</li>
<li>remove unused descriptors to speed up the matching step (by scoring the descriptors used as inliers on training data)</li>
<li>use another PnP solver (or add RANSAC to handle outliers and get more accurate results)</li>
</ul>
</li>
</ul>
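<p>The descriptor pruning idea from the list above could look roughly like this. It is a sketch under assumptions, not an existing implementation: the point-cloud descriptors are identified by their index, each training image yields the list of descriptor indices that ended up as inliers of its pose estimate, and <code>prune_descriptors</code> and <code>min_hits</code> are hypothetical names.</p>

```python
from collections import Counter

def prune_descriptors(num_descriptors, inliers_per_training_image, min_hits=1):
    """Return the indices of the point-cloud descriptors that were used as an
    inlier in at least `min_hits` training images."""
    hits = Counter()
    for inlier_indices in inliers_per_training_image:
        # count each descriptor at most once per training image
        hits.update(set(inlier_indices))
    return [i for i in range(num_descriptors) if hits[i] >= min_hits]

# Hypothetical training run: 6 descriptors, inlier indices from 3 images.
training_inliers = [[0, 2, 3], [2, 3, 3], [3, 5]]
kept = prune_descriptors(6, training_inliers, min_hits=2)
print(kept)  # prints [2, 3]
```

<p>Only the kept descriptors would then be written to the binary file, which reduces the number of candidates to compare during matching.</p>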
]]></content:encoded>
			<wfw:commentRss>http://www.visual-experiments.com/2010/07/12/pose-estimation-using-sfm-point-cloud/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Structure From Motion Experiment</title>
		<link>http://www.visual-experiments.com/2010/07/08/structure-from-motion-experiment/</link>
		<comments>http://www.visual-experiments.com/2010/07/08/structure-from-motion-experiment/#comments</comments>
		<pubDate>Thu, 08 Jul 2010 22:05:25 +0000</pubDate>
		<dc:creator>Henri</dc:creator>
				<category><![CDATA[ogre3d]]></category>
		<category><![CDATA[photogrammetry]]></category>
		<category><![CDATA[bundler]]></category>
		<category><![CDATA[structure from motion]]></category>
		<guid isPermaLink="false">http://www.visual-experiments.com/?p=535</guid>
		<description><![CDATA[I have taken a new set of pictures of the &#8220;Porte Cailhau&#8221; in Bordeaux, and I used one of my tools (BundlerMatcher) to compute the image matching using SiftGPU. BundlerMatcher generates a file compatible with the Bundler match file format. So with BundlerMatcher you can skip the long pre-processing step of feature extraction and image matching and [...]]]></description>
			<content:encoded><![CDATA[<p>I have taken a new set of pictures of the &#8220;<a href="http://maps.google.com/maps?hl=en&#038;q=porte+cailhau&#038;ie=UTF8&#038;hq=porte+cailhau&#038;hnear=&#038;t=h&#038;z=16">Porte Cailhau</a>&#8221; in Bordeaux, and I used one of my tools (BundlerMatcher) to compute the image matching using <a href="http://www.cs.unc.edu/~ccwu/siftgpu/">SiftGPU</a>. BundlerMatcher generates a file compatible with the <a href="http://phototour.cs.washington.edu/bundler/">Bundler</a> match file format. So with BundlerMatcher you can skip the long pre-processing step of feature extraction and image matching and enjoy GPU acceleration!</p>
<p>I have used the &#8220;bundle.out&#8221; file produced by Bundler to get the camera information:</p>
<ul style="margin-left: 20px">
<li>intrinsic parameters: focal length, distortion</li>
<li>extrinsic parameters: position, orientation</li>
</ul>
<p>With this information you can view the point cloud from the viewpoint of one of the cameras registered by Bundler. I&#8217;ve added this feature to my current Ogre3D PlyReader. I have also added a background plane to display the picture taken from this viewpoint. This demo is not available for download right now, but you can still watch the video:</p>
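<p>For readers who want to extract the same camera parameters, here is a minimal sketch (not the PlyReader code). It assumes the &#8220;bundle.out&#8221; v0.3 layout described in the Bundler documentation: a header with the number of cameras and points, then for each camera one line with the focal length and two radial distortion coefficients, three lines for the rotation R and one line for the translation t. The camera position is then -R^T t. The function name <code>read_cameras</code> is hypothetical.</p>

```python
def read_cameras(bundle_text):
    """Parse the camera blocks of a Bundler 'bundle.out' (v0.3) file."""
    lines = [l for l in bundle_text.splitlines()
             if l.strip() and not l.startswith("#")]
    num_cameras = int(lines[0].split()[0])
    cameras = []
    for c in range(num_cameras):
        first = 1 + 5 * c  # each camera block spans 5 lines
        block = [[float(v) for v in row.split()]
                 for row in lines[first:first + 5]]
        focal, k1, k2 = block[0]
        rot, t = block[1:4], block[4]
        # camera centre in world coordinates: C = -R^T * t
        position = [-sum(rot[r][axis] * t[r] for r in range(3))
                    for axis in range(3)]
        cameras.append({"focal": focal, "distortion": (k1, k2),
                        "rotation": rot, "position": position})
    return cameras

# Made-up single-camera file, only to exercise the parser.
sample = """# Bundle file v0.3
1 0
700.0 -0.1 0.01
1 0 0
0 1 0
0 0 1
0.5 -2.0 3.0
"""
cam = read_cameras(sample)[0]
print(cam["focal"], cam["position"])  # prints 700.0 [-0.5, 2.0, -3.0]
```

<p>Bundler writes cameras it could not register as all-zero blocks, so a real reader would probably want to skip those.</p>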
<p><object width="425" height="344"><param name="movie" value="http://www.youtube.com/v/wTaZCa06NHQ&amp;hl=fr_FR&amp;fs=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/wTaZCa06NHQ&amp;hl=fr_FR&amp;fs=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="344"></embed></object></p>
<p>The Ogre3D PlyReader and BundlerMatcher will eventually be added to my SVN. I&#8217;m currently busy working on another demo, so stay tuned!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.visual-experiments.com/2010/07/08/structure-from-motion-experiment/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>GPU-Surf video demo</title>
		<link>http://www.visual-experiments.com/2010/06/25/gpu-surf-video-demo/</link>
		<comments>http://www.visual-experiments.com/2010/06/25/gpu-surf-video-demo/#comments</comments>
		<pubDate>Fri, 25 Jun 2010 12:29:20 +0000</pubDate>
		<dc:creator>Henri</dc:creator>
				<category><![CDATA[ogre3d]]></category>
		<category><![CDATA[gpusurf]]></category>
		<category><![CDATA[structure from motion]]></category>
		<guid isPermaLink="false">http://www.visual-experiments.com/?p=454</guid>
		<description><![CDATA[In the previous post I announced the first release of GPU-Surf. Now I&#8217;m glad to show you a live video demo of GPU-Surf and another demo using Bundler (a structure from motion tool): There are three demos in this video: GPU-Surf live demo. PlyReader displaying Notre-Dame dataset. PlyReader displaying my own dataset (Place de la Bourse, Bordeaux). [...]]]></description>
			<content:encoded><![CDATA[<p>In the <a href="http://www.visual-experiments.com/2010/06/23/gpusurf-and-ogregpgpu/">previous post</a> I announced the first release of <a href="http://www.visual-experiments.com/demos/gpusurf/">GPU-Surf</a>. Now I&#8217;m glad to show you a live video demo of GPU-Surf and another demo using <a href="http://phototour.cs.washington.edu/bundler/">Bundler</a> (a structure from motion tool):</p>
<p><object width="425" height="344"><param name="movie" value="http://www.youtube.com/v/lKQZaqG8yJc&#038;hl=fr&#038;fs=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/lKQZaqG8yJc&#038;hl=fr&#038;fs=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="344"></embed></object></p>
<p>There are three demos in this video:</p>
<ol>
<li>GPU-Surf live demo.</li>
<li>PlyReader displaying Notre-Dame dataset.</li>
<li>PlyReader displaying my own dataset (<a href="http://maps.google.fr/maps?cid=16664510742528689060&#038;q=place+de+la+bourse&#038;ved=0CEcQ2wU&#038;ei=WX0kTPCyLd-5jAe4reXTCA&#038;ie=UTF8&#038;hq=place+de+la+bourse&#038;hnear=&#038;ll=44.841576,-0.569524&#038;spn=0.003534,0.006089&#038;t=h&#038;z=18">Place de la Bourse, Bordeaux</a>).</li>
</ol>
<h3>GPU-Surf</h3>
<p>You&#8217;ll get more information on the <a href="http://www.visual-experiments.com/demos/gpusurf/">dedicated demo section</a>.<br />
In this video GPU-Surf was running slowly because of Ogre::Canvas, but it should normally run much faster.</p>
<h3>PlyReader displaying Notre-Dame dataset</h3>
<p>I&#8217;m also interested in <a href="http://en.wikipedia.org/wiki/Structure_from_motion">structure from motion</a> algorithms, which is why I tested <a href="http://phototour.cs.washington.edu/bundler/">Bundler</a>, which comes with a <a href="http://phototour.cs.washington.edu/datasets/">good dataset of Notre-Dame de Paris</a>.</p>
<p>I have created a very simple PlyReader using Ogre3D. The first version used billboards to display the point cloud, but it was slow (30fps with 130k points). Now I&#8217;m using a custom vertex buffer and it runs at 800fps with 130k points.</p>
<p>The reconstruction was done by the team who created Bundler from 715 pictures of Notre-Dame de Paris (thanks to Flickr). In fact, they did most of the work for this demo; I just grabbed their output to check whether my PlyReader was capable of reading such a big file.</p>
<h3>PlyReader displaying my own dataset</h3>
<p>If you have already used Bundler, you know that structure from motion algorithms need a very slow pre-processing step to get &#8220;matches&#8221; between the pictures of the dataset. Bundler is packaged to use <a href="http://www.cs.ubc.ca/~lowe/keypoints/">Lowe&#8217;s Sift binary</a>, but it&#8217;s very slow because it takes PGM pictures as input and writes its output to a text file. Then a matching step is executed using KeyMatchFull.exe, which is optimized using libANN but is still very slow.</p>
<p>I have replaced the feature extraction and matching steps with my own tool: BundlerMatcher. It uses <a href="http://www.cs.unc.edu/~ccwu/siftgpu/">SiftGPU</a>, which gives a very nice speed-up. My current implementation of GPU-Surf isn&#8217;t complete, so I can&#8217;t use it instead of SiftGPU yet, but that is my intention.</p>
<table>
<tbody style="background-color: white">
<tr>
<td colspan="2"><img src="http://www.visual-experiments.com/blog/wp-content/uploads/2010/06/place-de-la-bourse.jpg" alt="" title="place-de-la-bourse" width="590" height="285" class="alignnone size-full wp-image-457" /></td>
</tr>
<tr>
<td>23 pictures taken with a classic camera <br />(Canon Powershot A700)</td>
<td>Point cloud generated using <a href="http://phototour.cs.washington.edu/bundler/">Bundler</a></td>
</tr>
</tbody>
</table>
<div style="height: 20px"></div>
<p>I have created this dataset with my camera and matched the pictures using my own tool: BundlerMatcher. This tool creates the same .key file as Lowe&#8217;s Sift tool and creates a matches.txt file that is used by Bundler. I have tried to get rid of this temporary .key file and keep everything in memory, but changing the Bundler code to handle this was harder than I predicted&#8230; I&#8217;m now more interested in the insight3d implementation (<a href="http://insight3d.sourceforge.net/">presentation</a>, <a href="http://sourceforge.net/projects/insight3d/">source</a>), which seems to be easier to hack on.</p>
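<p>To give an idea of what such a match file contains, here is a minimal sketch and not BundlerMatcher&#8217;s code. It assumes that the file lists, for every matched image pair, one line with the two image indices, one line with the number of matches, and then one line per match holding the two keypoint indices. That layout is an assumption that should be checked against the matcher shipped with your Bundler version, and <code>format_matches</code> is a hypothetical name.</p>

```python
def format_matches(pair_matches):
    """Serialize {(img_a, img_b): [(key_a, key_b), ...]} to match-file text."""
    out = []
    for (img_a, img_b), matches in sorted(pair_matches.items()):
        out.append(f"{img_a} {img_b}")      # indices of the two images
        out.append(str(len(matches)))       # number of matches for this pair
        out.extend(f"{ka} {kb}" for ka, kb in matches)  # keypoint index pairs
    return "\n".join(out) + "\n"

# Two made-up image pairs with keypoint index pairs.
text = format_matches({(0, 1): [(12, 40), (13, 7)], (0, 2): [(5, 5)]})
print(text, end="")
```

<p>The keypoint indices refer to the order of the features in each image&#8217;s .key file, which is why both files have to be produced by the same extraction run.</p>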
]]></content:encoded>
			<wfw:commentRss>http://www.visual-experiments.com/2010/06/25/gpu-surf-video-demo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
