<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-5778844313309310715</id><updated>2011-11-27T23:26:17.777Z</updated><category term='LOC'/><category term='lines of code'/><category term='LOCC'/><category term='Nvidia'/><category term='Sun'/><category term='cmake'/><category term='Tesla'/><category term='vi'/><category term='risk analysis'/><category term='ray tracing'/><category term='books'/><category term='software engineering'/><category term='reference'/><category term='Fermi'/><category term='GPU gems'/><category term='CUDA'/><category term='video editing'/><category term='vim'/><category term='GT200'/><category term='HPC'/><category term='multi-core'/><category term='CLOC'/><title type='text'>Programming in CUDA</title><subtitle type='html'>Discussions on all things CUDA</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>27</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-7908289118405449096</id><published>2011-08-11T13:32:00.000+01:00</published><updated>2011-08-11T13:57:55.909+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='CUDA'/><category scheme='http://www.blogger.com/atom/ns#' term='LOCC'/><category scheme='http://www.blogger.com/atom/ns#' term='LOC'/><category scheme='http://www.blogger.com/atom/ns#' term='lines of code'/><category scheme='http://www.blogger.com/atom/ns#' term='CLOC'/><title type='text'>Lines of code counter for cuda</title><content type='html'>After a quick search I didn't find a fully baked version which had .cu line counting. I looked briefly at:&lt;br /&gt;&lt;a href="http://cloc.sourceforge.net/"&gt;CLOC&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.sonarsource.org/"&gt;Sonar&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.cmcrossroads.com/bradapp/clearperl/sclc.html"&gt;sclc&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.dwheeler.com/sloccount/"&gt;SLOCCount&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;None had cuda support out of the box but CLOC allowed you to define your own language support.&lt;br /&gt;CLOC requires perl so I installed&amp;nbsp;&lt;a href="http://www.activestate.com/activeperl/downloads"&gt;ActiveState ActivePerl 5.12.4&lt;/a&gt;&amp;nbsp;to run on windows.&lt;br /&gt;I then downloaded &lt;a href="http://sourceforge.net/projects/cloc/files/cloc/v1.53/cloc-1.53.exe/download"&gt;CLOC-1.5.3.exe&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Next step is to define support for cuda. This was very straightforward.&lt;br /&gt;run&lt;br /&gt;&lt;span class="Apple-style-span" style="background-color: #444444; color: cyan;"&gt;cloc.exe --write-lang-def=&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; font-size: 14px; line-height: 18px;"&gt;&amp;lt;&lt;/span&gt;your definition file&lt;span class="Apple-style-span" style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; font-size: 14px; line-height: 18px;"&gt;&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span class="Apple-style-span"&gt;then edit&amp;nbsp;&lt;your definition="" file=""&gt;&amp;nbsp;&lt;/your&gt;&lt;span class="Apple-style-span" style="background-color: #444444; color: cyan;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="background-color: #444444; color: cyan;"&gt;&lt;span class="Apple-style-span" style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; font-size: 14px; line-height: 18px;"&gt;&amp;lt;&lt;/span&gt;your definition file&lt;span class="Apple-style-span" style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; font-size: 14px; line-height: 18px;"&gt;&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span"&gt;&amp;nbsp;to include a definition for cuda.&lt;/span&gt;&lt;br /&gt;The CLOC howto on this is &lt;a href="http://cloc.sourceforge.net/#custom_lang"&gt;here&lt;/a&gt;&lt;br /&gt;The lines I added were:&lt;br /&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;Cuda&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;&amp;nbsp; &amp;nbsp; filter remove_matches ^\s*//&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;&amp;nbsp; &amp;nbsp; filter remove_inline //.*$&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;&amp;nbsp; &amp;nbsp; filter call_regexp_common C&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;&amp;nbsp; &amp;nbsp; extension cu&lt;/span&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="color: blue;"&gt;&amp;nbsp; &amp;nbsp; 3rd_gen_scale 1.51&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;To run cloc with your updated definition file:&lt;br /&gt;&lt;span class="Apple-style-span" style="background-color: #444444; color: cyan;"&gt;cloc.exe --read-lang-def=&amp;nbsp;&lt;your definition="" file=""&gt;&lt;folders files="" list="" of="" or=""&gt;&lt;/folders&gt;&lt;/your&gt;&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; font-size: 14px; line-height: 18px;"&gt;&amp;lt;&lt;/span&gt;your definition file&lt;span class="Apple-style-span" style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; font-size: 14px; line-height: 18px;"&gt;&amp;gt;&lt;/span&gt;&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; font-size: 14px; line-height: 18px;"&gt;&amp;lt;&lt;/span&gt;list of files/folders&lt;span class="Apple-style-span" style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; font-size: 14px; line-height: 18px;"&gt;&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;You will then have an output similar to this:&lt;br /&gt;&lt;br /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;4 text files.&lt;br /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;3 unique files.&lt;br /&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1 file ignored.&lt;br /&gt;&lt;br /&gt;http://cloc.sourceforge.net v 1.53 &amp;nbsp;T=1.0 s (3.0 files/s, 139.0 lines/s)&lt;br /&gt;-------------------------------------------------------------------------------&lt;br /&gt;Language &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; files &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;blank &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;comment &amp;nbsp; &amp;nbsp;code&lt;br /&gt;-------------------------------------------------------------------------------&lt;br /&gt;C++ &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 19 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 51&lt;br /&gt;Cuda &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 13 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 47&lt;br /&gt;C/C++ Header &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;0 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 0 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 5&lt;br /&gt;-------------------------------------------------------------------------------&lt;br /&gt;SUM: &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;3 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 32 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;4 &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;103&lt;br /&gt;-------------------------------------------------------------------------------&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;This has not been extensively tested yet.&lt;br /&gt;As a quick check you can output a version of processed files with&amp;nbsp;comments/white spaces removed&lt;br /&gt;&lt;span class="Apple-style-span" style="background-color: #444444; color: cyan;"&gt;&lt;span class="Apple-style-span"&gt;cloc.exe -strip-comments=uncom&amp;nbsp;--read-lang-def=&lt;span class="Apple-style-span" style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; font-size: 14px; line-height: 18px;"&gt;&amp;lt;&lt;/span&gt;your definition file&amp;nbsp;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; font-size: 14px; line-height: 18px;"&gt;&amp;gt;&lt;/span&gt;&lt;span class="Apple-style-span"&gt;&amp;nbsp;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; font-size: 14px; line-height: 18px;"&gt;&amp;lt;&lt;/span&gt;&lt;span class="Apple-style-span"&gt;list of files/folders&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Consolas, Menlo, Monaco, 'Lucida Console', 'Liberation Mono', 'DejaVu Sans Mono', 'Bitstream Vera Sans Mono', 'Courier New', monospace, serif; font-size: 14px; line-height: 18px;"&gt;&amp;gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;Which will output uncommented versions of your processed files with an additional .uncom extension&lt;br /&gt;You can then compare the correctness of parsing using these files.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-7908289118405449096?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/7908289118405449096/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2011/08/lines-of-code-counter-for-cuda.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/7908289118405449096'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/7908289118405449096'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2011/08/lines-of-code-counter-for-cuda.html' title='Lines of code counter for cuda'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-1881706691711662421</id><published>2011-04-04T13:21:00.000+01:00</published><updated>2011-04-04T13:35:38.284+01:00</updated><title type='text'>Streaming multimedia to an xbox 360 from a laptop on the same subnet  with UPC EPC2425 Cisco router in Ireland</title><content type='html'>xbox 360 has support for streaming multimedia from a laptop using windows media player 11. Unfortunately, the EPC2425 UPC setup does not support multicasting and therefore windows media player streaming out of the box will not work.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;A workaround is to download &lt;a href="http://tversity.com"&gt;Tversity&lt;/a&gt; free version. You may run into problems starting the Tversity Multimedia server and timeouts.&lt;/div&gt;&lt;div&gt;This is due to conflicts with the flexnet licencing serve and acroread  service which can be disabled using services.msc executed from start-&amp;gt;run.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-1881706691711662421?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/1881706691711662421/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2011/04/streaming-multimedia-to-xbox-360-from.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/1881706691711662421'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/1881706691711662421'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2011/04/streaming-multimedia-to-xbox-360-from.html' title='Streaming multimedia to an xbox 360 from a laptop on the same subnet  with UPC EPC2425 Cisco router in Ireland'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-2218811459565250972</id><published>2009-12-16T19:06:00.000Z</published><updated>2009-12-16T19:56:14.819Z</updated><category scheme='http://www.blogger.com/atom/ns#' term='CUDA'/><category scheme='http://www.blogger.com/atom/ns#' term='Fermi'/><category scheme='http://www.blogger.com/atom/ns#' term='Tesla'/><title type='text'>Introducing new Tesla Fermi based products webinar review</title><content type='html'>Just after attending Nvidia's "Introducing new Tesla Fermi based products" webinar.&lt;br /&gt;It was light on detail and was more marketing than technical but I did pick up some nuggets of info.&lt;br /&gt;Current planned release for Tesla based Fermi products is Q2 and Q3.&lt;br /&gt;In the Tesla personal supercomputer market current product been C1060,&lt;br /&gt;the Tesla C2050 will be available in Q2: 520 -630 GFlops DP, 3GB mem with ECC,&lt;br /&gt;and C2070 in Q3: 520 -630 GFlops DP, 6GB mem with ECC.&lt;br /&gt;In the Tesla Datacenter product line, current model been the S1070&lt;br /&gt;the Teslo S2060 will be available in Q2: 2.1 -2.5 TFlops DP, 3GB mem with ECC,&lt;br /&gt;and S2070 in Q3: 2.1 -2.5 GFlops DP, 6GB mem with ECC.&lt;br /&gt;&lt;br /&gt;There was some information on faster infiniband with DMA between the Mellamox Infiniband and Fermi chips.&lt;br /&gt;&lt;br /&gt;Regarding ECC memory performance profiling they are seeing a 0-10% drop in memory bandwidth with ECC and no ECC support on GeForce product range (as expected).&lt;br /&gt;GDDR5 is about 2X faster than GDDR3.&lt;br /&gt;&lt;br /&gt;Most other aspects ddiscussed is available in greater detail in the &lt;a href="http://programmingcuda.blogspot.com/2009/10/nvidia-fermi-whitepaper.html"&gt;Nvidia Fermi Whitepaper&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-2218811459565250972?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/2218811459565250972/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/12/introducing-new-tesla-fermi-based.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/2218811459565250972'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/2218811459565250972'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/12/introducing-new-tesla-fermi-based.html' title='Introducing new Tesla Fermi based products webinar review'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-8472418037933219456</id><published>2009-12-16T17:21:00.001Z</published><updated>2009-12-16T17:21:28.186Z</updated><title type='text'>MIT and Harvard researchers demonstrate a better way for computers to see</title><content type='html'>&lt;div xmlns='http://www.w3.org/1999/xhtml'&gt;&lt;p&gt;The video and academic source article are well worth the time. The high throughput screening methodology like all many good ideas is extremely simple and highlights the importance of how clever quantitive research can be a winner when compared to restricted qulaitive research.&lt;/p&gt;in reference to: &lt;a href='http://web.mit.edu/press/2009/visual-systems.html'&gt;MIT and Harvard researchers demonstrate a better way for computers to ‘see’&lt;/a&gt; (&lt;a href='http://www.google.com/sidewiki/entry/francieflannery/id/6YgyTifStP7mPHP-gOl5SbN0SpE'&gt;view on Google Sidewiki&lt;/a&gt;)&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-8472418037933219456?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/8472418037933219456/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/12/mit-and-harvard-researchers-demonstrate.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/8472418037933219456'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/8472418037933219456'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/12/mit-and-harvard-researchers-demonstrate.html' title='MIT and Harvard researchers demonstrate a better way for computers to see'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-2529613437661621736</id><published>2009-12-16T16:32:00.001Z</published><updated>2009-12-16T16:32:52.025Z</updated><title type='text'>Coordinating the use of GPU and CPU for improving performance of compute intensive applications</title><content type='html'>&lt;div xmlns='http://www.w3.org/1999/xhtml'&gt;&lt;p&gt;Just finished reading this paper. It looks at the allocation of hetrogenous CPU/GPU resources to solve a classification problem.&lt;br/&gt;It uses and extends an event driven parallel framework called Anthill. &lt;br/&gt;The concepts used are well known and well understood parallel models. It uses streams to label data streams through data filter functions. The functions can be implemented on the CPU and/or GPU and the user can ask for filter function weighting to be used i.e. GPU filter function is 3x CPU filter. &lt;br/&gt;The candidate example application of Neuroblastoma image analysis displays some of the power of this type of technique.&lt;br/&gt;The parallel model allows:&lt;br/&gt;* Data parallelism&lt;br/&gt;* Task parallelism&lt;br/&gt;* aschrony. I don't necessarily agree with the aschrony concept. As I would consider it a translation of a recursive type algorithm to something simailar a split and search. Where the arbitrary splitting may lead to redundancy in the search space which is hidden due to the increased parallel throughput.&lt;br/&gt;&lt;br/&gt;There is also very minimal discussion on the memory management which I would consider the main complication of hetrogenous systems. Nothin is mentioned on whether the host--&amp;gt;device transactions are synchronous/asynchronous and do they depend on the compute mode of the GPU used.&lt;br/&gt;&lt;br/&gt;The overall programmming model reminds me of a DSP model and although leads itself nicely to classifier and data mining applications. I am not sure of its transferability to a wider application domain.&lt;br/&gt;In addition, no discussion is given on the developer cost in developing multiple implementation for CPU/GPU versus performance improvement.&lt;/p&gt;in reference to: &lt;a href='http://gpgpu.org/2009/12/08/teodoro-coordinating-gpu-and-cpu'&gt;Coordinating the Use of GPU and CPU for Improving Performance of Compute Intensive Applications :: GPGPU.org&lt;/a&gt; (&lt;a href='http://www.google.com/sidewiki/entry/francieflannery/id/v_v_q1qwXDnCVh7hraa4rEbjxCw'&gt;view on Google Sidewiki&lt;/a&gt;)&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-2529613437661621736?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/2529613437661621736/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/12/coordinating-use-of-gpu-and-cpu-for.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/2529613437661621736'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/2529613437661621736'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/12/coordinating-use-of-gpu-and-cpu-for.html' title='Coordinating the use of GPU and CPU for improving performance of compute intensive applications'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-5269133240157524656</id><published>2009-12-16T14:16:00.001Z</published><updated>2009-12-16T14:16:13.472Z</updated><title type='text'>Third Workshop on General-Purpose Computation on Graphics Processing units</title><content type='html'>&lt;div xmlns='http://www.w3.org/1999/xhtml'&gt;&lt;p&gt;I won't be able to attend this myself but may be of interest to others.&lt;br/&gt;GPGPU-3, 14th March in Pittsburgh&lt;br/&gt;Papers due: January 15, 2010&lt;/p&gt;in reference to: &lt;p&gt;&lt;blockquote&gt;"Papers due: January 15, 2010"&lt;br/&gt;- &lt;a href='http://www.ece.neu.edu/groups/nucar/GPGPU/'&gt;Workshop on General Purpose Processing on Graphics Processing Units&lt;/a&gt; (&lt;a href='http://www.google.com/sidewiki/entry/104967752191725162142/id/FjAas9GZZ-ya7hasB27wVIO1JHA'&gt;view on Google Sidewiki&lt;/a&gt;)&lt;/blockquote&gt;&lt;/p&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-5269133240157524656?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/5269133240157524656/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/12/third-workshop-on-general-purpose.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/5269133240157524656'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/5269133240157524656'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/12/third-workshop-on-general-purpose.html' title='Third Workshop on General-Purpose Computation on Graphics Processing units'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-2427343161824691154</id><published>2009-12-16T14:09:00.001Z</published><updated>2009-12-16T14:09:56.080Z</updated><title type='text'>Frontiers of GPU, Multi- and Many-Core Systems Workshop</title><content type='html'>&lt;div xmlns='http://www.w3.org/1999/xhtml'&gt;&lt;p&gt;Workshop on in Melbourne, Australia  from the 17th-20 May, 2010. Paper submission 31st December.&lt;br/&gt;Not too sure if I'll be able to attend myself but looks very interesting. Looking at cost effective translation of applications across differing parallel architectures is an admirable but challenging goal. &lt;br/&gt;Let me know if anyone is planning on attending this or has previousily attended any of the related workshops.&lt;/p&gt;in reference to: &lt;a href='http://salsahpc.indiana.edu/FGMMS2010/'&gt;Salsa Workshop&lt;/a&gt; (&lt;a href='http://www.google.com/sidewiki/entry/104967752191725162142/id/p5XsyybE_9tz2xsUs79t5I39EnI'&gt;view on Google Sidewiki&lt;/a&gt;)&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-2427343161824691154?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/2427343161824691154/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/12/frontiers-of-gpu-multi-and-many-core.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/2427343161824691154'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/2427343161824691154'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/12/frontiers-of-gpu-multi-and-many-core.html' title='Frontiers of GPU, Multi- and Many-Core Systems Workshop'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-4927790018743964309</id><published>2009-12-02T15:39:00.001Z</published><updated>2009-12-02T15:39:19.995Z</updated><title type='text'>DirectX 11 used in Colin McRae 2</title><content type='html'>&lt;div xmlns='http://www.w3.org/1999/xhtml'&gt;&lt;p&gt;I've became very jealous of those with an ATI Radeon Series GPU after watching the video in the referenced article. &lt;br/&gt; The full game is not released (actually just released today I think) but there is a demo link in the article for those who want to have a real taster. Interestingly the main conclusion in the article was that the DirectX11 functionality in the demo hurt frame rate by 30% and didn't give any appreciable improvements. Still roll on DirectX 11 support in Nvidia's upcoming fermi chip and games titles with DirectX11 built in for the bottom up.This will be when we will be really able to rate the performance of DirectX11 functionality. &lt;br/&gt;Also a big question is what version of DirectX will be in the next generation of consoles, as many PC games these days are console ports and therefore mainly DirectX 9 derived.  As the next gen consoles are more than likely going to be locked down sometime next year, and release around 2012-2013 then DirectX 11 looks like the prime candidate.&lt;br/&gt; &lt;br/&gt;From a technical perspective, the game Dirt 2 uses Directx 11 in five situations:&lt;br/&gt;* Hardware tesselated dynamic water, animated crowd and dynamic clothing.&lt;br/&gt;* Full floating point HDR lighting, otherwise known as high dynamic range rendering (HDRR or HDR Rendering) carries out light calculations in a higher dynamic range. Therefore bright things can be very dark, light things very bright and details can be seen in both instances. &lt;br/&gt;* High Definition Ambient Occlusion&lt;br/&gt;* Full screen resolution post processing.&lt;br/&gt;&lt;br/&gt;All of these features look extremely useful and I can't wait for a Nvidia DirectX 11 chip so I can spend a bit of time playing around with them.&lt;/p&gt;in reference to: &lt;a href='http://www.bit-tech.net/gaming/pc/2009/12/01/directx-11-performance-first-look-dirt-2/1?utm_source=feedburner&amp;amp;utm_medium=feed&amp;amp;utm_campaign=Feed%3A+bit-tech%2Fall+%28bit-tech.net+feed%29&amp;amp;utm_content=Google+International'&gt;&lt;a href='http://bit-tech.net'&gt;bit-tech.net&lt;/a&gt; | Feature - DirectX 11 performance first look: Dirt 2&lt;/a&gt; (&lt;a href='http://www.google.com/sidewiki/entry/104967752191725162142/id/t0DZXGOvEddZQrxQnJIsgUu7NbU'&gt;view on Google Sidewiki&lt;/a&gt;)&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-4927790018743964309?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/4927790018743964309/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/12/directx-11-used-in-colin-mcrae-2.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/4927790018743964309'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/4927790018743964309'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/12/directx-11-used-in-colin-mcrae-2.html' title='DirectX 11 used in Colin McRae 2'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-8002443918005491232</id><published>2009-11-30T14:02:00.001Z</published><updated>2009-11-30T14:02:25.799Z</updated><title type='text'>Interview article with Jen-Hsun Huang</title><content type='html'>&lt;div xmlns='http://www.w3.org/1999/xhtml'&gt;&lt;p&gt;Jen-Hsun Huang is the Co-founder and CEO of Nvidia. In the referenced article he gives an an interview to www.tbreak,com&lt;br/&gt;&lt;br/&gt;I liked his advice on business plan development as I am currently working on this. &lt;br/&gt;"when I started writing NVIDIA’s Business Plan and after three months of rephrasing and making it sound good, I realized that if I kept working on this, the market opportunity will pass us. I knew what we believed so I just wrote the first principals and list of actions that are basically strategies on what we believe in and the resources we have. To this day, NVIDIA’s business planing is exactly like that- Short and crisp."&lt;br/&gt;&lt;br/&gt;A key learning is the way a key architectural decision (limited use of high cost memory) led to initial success but when the price point of memory changed.&lt;br/&gt;&lt;br/&gt;His take on the difference of strategy and technology. Two companies can have two similar products but widely varying strategies. Within Nvidia he defines strategies as "things like how do you build a product, how do you pace it, how do you work with developers, what are the things you want to be really good at and lesser-good at etc."&lt;br/&gt;&lt;br/&gt;The strategies used by Nvidia to bring geometry processing and SLI to market when large players such as microsoft, OEMs and Intel were not happy about the technologies.&lt;br/&gt;&lt;br/&gt;He also gives his take on future direction: ION on the smartphone, Tesla in the cloud, gaming with physx, raytracing and 3D.&lt;/p&gt;in reference to: &lt;a href='http://tech.tbreak.com/2009/11/nvidia-past-present-future-an-interview-with-jen-hsun-huang/'&gt;An interview with NVIDIA's CEO Jen Hsun Huang | t-break: Tech @ Its Fastest&lt;/a&gt; (&lt;a href='http://www.google.com/sidewiki/entry/104967752191725162142/id/FSZOxm2RirLzKAVG-5T0FEC49Pk'&gt;view on Google Sidewiki&lt;/a&gt;)&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-8002443918005491232?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/8002443918005491232/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/11/interview-article-with-jen-hsun-huang.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/8002443918005491232'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/8002443918005491232'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/11/interview-article-with-jen-hsun-huang.html' title='Interview article with Jen-Hsun Huang'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-9140037766259102838</id><published>2009-11-26T18:21:00.001Z</published><updated>2009-11-26T18:21:09.872Z</updated><title type='text'>PyCUDA: GPU Run-Time Code Generation for High-Performance Computing</title><content type='html'>&lt;div xmlns='http://www.w3.org/1999/xhtml'&gt;&lt;p&gt;Have yet to use pyCUDA but hope to get around to using it at some stage maybe for some rapid prototyping. &lt;br/&gt;The reference article gives some nice technical detail on the use of run-time code generation (RTCG) for the GPU and the use of pyCUDA.&lt;/p&gt;in reference to: &lt;p&gt;&lt;blockquote&gt;"PyCUDA: GPU Run-Time Code Generation for High-Performance Computing"&lt;br/&gt;- &lt;a href='http://arxiv.org/abs/0911.3456'&gt;[0911.3456] PyCUDA: GPU Run-Time Code Generation for High-Performance Computing&lt;/a&gt; (&lt;a href='http://www.google.com/sidewiki/entry/104967752191725162142/id/3CbGrY8kddlYjZYwmAAMM6HJCkU'&gt;view on Google Sidewiki&lt;/a&gt;)&lt;/blockquote&gt;&lt;/p&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-9140037766259102838?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/9140037766259102838/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/11/pycuda-gpu-run-time-code-generation-for.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/9140037766259102838'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/9140037766259102838'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/11/pycuda-gpu-run-time-code-generation-for.html' title='PyCUDA: GPU Run-Time Code Generation for High-Performance Computing'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-3353981005630326852</id><published>2009-11-26T16:18:00.001Z</published><updated>2009-11-26T16:18:13.152Z</updated><title type='text'>A fast two-dimensional floodplain inundation model</title><content type='html'>&lt;div xmlns='http://www.w3.org/1999/xhtml'&gt;&lt;p&gt;Looks like a very nice application for CUDA and has particular resonance with me currently due to Irelands current record breaking floods. I have yet to find the paper available online though.&lt;/p&gt;in reference to: &lt;a href='http://gpgpu.org/2009/11/25/fast-2d-floodplain-inundation-model'&gt;A fast two-dimensional floodplain inundation model :: GPGPU.org&lt;/a&gt; (&lt;a href='http://www.google.com/sidewiki/entry/104967752191725162142/id/g4xxRTnRP2yyLVvI1YLRSRkgFA8'&gt;view on Google Sidewiki&lt;/a&gt;)&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-3353981005630326852?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/3353981005630326852/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/11/fast-two-dimensional-floodplain.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/3353981005630326852'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/3353981005630326852'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/11/fast-two-dimensional-floodplain.html' title='A fast two-dimensional floodplain inundation model'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-5430365354616354601</id><published>2009-11-26T16:13:00.001Z</published><updated>2009-11-26T16:13:19.411Z</updated><title type='text'>Nice article on MIT algorithm lectures</title><content type='html'>&lt;div xmlns='http://www.w3.org/1999/xhtml'&gt;&lt;p&gt;Goes through the complete lecture set of MIT's introduction to algorithms lecture course.&lt;/p&gt;in reference to: &lt;a href='http://www.catonmat.net/blog/summary-of-mit-introduction-to-algorithms/'&gt;Summary of all the MIT Introduction to Algorithms lectures - good coders code, great reuse&lt;/a&gt; (&lt;a href='http://www.google.com/sidewiki/entry/104967752191725162142/id/KA70IpMYI1gn-eKdQhMoI9MVdH8'&gt;view on Google Sidewiki&lt;/a&gt;)&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-5430365354616354601?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/5430365354616354601/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/11/nice-article-on-mit-algorithm-lectures.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/5430365354616354601'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/5430365354616354601'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/11/nice-article-on-mit-algorithm-lectures.html' title='Nice article on MIT algorithm lectures'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-2866101129751440066</id><published>2009-11-26T16:05:00.001Z</published><updated>2009-11-26T16:05:38.636Z</updated><title type='text'>End of Line for IBM's cell</title><content type='html'>&lt;div xmlns='http://www.w3.org/1999/xhtml'&gt;&lt;p&gt;IBM's VP of Deep Computing, David Turek, has stated that the Cell processor is at the end of its evolutionary line. &lt;br/&gt;While many of the concepts are planned to live on in other IBM product offerings the cell product will not be developed further.&lt;br/&gt;This has been clarified by an IBM sokesman here &lt;a href='http://www.driverheaven.net/news.php?newsid=344'&gt;http://www.driverheaven.net/&lt;wbr/&gt;news.php?newsid=344&lt;/a&gt; &lt;br/&gt;Only one development cycle has been halted for one product in the Cell processor family the PowerXCell-8i cpu. So whether the Cell processor will be used in playstation 4 is still open to debate.&lt;br/&gt;&lt;br/&gt;The main sticking point for the cell has been it's programmability. It has the name of a programming model that is considered outright hostel by many developers.&lt;br/&gt;&lt;br/&gt;The future of the heterogenous single chip multiprocessors (CMP) does not end with the Cell with a large number of companies trending towards this model.&lt;br/&gt; &lt;br/&gt;The Nvidia CUDA model shows a model that works well for programmability and with the future release of extended functionality in the fermi architecture. The future looks interesting. In addition, the rumour mill is still full of talk regarding a Nvidia x86 type chip in the future.&lt;br/&gt;&lt;br/&gt;Then we have the planned fusion of the CPU and GPU in next generation offers from Intel and AMD. In addition, we have the planned release of Larrabbe with vectorised x86 based multicore architecture. Oficially, this product is not planned for high end graphics but will be an interesting and potentially very adaptable product. &lt;br/&gt;&lt;br/&gt;It will now be interesting to see what the playstation 4 and the future generations of HPC will be based on Intel, Nvidia, IBM and AMD or someone else?&lt;/p&gt;in reference to: &lt;a href='http://arstechnica.com/hardware/news/2009/11/end-of-the-line-for-ibms-cell.ars'&gt;End of the line for IBM's Cell&lt;/a&gt; (&lt;a href='http://www.google.com/sidewiki/entry/104967752191725162142/id/8rw-CJYd4YFIQ9N--_Q1HoOjAoE'&gt;view on Google Sidewiki&lt;/a&gt;)&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-2866101129751440066?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/2866101129751440066/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/11/end-of-line-for-ibm-cell.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/2866101129751440066'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/2866101129751440066'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/11/end-of-line-for-ibm-cell.html' title='End of Line for IBM&amp;#39;s cell'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-7590283089612187182</id><published>2009-10-15T20:07:00.000+01:00</published><updated>2009-10-15T20:23:34.618+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Fermi'/><title type='text'>Nvidia Fermi Whitepaper</title><content type='html'>Got a chance to go through the Nvidia Fermi whitepaper available &lt;a href="http://www.nvidia.com/content/PDF/fermi_white_papers/NVIDIA_Fermi_Compute_Architecture_Whitepaper.pdf"&gt;here&lt;/a&gt;.&lt;br /&gt;This really looks like a very exciting architecture with some extremely nice features.&lt;br /&gt;&lt;br /&gt;Heres a list of some that I hope to hope to go into more detail in later posts.&lt;br /&gt;potentially 8x faster double precision&lt;br /&gt;ECC support&lt;br /&gt;64 KB RAM with configurable partitioning for L1 and shared memory &lt;br /&gt;A L2 cache&lt;br /&gt;Faster atomic operations&lt;br /&gt;Faster context switching&lt;br /&gt;Dual wrap scheduler whichs allows two independent warp  dispatches&lt;br /&gt;unified address architecture with support for C++&lt;br /&gt;Optimised OpenCL and directCompute execution&lt;br /&gt;Full IEEE 754-2008 32-bit and 64-bit floating point precision&lt;br /&gt;Full 32-bit integer pipeline with 64-bit extensions&lt;br /&gt;Support for 64-bit memory&lt;br /&gt;Faster predication support&lt;br /&gt;Faster atomic operations&lt;br /&gt;Gigathread technology&lt;br /&gt;10x faster context switching&lt;br /&gt;concurrent kernel execution&lt;br /&gt;overlapping bidirectional memory operations&lt;br /&gt;Out of Order thread block execution&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-7590283089612187182?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/7590283089612187182/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/10/nvidia-fermi-whitepaper.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/7590283089612187182'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/7590283089612187182'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/10/nvidia-fermi-whitepaper.html' title='Nvidia Fermi Whitepaper'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-3094451892774523026</id><published>2009-10-14T13:52:00.000+01:00</published><updated>2009-10-14T17:11:19.390+01:00</updated><title type='text'>Powering GPUs with diesel</title><content type='html'>Its very interesting to view the current performance/watt of CPU versus GPU technologies.&lt;br /&gt;&lt;br /&gt;A good example is the article &lt;a href="http://www.brightsideofnews.com/news/2009/10/12/gpu-computing-shows-superior-efficiency-in-australian-outback.aspx"&gt;here&lt;/a&gt;. Where they are trying to achieve radio astronomy observations which are much wider than the current 3 degree limited radio telescopes. It is planned to achieve this through a grid of smaller cheaper antennaes called the MWA [&lt;a title="MWA - Murchinson Widefield Array" href="http://www.mwatelescope.org/" target="_blank"&gt;Murchinson Widefield Array&lt;/a&gt;]. Unfortunately the telescopes need to be developed in extermely isolated areas to remove as much radio noise as possible. The frequency they are searching at is 300 - 800 Mhz.&lt;br /&gt;A prototype system which had  32 antenna clusters with each cluster formed out of 16 small antennas was developed in the Australian outback (the complete system planned to have 8000 antennaes). A major issue was powering a system in such an isolated area. Looking at various power requirements it became obvious that for there particular processing domain GPUs are currently a much more power effecient alternative than a CPU only solution. For the prototype all that was available for power was a 20Kw diesel engine. In its current prototype implementation it requires 20 teraflops of processing power.  Given that you would need around 200 CPUs for the job, as 200 Xeon 5500 CPUs at 3.2 GHz [100 GFLOPS each] would consume a grand total of 24,000Watts [that's not counting the rest of the computer needed for these CPUs to function]. Overall estimate in CPU-based setup was in excess of 55,000 Watts [55kW] and obviously, it was a dead end. On the other hand, in around 1kW ofpower, scientists managed to squeeze 4.5TFLOPS [dual GTX 295 card], meaning "only" 5.5kW is needed for 20TFLOPS. With upcoming Fermi-based cards, Australians expect to build a 20TFLOPS setup using only3.3kW.&lt;br /&gt;&lt;br /&gt;But this issue is not closed a fully grown array will generate 1Tbps of data and require 1 EFLOPS [&lt;a title="Exa on Wikipedia" href="http://en.wikipedia.org/wiki/Exa-" target="_blank"&gt;Exa&lt;/a&gt;] to process all the generated data. if you would need to build a 1EFLOPS supercomputer today, you would need quite an interesting number: 666,666 Fermi-based Tesla cards with 3.99 PetaBytes of GDDR5 memory. If you require 1EFLOPS from world's most powerful CPUs, the clock would stop with Intel Xeon W5590 at 3.33 GHz, and you would need around 10 million of them. In both cases, EFLOPS is unattainable.&lt;br /&gt;&lt;br /&gt;Therefore it will be interesting when MWA opens in 2020 what will be the underlying processing technology.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-3094451892774523026?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/3094451892774523026/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/10/powering-gpus-with-diesel.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/3094451892774523026'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/3094451892774523026'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/10/powering-gpus-with-diesel.html' title='Powering GPUs with diesel'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-7711445445230403479</id><published>2009-08-26T12:07:00.000+01:00</published><updated>2009-10-30T15:37:00.566Z</updated><title type='text'>OpenGL 3.2</title><content type='html'>I'm doing a bit of work these days on OpenGL with CUDA.&lt;br /&gt;A decent if a little AMD specific take on the new features of OpenGL is available &lt;a href="http://fireuser.com/blog/opengl_32_steps_up_to_the_plate/"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;p&gt;A good webinar of openGL and its new features is available from Nvidia &lt;a href="http://nvidia.fullviewmedia.com/GPU2009/0930-gold-1407.html"&gt;here&lt;/a&gt;.&lt;br /&gt;while the Nvidia OpenGL 3.2 beta drivers are available &lt;a href="http://developer.nvidia.com/object/opengl_3_driver.html"&gt;here&lt;/a&gt;. &lt;/p&gt;&lt;p&gt;&lt;br /&gt; &lt;/p&gt;The AMD article highlights the following new features.&lt;br /&gt;Geometry Shader Support --&gt; geometry primitives can be modified on GPU&lt;br /&gt;Sync objects --&gt; keeps CPU and GPU in sync&lt;br /&gt;Multisample tectures and samplers --&gt; Can apply multisample rendering hardware to extures and render buffer objects, instead of only screen space windows. Shader can read from each sample of a multisampled texture and apply custom blend schemes.&lt;br /&gt;&lt;br /&gt;I hope to give an overview of the Nvidia webinar at a later date.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-7711445445230403479?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/7711445445230403479/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/08/opengl-32.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/7711445445230403479'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/7711445445230403479'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/08/opengl-32.html' title='OpenGL 3.2'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-7649542091222071417</id><published>2009-08-11T14:11:00.000+01:00</published><updated>2009-08-11T15:44:38.178+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='CUDA'/><category scheme='http://www.blogger.com/atom/ns#' term='ray tracing'/><title type='text'>Simple Ray tracing example</title><content type='html'>Peter Trier from the &lt;a href="http://cg.alexandra.dk/"&gt;computer graphics group &lt;/a&gt;at the &lt;a href="http://www.alexandra.dk/uk/about/index.htm"&gt;Alexandria Institute &lt;/a&gt;has written a nice simple tutorial for building a CUDA based ray tracer &lt;a href="http://cg.alexandra.dk/2009/08/10/triers-cuda-ray-tracing-tutorial/"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Running pre-built executable&lt;/strong&gt;&lt;br /&gt;The executable ran without issue when I copied the glut32.dll availble from &lt;nvidia&gt;\C\bin\win32\Release into example directory.&lt;br /&gt;&lt;br /&gt;&lt;strong&gt;Building available code&lt;/strong&gt;&lt;br /&gt;The build instructions I have here were applied using Windows XP 32-bit, Visual Studio 2005 and CUDA 2.3.&lt;br /&gt;&lt;br /&gt;With regards to building the code.&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Move the src folder to &lt;nvidia&gt;\src&lt;/li&gt;&lt;li&gt;When you open the CUDA_raytracer.vcproj you are asked to create a .sln do this. (Another option would be to include this vcproj in an existing .sln such as the release.sln in the &lt;nvidia&gt;\src folder.&lt;/li&gt;&lt;li&gt;Install &lt;a href="http://www.antisphere.com/Wiki/tools:anttweakbar?sb=tools"&gt;AntTweakBar&lt;/a&gt;. It is used to for the user interface in the ray tracing program. After installing this. Can simply copy the AntTweakBar.h, dll and lib files into the example source folder.&lt;/li&gt;&lt;li&gt;There is an error regarding missing &lt;a href="http://developer.nvidia.com/object/cg_toolkit.html"&gt;Cg&lt;/a&gt; libraries these can be removed as there is no dependencies in example code.&lt;/li&gt;&lt;li&gt;The app will be built at &lt;nvidia&gt;\NVIDIA GPU Computing SDK\C\bin\win&lt;3264&gt;\&lt;releasemode&gt;&lt;/li&gt;&lt;li&gt;To run application copy the AntTweakBar.dll and data folder into build folder.&lt;/li&gt;&lt;li&gt;The application should now run &lt;/li&gt;&lt;/ol&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-7649542091222071417?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/7649542091222071417/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/08/simple-ray-tracing-example.html#comment-form' title='8 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/7649542091222071417'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/7649542091222071417'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/08/simple-ray-tracing-example.html' title='Simple Ray tracing example'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>8</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-3180400035936834963</id><published>2009-07-29T18:34:00.000+01:00</published><updated>2009-07-30T13:43:54.103+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='CUDA'/><category scheme='http://www.blogger.com/atom/ns#' term='GT200'/><title type='text'>Very good article on CUDA anf the GT200 architecture</title><content type='html'>&lt;a href="http://www.realworldtech.com/page.cfm?ArticleID=RWT090808195242&amp;amp;p=1"&gt;here&lt;/a&gt; is one of the best articles around to get a handle on CUDA with emphasis on the GT200 Architecture.&lt;br /&gt;It has:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;A really nice concise overview of the way Directx in many ways drove hardware towards GPGPU capabilities.&lt;/li&gt;&lt;li&gt;Some basic pointers on what does and doesn't map well to the GPU.&lt;/li&gt;&lt;li&gt;A nice concise overview of where memory is located, its scope and potential usage scenarios.&lt;/li&gt;&lt;li&gt;Overview of the CUDA API architecture, toolchain and optional libraries etc.&lt;/li&gt;&lt;li&gt;Describes the extensions of different hardware compute architectures.&lt;/li&gt;&lt;li&gt;A nice description of the GT200 system architecture and contrasts it with other architectures such as x86, Niagra.&lt;/li&gt;&lt;li&gt;Nice analysis of the actual hardware architecture such as the TPC, SM, memory pipelines, clocking domains&lt;/li&gt;&lt;li&gt;A nice analysis of the TPC --&gt; SM --&gt; SP control and computation model.&lt;/li&gt;&lt;li&gt;Useful info such as the cycle times for differenet operations, dual issues and hardware constraints.&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-3180400035936834963?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/3180400035936834963/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/very-good-article-on-cuda-anf-gt200.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/3180400035936834963'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/3180400035936834963'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/very-good-article-on-cuda-anf-gt200.html' title='Very good article on CUDA anf the GT200 architecture'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-5616494571206555473</id><published>2009-07-29T16:20:00.000+01:00</published><updated>2009-07-29T16:23:29.438+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Sun'/><category scheme='http://www.blogger.com/atom/ns#' term='multi-core'/><title type='text'>Sun's Niagara 3</title><content type='html'>Report &lt;a href="http://www.theregister.co.uk/2008/06/23/sun_niagara_k2/"&gt;here&lt;/a&gt; on Sun's Niagara 3. It is reported that it will be out by late 2009 and will have 16-cores and 16 threads per core.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-5616494571206555473?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/5616494571206555473/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/suns-niagara-3.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/5616494571206555473'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/5616494571206555473'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/suns-niagara-3.html' title='Sun&apos;s Niagara 3'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-5589154053222665793</id><published>2009-07-23T19:12:00.000+01:00</published><updated>2009-07-23T19:36:14.169+01:00</updated><title type='text'>Dated but decent article "Nvidia's CUDA: The End of the CPU?"</title><content type='html'>The article &lt;a href="http://www.tomshardware.com/reviews/nvidia-cuda-gpu,1954.html"&gt;here&lt;/a&gt;. Is a little out dated but well worth a read for newbies to CUDA.&lt;br /&gt;&lt;br /&gt;It gives good brief background history of where CUDA evolved from.&lt;br /&gt;&lt;br /&gt;Although some of the details are outdated. For example double precision has been available for a while in CUDA and support for SLI has been made available just recently with &lt;a href="http://forums.nvidia.com/index.php?showtopic=102548"&gt;CUDA 2.3&lt;/a&gt;.&lt;br /&gt;It is also quite specific to 8800GTX hardware.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-5589154053222665793?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/5589154053222665793/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/dated-but-decent-article-nvidias-cuda.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/5589154053222665793'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/5589154053222665793'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/dated-but-decent-article-nvidias-cuda.html' title='Dated but decent article &quot;Nvidia&apos;s CUDA: The End of the CPU?&quot;'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-5343769780521256552</id><published>2009-07-15T18:14:00.001+01:00</published><updated>2009-07-15T18:16:14.052+01:00</updated><title type='text'>HEXUS.net - Tech explained :: NVIDIA Tegra SoC</title><content type='html'>A good overview of Nvidia new SOC for mobile devices.&lt;br /&gt;&lt;a href="http://www.hexus.net/content/item.php?item=19229"&gt;HEXUS.net - Tech explained :: NVIDIA Tegra SoC : Page - 1/1&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-5343769780521256552?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/5343769780521256552/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/hexusnet-tech-explained-nvidia-tegra.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/5343769780521256552'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/5343769780521256552'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/hexusnet-tech-explained-nvidia-tegra.html' title='HEXUS.net - Tech explained :: NVIDIA Tegra SoC'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-5593986711720919322</id><published>2009-07-15T18:11:00.001+01:00</published><updated>2009-07-15T18:13:55.535+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='CUDA'/><category scheme='http://www.blogger.com/atom/ns#' term='video editing'/><title type='text'>Review of CUDA enabled video editing software</title><content type='html'>&lt;a href="http://www.elitebastards.com/cms/index.php?option=com_content&amp;amp;task=view&amp;amp;id=732&amp;amp;Itemid=29&amp;amp;limit=1&amp;amp;limitstart=0"&gt;A review of available CUDA enabled video editing software. &lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-5593986711720919322?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/5593986711720919322/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/review-of-cuda-enabled-video-editing.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/5593986711720919322'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/5593986711720919322'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/review-of-cuda-enabled-video-editing.html' title='Review of CUDA enabled video editing software'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-9069240434347228664</id><published>2009-07-15T18:01:00.000+01:00</published><updated>2009-07-15T18:03:51.001+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='CUDA'/><category scheme='http://www.blogger.com/atom/ns#' term='HPC'/><category scheme='http://www.blogger.com/atom/ns#' term='Nvidia'/><title type='text'>NeST ties up with Nvidia</title><content type='html'>It is reported &lt;a href="http://economictimes.indiatimes.com/Infotech/Software/NeST-ties-up-with-Nvidia/articleshow/4772425.cms"&gt;here&lt;/a&gt; that Network Systems and Technologies (NeST) based at the Technopark has tied up with visual computing initiative major Nvidia&lt;br /&gt;Corporation&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-9069240434347228664?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/9069240434347228664/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/nest-ties-up-with-nvidia.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/9069240434347228664'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/9069240434347228664'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/nest-ties-up-with-nvidia.html' title='NeST ties up with Nvidia'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-1546803203866035507</id><published>2009-07-15T15:57:00.000+01:00</published><updated>2009-07-15T16:43:31.939+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='risk analysis'/><category scheme='http://www.blogger.com/atom/ns#' term='software engineering'/><title type='text'>Analysis of Risk</title><content type='html'>Bruce &lt;span id="SPELLING_ERROR_0" class="blsp-spelling-error"&gt;Schneier&lt;/span&gt;  discusses the over estimation of terrorist risk &lt;a href="http://www.wired.com/politics/security/commentary/securitymatters/2009/06/securitymatters_0619"&gt;here&lt;/a&gt;. He adopted the ideas of the a software engineering paper &lt;a href="http://simula.no/research/engineering/publications/Simula.SE.621"&gt;More Risk Analysis Can Lead to Increased Over-Optimism and Over-Confidence&lt;/a&gt; to the estimation terrorist attack risk. Both articles make interesting reading especially where they discuss &lt;span id="SPELLING_ERROR_1" class="blsp-spelling-corrected"&gt;behavioural&lt;/span&gt; economics, the cognitive biases that make us think and act.&lt;br /&gt;One is control bias. Where someone underestimates the risk when they are in control and overestimate risk when they are not. A good example of this is peoples estimation of risk when driving themselves compared to been flown by others.&lt;br /&gt;Another is availability heuristic. Where we estimate on what can be easily brought to mind. The example given is we overestimate what we see and hear on the news and underestimate hard to imagine risks.&lt;br /&gt;The peak end rule is basically we give additional weight to the last imagined risk.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-1546803203866035507?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/1546803203866035507/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/analysis-of-risk.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/1546803203866035507'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/1546803203866035507'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/07/analysis-of-risk.html' title='Analysis of Risk'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-7775322398968183847</id><published>2009-04-16T12:39:00.000+01:00</published><updated>2009-04-19T10:15:22.683+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='CUDA'/><category scheme='http://www.blogger.com/atom/ns#' term='vi'/><category scheme='http://www.blogger.com/atom/ns#' term='vim'/><title type='text'>Using vi with CUDA</title><content type='html'>&lt;a href="http://forums.nvidia.com/index.php?showtopic=43774"&gt;Here&lt;/a&gt; is a nifty configuration file for CUDA syntax highlighting in vi.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-7775322398968183847?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/7775322398968183847/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/04/using-vi-with-cuda.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/7775322398968183847'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/7775322398968183847'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/04/using-vi-with-cuda.html' title='Using vi with CUDA'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-346564447682822995</id><published>2009-04-16T12:24:00.000+01:00</published><updated>2009-04-19T10:15:51.514+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='CUDA'/><category scheme='http://www.blogger.com/atom/ns#' term='cmake'/><title type='text'>cuda and cmake</title><content type='html'>cmake is a cross-platform makefile generator. &lt;a href="http://www.cmake.org/"&gt;http://www.cmake.org&lt;/a&gt;&lt;br /&gt;Abe Stephens has developed a really nifty script to allow the use of cmake with CUDA. &lt;a href="http://www.sci.utah.edu/%7Eabe/FindCuda.html"&gt;FindCuda.cmake&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;If you have any issues a good place to ask questions is &lt;a href="http://forums.nvidia.com/index.php?showtopic=29482"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-346564447682822995?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/346564447682822995/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/04/cuda-and-cmake.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/346564447682822995'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/346564447682822995'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/04/cuda-and-cmake.html' title='cuda and cmake'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5778844313309310715.post-2274337430462218864</id><published>2009-04-14T13:45:00.000+01:00</published><updated>2009-04-19T10:16:35.402+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='CUDA'/><category scheme='http://www.blogger.com/atom/ns#' term='GPU gems'/><category scheme='http://www.blogger.com/atom/ns#' term='reference'/><category scheme='http://www.blogger.com/atom/ns#' term='books'/><title type='text'>GPU Gems series of books available online.</title><content type='html'>The book series GPU gems is a great source on GPU programming techniques. They are very well written with a wide range of articles. The third book in particular has quiet a number of articles on CUDA.&lt;br /&gt;&lt;br /&gt; The three GPU gems books are available free online:&lt;br /&gt;&lt;a href="http://developer.nvidia.com/object/gpu_gems_home.html"&gt;http://developer.nvidia.com/object/gpu_gems_home.html&lt;/a&gt;&lt;br /&gt;&lt;a href="http://developer.nvidia.com/object/gpu_gems_2_home.html"&gt;http://developer.nvidia.com/object/gpu_gems_2_home.html&lt;/a&gt;&lt;br /&gt;&lt;a href="http://http.developer.nvidia.com/GPUGems3/gpugems3_part01.html"&gt;http://http.developer.nvidia.com/GPUGems3/gpugems3_part01.html&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/5778844313309310715-2274337430462218864?l=programmingcuda.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://programmingcuda.blogspot.com/feeds/2274337430462218864/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://programmingcuda.blogspot.com/2009/04/gpu-gems-series-of-books-available.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/2274337430462218864'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5778844313309310715/posts/default/2274337430462218864'/><link rel='alternate' type='text/html' href='http://programmingcuda.blogspot.com/2009/04/gpu-gems-series-of-books-available.html' title='GPU Gems series of books available online.'/><author><name>Francie</name><uri>http://www.blogger.com/profile/09104729371097720314</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
