<h1>JIT Octave</h1>The development of a JIT compiler for GNU Octave, by Max Brister.<br /><br /><hr /><h2>JIT, Debugging, and Interrupts (2012-11-05)</h2>I finally found some time to work on Octave last weekend. There has been some talk on the mailing list recently about releasing a new version of Octave, so I figured I should clean up a few loose ends in JIT.<br /><br /><h3>Breakpoints</h3><div>Up until now JIT has been skipping breakpoints. While there are several issues with supporting breakpoints in JIT, the biggest one is that the Octave debugger allows for the execution of arbitrary statements. This is a very powerful and useful feature, but it allows users to change the type of a variable in the middle of code execution.<br /><br />JIT gets its performance improvement by making assumptions about variable types, so entering debug mode means we need to exit JIT. I took the simple way out here and do not start JIT if there are any breakpoints in the code (see <a href="http://hg.savannah.gnu.org/hgweb/octave/file/44272909d926/libinterp/interp-core/pt-jit.cc#l1941">hg</a>).<br /><br /></div><h3>Interrupts</h3><div>In Octave, if you hit Control-C, Octave will stop execution and return to the Octave prompt (unless debug on interrupt is set). 
The interpreter does this by calling <span style="font-family: Courier New, Courier, monospace;">octave_quit</span>, which checks the interrupt state and throws an <span style="font-family: Courier New, Courier, monospace;">octave_interrupt_exception</span> if an interrupt has occurred.</div><div><br /></div><div>Ideally, to support interrupts in JIT, a call to <span style="font-family: Courier New, Courier, monospace;">octave_quit</span><span style="font-family: inherit;"> should be inserted once per loop iteration</span>. Unfortunately, it is not that simple. After an interrupt occurs, the current variable values need to be reported to the interpreter. For example,<br /><pre>i = 0;<br />while 1<br /> i += 1;<br />endwhile</pre>If the user interrupts the loop, the interpreter needs to save the current value of <span style="font-family: Courier New, Courier, monospace;">i</span>. This means JIT needs a way to catch and rethrow the <span style="font-family: Courier New, Courier, monospace;">octave_interrupt_exception</span>. While LLVM does have a way of handling exceptions, the infrastructure to support that LLVM feature does not yet exist in Octave.<br /><br />Instead, I inserted a check of <span style="font-family: Courier New, Courier, monospace;">octave_interrupt_state</span>. If <span style="font-family: Courier New, Courier, monospace;">octave_interrupt_state</span> is greater than 0, we need to exit to the Octave prompt. I reused the code for checking <span style="font-family: Courier New, Courier, monospace;">error_state</span><span style="font-family: inherit;"> to achieve this.</span><br /><br />Now that JIT handles interrupts and breakpoints in a manner consistent with the interpreter, I can't think of any ways in which JIT and the interpreter differ (besides speed). The amount of code which JIT can compile is still fairly limited. Hopefully, I will get some time over winter break to make it easier to extend JIT and improve what JIT can compile. 
In its current state, JIT should be ready to include as an experimental feature in the next Octave release.</div><hr /><h2>GSoC Report (2012-08-18)</h2>I have just finished writing my Google Summer of Code <a href="https://sites.google.com/site/2bass2/report.pdf?attredirects=0&d=1">final report</a>.<hr /><h2>Multidimensional indexing and end (2012-08-07)</h2>This week I added support for multidimensional matrix indexing and using end in JIT. Support for the end keyword is interesting. Take the following for example,<br /><pre>y = A(1, sin (end));</pre>From that line alone, it is not clear what end refers to. If sin is a matrix, then end will be the end of sin. If sin is a function, then end will be the end of A.<br /><br />I have solved this problem by keeping track of the full context information for each end. Let's take a look at a slightly more complicated example,<br /><pre>y = A(1, 2, B(sin (end), 5));</pre>During type inference our context might then look something like this<br /><pre>type      identifier  index  count<br />function  sin         0      1<br />matrix    B           0      2<br />matrix    A           2      4</pre>In this context, end refers to the end of matrix B at index 0.<hr /><h2>OctConf Reflection and Project Plan (2012-07-27)</h2>I had a great time at OctConf 2012. There were a lot of interesting people there, and it is nice to be able to put faces to names. 
I definitely hope I will be able to make it next year.<br /><br />I recently realized that there are only two weeks left before the GSoC suggested pencils-down date (8/13/2012). In the remaining time I plan on focusing my effort on better matrix support and on supporting more builtin functions. I should be able to make significant progress on both in the next two weeks.<br /><br />After GSoC is done, I plan on working on compiling user functions and function handles. I think adding support for user functions in JIT is important, but I'm not sure if I will be able to complete it in two weeks.<hr /><h2>Comparison of JIT with Oct files (2012-07-03)</h2>In my last <a href="http://jit-octave.blogspot.com/2012/06/realistic-test.html">post</a> I tested the Octave JIT compiler on a problem presented on the <a href="https://mailman.cae.wisc.edu/pipermail/help-octave/2012-June/052642.html">mailing list</a>. I got a request for a comparison with oct files. 
I think this is an interesting comparison, because ideally the JIT compiler should reduce the need to rewrite Octave scripts as oct files.<br /><br /><h4> Oct file</h4><div>The oct file is mostly equivalent to the loopy version in my previous post.</div><pre>#include <octave/oct.h><br />#include <octave/parse.h><br />#include <algorithm><br /><br />DEFUN_DLD (oct_loopy, args, , "TODO")<br />{<br /> feval ("tic");<br /><br /> octave_value ret;<br /> int nargin = args.length ();<br /> if (nargin != 2)<br /> print_usage ();<br /> else<br /> {<br /> NDArray data = args(0).array_value ();<br /> octave_idx_type nconsec;<br /> nconsec = static_cast<octave_idx_type> (args(1).double_value ());<br /><br /> if (!error_state)<br /> {<br /> double *vec = data.fortran_vec ();<br /> octave_idx_type counter = 0;<br /> octave_idx_type n = data.nelem ();<br /> for (octave_idx_type i = 0; i < n; ++i)<br /> {<br /> if (vec[i])<br /> ++counter;<br /> else<br /> {<br /> if (counter > 0 && counter < nconsec)<br /> std::fill (vec + i - counter, vec + i, 0);<br /><br /> counter = 0;<br /> }<br /> }<br /><br /> if (counter > 0 && counter < nconsec)<br /> std::fill (vec + n - counter, vec + n, 0);<br /><br /> ret = octave_value (data);<br /> }<br /> }<br /><br /> feval ("toc");<br /> return ret;<br />}</pre><h4> Results</h4><div>I ran each test five times, taking the lowest time. I have also separated out the compile/link time from the run time. For JIT, compile time was determined by running the function twice and subtracting the second run time from the first (the first run includes compilation). The compile time for the oct file was determined by timing mkoctfile. 
The initial parameters were a random vector, A, of size 1,000,000, and K = 3.<br /><br /><table border="2" cellpadding="3" cellspacing="3"><tbody><tr><th></th><th>Compile time</th><th>Run time</th></tr><tr><th>JIT</th><td>14ms</td><td>21ms</td></tr><tr><th>OCT</th><td>2400ms</td><td>3.3ms</td></tr></tbody></table></div><div><br />When using JIT, the compile time is part of the run time for the first execution of the loop. This means that for this example, the first JIT execution is currently about 10 times slower than running the oct file. However, if we were to execute the function 50 times on 1,000,000 element vectors, then JIT would only be about 6 times slower (ignoring the oct file's much larger one-time compile cost).<br /><br />After looking at the assembly, it appears that JIT runs into issues with checks for matrix index validity and with loop variables being doubles (in loops like `for ii=1:5' ii is a double). It should be possible to fix these issues in JIT, but it will result in a larger compile time.</div><hr /><h2>A Realistic Test (2012-06-28)</h2>There was an interesting post on the mailing list (<a href="https://mailman.cae.wisc.edu/pipermail/help-octave/2012-June/052642.html">https://mailman.cae.wisc.edu/pipermail/help-octave/2012-June/052642.html</a>). The problem is: given some logical array, A = [1 0 0 1 0 0 1 1 1 ...], and a minimum length of consecutive ones, K, all sequences of ones shorter than K should be filtered out.<br /><br />The exciting part for me is that the JIT compiler is currently far enough along to compile the loopy solution.<br /><br /><h4> Input generation</h4>I used a simple script to generate some random input data. 
A double matrix is used because JIT does not yet work for logical matrices.<br /><pre>function result = gen_test (n)<br /> result = double (rand (1, n) > .01);<br />endfunction</pre><h4> Vectorized</h4>Vectorized code (based on code from the mailing list)<br /><pre>function z = vectorized (A, K)<br /> tic;<br /> temp = ones (1, K);<br /> z = conv (A, temp);<br /> z = z > K-1;<br /> z = conv (z, temp);<br /> z = z(K:end-K+1);<br /> z = z >= 1;<br /> toc;<br />endfunction</pre><h4> Loopy</h4><div>I didn't do anything fancy here.</div><pre>function z = loopy (A, K)<br /> tic;<br /> z = A;<br /> n = numel (A);<br /> counter = 0;<br /> for ii=1:n<br /> if z(ii)<br /> counter = counter + 1;<br /> else<br /> if counter > 0 && counter < K<br /> z(ii-counter:ii-1) = 0;<br /> endif<br /> counter = 0;<br /> endif<br /> endfor<br /><br /> if counter > 0 && counter < K<br /> z(end-counter+1:end) = 0;<br /> endif<br /> toc;<br />endfunction<br /></pre><h4> Results</h4>These numbers were taken from an AMD FX(tm)-4100 Quad-Core Processor with 8 GB of RAM. I just ran each test once. For each test, the number of elements in A was 1,000,000.<br /><br /><table border="2" cellpadding="3" cellspacing="3"><tbody><tr><th></th><th>Vectorized</th><th>Loopy JIT</th><th>Loopy JIT (no overhead)</th><th>Loopy (No JIT)</th></tr><tr><th>K=3</th><td>0.078s</td><td>0.059s</td><td>0.028s</td><td>5.27s</td></tr><tr><th>K=100</th><td>0.43s</td><td>0.063s</td><td>0.028s</td><td>5.66s</td></tr><tr><th>K=500</th><td>1.58s</td><td>0.082s</td><td>0.033s</td><td>5.73s</td></tr></tbody></table><br />These results are expected. The efficiency of the vectorized approach depends on K (the length-K convolutions get more expensive as K grows), while the loopy version makes a single pass regardless of K. 
While JIT support is not complete or stable yet<sup>1</sup>, I think this shows that the current JIT implementation is able to handle a few practical examples, not just interpreter benchmarks.<br /><br />hg id for regular octave branch: 52cb71787cd1<br />hg id for jit branch: f649b66ef1af<br /><br /><span style="font-size: x-small;"><sup>1</sup> Occasionally I break functionality like assignment, addition, and function calls.</span><hr /><h2>Matrix Support (2012-06-18)</h2><h3> The Octave Language</h3>An interesting feature of the Octave language is that matrices are treated as values, not references. For example, the following code<br /><pre>A = [1 2 3];<br />B = A;<br />A(1) = 100;<br />disp (B(1));</pre>will display the value 1, not 100. Matrices are also passed by value in function calls.<br /><br /><h3> Interpreter Implementation</h3>Octave currently uses a really cool system to minimize the number of times matrices are copied. It combines Copy on Write (COW) with reference counting to reduce the number of copies.<br /><br />The idea is to delay copies until a matrix is mutated. Then the copy is only made if the reference count is greater than one. Take the previous example,<br /><pre>A = [1 2 3];<br />B = A; # No copy<br />A(1) = 100; # Copy, as both A and B refer to [1 2 3]</pre>This algorithm has two nice properties. First, it is simple. Second, copies are only made when required.<br /><br /><h3> JIT</h3>I plan on using the same algorithm in JIT. This decision means that every assignment of the form<br /><pre>A(idx) = x; # Where idx and x are scalar</pre>requires a check to see if a copy is required. This actually is not such a big issue, because we already require a check to ensure the index is legal. 
In order to speed up the normal case, we can directly inline the condition check, which leads us to the following pseudocode for the inlined function.<br /><pre>if (isint (idx) && idx >= 1 && idx <= nelem (A) && count (A) == 1)<br /> A(idx) = x;<br />else<br /> A = handle_odd_assign (A, idx, x);</pre>Hopefully, this will allow the normal case to be quick, but still provide a correct implementation for more complicated cases.<hr /><h2>Errors are annoying (2012-06-11)</h2><i>Edit</i><br /><i>I realized that I need to qualify that this issue isn't a problem with Octave's implementation of error handling. Instead, the issue lies in the language, and the fact that almost every operation can cause an error.</i><br /><br />In Octave, correctly handling errors is annoying. For an example, let's look at a simplified implementation of binary operators in the interpreter.<br /><pre>octave_value retval;<br />octave_value lhs = // compute lhs<br />if (! error_state && lhs.is_defined ())<br />{<br /> octave_value rhs = // compute rhs<br /> if (! error_state && rhs.is_defined ())<br /> retval = ::do_binary_op (op_type, lhs, rhs);<br />}<br />return retval;<br /></pre>Notice that after every operand is computed, the error state must be checked. Take the following statement<br /><pre>a = (a + b) / c;</pre>When converted into a linear SSA form we get<br /><pre>block0:<br /> temp0 = a0 + b0<br /> check_error block1, done<br />block1:<br /> temp2 = temp0 / c0<br /> check_error block2, done<br />block2:<br /> a1 = temp2<br /> goto done<br />done:<br /> a2 = phi (block0: a0, block1: a0, block2: a1)<br /> # yield a2, b0, and c0 as results</pre>Notice that the phi merge function requires an entry for every operation. Furthermore, each variable that is defined must be represented in the done block. 
In the worst case, each statement could define a new variable, leading to O(n^2) space complexity (where n is the number of statements).<br /><br />However, in practice the number of variables should be low compared to the number of lines. As a test, I automatically generated a 500-line program consisting of scalar addition statements like<br /><pre>i1 = i0 + i0;</pre>It looks like the compile time stays within reasonable bounds.<br /><div class="separator" style="clear: both; text-align: center;"><a href="http://2.bp.blogspot.com/-yp9SZuhdGiM/T9ZK8qSLNgI/AAAAAAAAADw/OIjngJgTbBk/s1600/foo.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="300" src="http://2.bp.blogspot.com/-yp9SZuhdGiM/T9ZK8qSLNgI/AAAAAAAAADw/OIjngJgTbBk/s400/foo.png" width="400" /></a></div><hr /><h2>Progress? (2012-06-04)</h2>Progress last week was a little slower than I had hoped. Originally, I had planned on adding support for break and continue quickly, then moving on to other issues. However, I realized the hard way that my simple inline SSA construction algorithm was not very extensible. I have now implemented a textbook SSA construction algorithm.<br /><br />I also took another look at my modifications to configure.ac (with help from John W. Eaton and Jordi). The configure check was fixed, and LLVM_CONFIG can now be set to the location of llvm-config. This allows building with LLVM in a nonstandard path.<br /><br />Next, I think I will focus on ensuring error and warning conditions are handled correctly (not try/catch, just error and warning). In theory error checking is simple: after each operation that may raise an error, check whether it did, and if so exit the JIT function. There are a few annoying details. 
For example, I need to keep track of and return the values of each variable when an error occurs. Additionally, the ability to change warnings into errors adds further complexity.<hr /><h2>Type Inference (2012-05-28)</h2>I just finished implementing a type inference algorithm based on FALCON [1]. I like this algorithm for several reasons:<br /><ol><li>The efficiency is <i>O(n)</i>: this is important because we should minimize JIT overhead.</li><li>It decouples type inference from code generation: this will make it easier if we decide to switch from LLVM to a GNU equivalent.</li><li>Type bounds are precise: the algorithm is able to assign different types to the same variable when the variable is used in different contexts.</li></ol>The algorithm works by introducing a type lattice (similar to the MaJIC and McVM MATLAB JIT compilers [2][3]). For example, here is the simple lattice I am currently using:<br /><div class="separator" style="clear: both; text-align: center;"><a href="http://1.bp.blogspot.com/-k9JAiN9ojAM/T8OCabaexHI/AAAAAAAAADA/oRDF_eWFCJs/s1600/lattice.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="200" src="http://1.bp.blogspot.com/-k9JAiN9ojAM/T8OCabaexHI/AAAAAAAAADA/oRDF_eWFCJs/s320/lattice.png" width="320" /></a></div><div>Each variable in the <a href="http://en.wikipedia.org/wiki/Static_single_assignment_form">SSA</a> form is then assigned a type from our lattice. For example, in the code<br /><pre>b = 1;<br />for i=0:5<br /> temp = 5;<br /> b = b + temp;<br /><br /> temp = [5 5 5];<br /> b = b + temp;<br />endfor<br /></pre>the algorithm is able to differentiate between the two occurrences of <i>temp</i>. The algorithm will also realize that <i>b</i> is a matrix inside the loop (once matrices are added to the type lattice). 
Then, before the start of the loop, <i>b</i> will be converted to a single-element matrix. This means we can generate matrix addition operations inside the loop.<br /><br />[1] L. D. Rose and D. Padua. Techniques for the Translation of MATLAB Programs into Fortran 90. ACM Transactions on Programming Languages and Systems, 21(2):286–323, Mar. 1999.<br />[2] George Almási and David Padua. 2002. MaJIC: compiling MATLAB for speed and responsiveness. SIGPLAN Not. 37, 5 (May 2002), 294-303.<br />[3] <a href="http://www.sable.mcgill.ca/mclab/mcvm/mcvmthesis.pdf">M.Sc. Thesis - <i>McVM: an Optimizing Virtual Machine for the MATLAB Programming Language</i></a></div><hr /><h2>Initial Work (2012-05-22)</h2>Hi, I am Max Brister, a student working on JIT compilation in GNU Octave for Google Summer of Code. My project proposal can be found <a href="https://google-melange.appspot.com/gsoc/project/google/gsoc2012/max_brister/23001">on melange</a>. In this blog I plan to explore implementation details, mark my progress, and present intermediate results.<br /><br />I have already implemented a simple proof of concept which shows some promise. The simple script<br /><pre>a = 1;<br />b = 1;<br /><br />tic;<br />for i=1:10000<br /> for j=1:10000<br /> a = a + b;<br /> endfor<br />endfor<br />toc;</pre>executes on my computer in 0.178s using my JIT branch, and in 253.124s using Octave 3.6.1. A speedup of 1422x is not bad.<br /><br />There is still quite a ways to go before the code is ready for users though. 
I need to take another look at type inference, ensure error cases are handled correctly, and remove the current requirement that inner loop bounds be constant for type inference.<br /><br />You can view my current progress by checking out my code from<br /><pre>hg clone http://inversethought.com/hg/octave-max</pre>The specific revision this post refers to is cba58541954c.<br /><br />*edit*<br />I realized that I should mention that a speedup of 1422x is really a best case speedup. Code which is already vectorized will see a much smaller speedup (if any). This is because Octave already implements vectorized operations efficiently.