Speed of pg_streaming extension #1846

janko · 2022-03-03T20:17:48Z

janko
Mar 3, 2022

I was curious about measuring the memory usage & speed difference between classic cursor-based pagination and Postgres streaming. So, I created the following script:

require "sequel"
require "faker"
require "tty-progressbar"

DB = Sequel.postgres("janko")
# DB.extension :pg_streaming

unless DB.table_exists?(:accounts)
  DB.create_table! :accounts do
    primary_key :id
    String :status
    String :email
    String :password
  end

  values = Array.new(1000) do
    {
      status: %w[unverified verified closed].sample,
      email: Faker::Internet.email,
      password: Faker::Internet.password(min_length: 30),
    }
  end

  DB[:accounts].multi_insert(values * 1000, commit_every: 1000)
end

progress = TTY::ProgressBar.new("[:bar] :elapsed :percent (:eta)", total: DB[:accounts].count)
DB[:accounts].paged_each.each_slice(1000) do |accounts|
  progress.advance(accounts.size)
end

With cursors, this takes around 2 seconds. However, when I enable the pg_streaming extension, which changes #paged_each to use streaming, the ETA for completion is over 25 minutes.

Am I missing something? Maybe I'm misunderstanding Postgres streaming, thinking that it should be comparably fast to the cursor implementation? I'm using the latest Sequel (5.54.0), sequel_pg (1.14.0), and Postgres (14.2) versions (installed via Homebrew).

jeremyevans · 2022-03-03T20:27:55Z

jeremyevans
Mar 3, 2022
Maintainer

PostgreSQL streaming is designed to minimize memory usage (keep 1 row in memory at a time). From a network protocol perspective, it is roughly equivalent to using a cursor with a fetch size of 1 (though you only need one statement in total rather than one statement per cursor fetch). So it doesn't surprise me that it is slower from a throughput perspective, though 10x slower is higher than I would have expected.
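
As a rough illustration of the "fetch size of 1" comparison, the sketch below counts how many batches the client has to process in each mode. The row count comes from the script in the original post (1000 values repeated 1000 times); the cursor fetch size of 1000 is an assumption based on Sequel's documented default for paged_each, and `batches_needed` is a hypothetical helper, not a Sequel method.

```ruby
# Hypothetical helper: number of batches needed to read total_rows rows
# when each batch carries at most rows_per_fetch rows.
def batches_needed(total_rows, rows_per_fetch)
  total_rows.fdiv(rows_per_fetch).ceil
end

total_rows = 1_000_000 # 1000 generated values * 1000, as in the script above

cursor_batches    = batches_needed(total_rows, 1000) # assumed cursor fetch size
streaming_results = batches_needed(total_rows, 1)    # single-row mode: one result per row

puts "cursor:    #{cursor_batches} fetches"
puts "streaming: #{streaming_results} single-row results"
```

The per-result overhead is small, but in streaming mode it is paid once per row instead of once per 1000 rows, which is why some throughput loss is expected.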

To clarify whether or not this is a Sequel issue, would it be possible to run a similar test using ruby-pg directly and see if you get a similar result?
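
A minimal sketch of what such a ruby-pg-only test could look like, assuming the pg gem's single-row-mode calls (`send_query`, `set_single_row_mode`, `get_result`, and `PG::Result#stream_each`). The helper name `time_streaming` is hypothetical, and the database name and table in the usage comment are taken from the script in the original post.

```ruby
# Hypothetical helper (not part of ruby-pg or Sequel): streams a query in
# single-row mode and returns [row_count, elapsed_seconds].
# `conn` is expected to behave like a PG::Connection.
def time_streaming(conn, sql)
  count = 0
  started = Process.clock_gettime(Process::CLOCK_MONOTONIC)

  conn.send_query(sql)     # send the query without waiting for the full result
  conn.set_single_row_mode # ask libpq to return one row per result
  conn.get_result.stream_each { |_row| count += 1 }
  conn.get_result          # expected to be nil: consumes the end-of-results marker

  [count, Process.clock_gettime(Process::CLOCK_MONOTONIC) - started]
end

# Usage against a real database (assumes the pg gem is installed and the
# accounts table from the script above exists):
#
#   require "pg"
#   conn = PG.connect(dbname: "janko")
#   rows, seconds = time_streaming(conn, "SELECT * FROM accounts")
#   puts "#{rows} rows in #{seconds.round(2)}s"
```

Running the same query without `set_single_row_mode` (for example with `conn.exec(sql).each`) would give the non-streaming baseline for comparison.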

9 replies

jeremyevans Mar 3, 2022
Maintainer

Thanks for pointing that out. When I have time, I'll investigate that and see if I can use it directly or use a similar approach in sequel_pg.

jeremyevans Mar 4, 2022
Maintainer

stream_each cannot be used directly because, among other things, it uses string keys and not symbol keys in the hashes. However, I can probably take a similar approach of using libpq directly instead of calling ruby-pg methods to speed up single row mode.
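
To illustrate the key mismatch, here is a small pure-Ruby sketch. The sample row is made up; it only mimics the shape of a hash with string keys, while Sequel datasets yield hashes with symbol keys.

```ruby
# Made-up row in the shape described above: column names as string keys.
string_keyed = { "id" => "1", "status" => "verified" }

# Sequel yields symbol keys, so each streamed row would need a conversion
# like this one, which allocates a second hash per row.
symbol_keyed = string_keyed.transform_keys(&:to_sym)

p string_keyed
p symbol_keyed
```

Doing this conversion in Ruby for every row would add work on the hot path, which is one reason building the symbol-keyed hash at the C level is the more attractive option.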

jeremyevans Mar 4, 2022
Maintainer

I'm doing some testing now. I can replicate the slowdown. With 400,000 records, I'm seeing about 1.69 seconds using the cursor approach, and 4.01 seconds using the streaming approach. This is closer to the performance difference I expected. I'm not sure why your streaming results are orders of magnitude slower. The example I'm using is simpler, though I'm not sure to what extent that matters:

t = Time.now
DB[:accounts].paged_each{}
p Time.now - t

I'll look into modifying sequel_pg to use the same streaming approach as stream_each to improve performance. If that isn't possible, I'll disable the default override of paged_each.
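
For reference, the timings quoted above can be turned into throughput figures. This is plain arithmetic on the numbers in this comment (400,000 records, 1.69 and 4.01 seconds); `rows_per_second` is a hypothetical helper.

```ruby
# Hypothetical helper: throughput for a given row count and wall-clock time.
def rows_per_second(rows, seconds)
  rows / seconds
end

rows = 400_000
cursor_rate    = rows_per_second(rows, 1.69) # cursor approach
streaming_rate = rows_per_second(rows, 4.01) # streaming approach
slowdown       = 4.01 / 1.69

puts "cursor:    #{cursor_rate.round} rows/s"
puts "streaming: #{streaming_rate.round} rows/s"
puts "slowdown:  #{slowdown.round(2)}x"
```

That works out to streaming being a little under 2.4 times slower in this test, far from the orders-of-magnitude gap in the original report.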

janko Mar 4, 2022
Author

Interesting, when I use plain paged_each (without the each_slice), I still get the same orders of magnitude slowdown. I'm wondering what's different in our systems. FWIW, this is what rbspy has to say about it when I profile the running ruby process for a couple of seconds:

janko Mar 4, 2022
Author

Or in the terminal form:

Time since start: 24s. Press Ctrl+C to stop.
Summary of profiling data so far:
% self  % total  name
 99.75    99.75  block [c function] - (unknown)
  0.12     0.12  sync_get_result [c function] - (unknown)
  0.08     0.08  block in each - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel-5.54.0/lib/sequel/dataset/actions.rb
  0.04   100.00  yield_each_row [c function] - (unknown)
  0.00   100.00  synchronize - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel-5.54.0/lib/sequel/database/connecting.rb
  0.00   100.00  paged_each - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel_pg-1.14.0/lib/sequel/extensions/pg_streaming.r
  0.00   100.00  hold - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel-5.54.0/lib/sequel/connection_pool/threaded.rb
  0.00   100.00  fetch_rows - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel_pg-1.14.0/lib/sequel/extensions/pg_streaming.r
  0.00   100.00  execute - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel-5.54.0/lib/sequel/dataset/actions.rb
  0.00   100.00  execute - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel-5.54.0/lib/sequel/adapters/postgres.rb
  0.00   100.00  each - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel_pg-1.14.0/lib/sequel_pg/sequel_pg.rb
  0.00   100.00  each - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel-5.54.0/lib/sequel/dataset/actions.rb
  0.00   100.00  check_database_errors - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel-5.54.0/lib/sequel/adapters/postgres
  0.00   100.00  block in fetch_rows - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel_pg-1.14.0/lib/sequel/extensions/pg_st
  0.00   100.00  block in execute - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel-5.54.0/lib/sequel/adapters/postgres.rb
  0.00   100.00  block (2 levels) in execute - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel-5.54.0/lib/sequel/adapters/po
  0.00   100.00  _execute - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel_pg-1.14.0/lib/sequel/extensions/pg_streaming.rb
  0.00   100.00  _execute - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/sequel-5.54.0/lib/sequel/adapters/postgres.rb
  0.00   100.00  <main> - bla.rb
  0.00    99.88  get_result - /Users/janko/.rbenv/versions/3.1.0/lib/ruby/gems/3.1.0/gems/pg-1.3.2/lib/pg/connection.rb

jeremyevans · 2022-03-04T18:55:10Z

jeremyevans
Mar 4, 2022
Maintainer

I think this is a performance regression in ruby-pg. With current sequel_pg and ruby-pg 1.2.3, it's only about 2x slower. With current sequel_pg and ruby-pg 1.3.0 and higher, it is orders of magnitude slower, about 300 records/second for me.
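
As a rough sanity check, the sketch below converts the 300 records/second figure into total run times for the two table sizes mentioned in this thread (400,000 rows in my test, 1,000,000 rows in the original script). The rate was measured on one machine, so applying it to another setup is only an approximation.

```ruby
# Hypothetical helper: minutes needed to read `rows` rows at `rate` rows/s.
def minutes_at_rate(rows, rate)
  rows / rate / 60.0
end

rate = 300.0 # records/second observed with ruby-pg 1.3.0 and higher

small_table = minutes_at_rate(400_000, rate)
large_table = minutes_at_rate(1_000_000, rate)

puts "400,000 rows:   about #{small_table.round(1)} minutes"
puts "1,000,000 rows: about #{large_table.round(1)} minutes"
```

Run times in the tens of minutes are in the same range as the "over 25 minutes" ETA reported at the top of the thread.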

Can you see if you can replicate the results with pg 1.2.3 and pg 1.3.0 in your environment (using your ruby-pg test, without Sequel)? If you can replicate them, we should probably file a bug with ruby-pg. I checked, and --disable-gvl-unlock didn't seem to affect performance at all.

FWIW, with some small modifications to ruby-pg, I can get sequel_pg to use the ruby-pg streaming support. That results in about 4.72 seconds, still longer than the 4.01 seconds I was getting previously with older versions of ruby-pg, but no longer crazy long. It still looks like we should disable the use of streaming in paged_each by default.

1 reply

janko Mar 5, 2022
Author

I think you're right. I was able to replicate the performance regression between 1.2.3 and 1.3.0. On 1.2.3, the streaming and non-streaming iterations performed roughly the same for me, and both were faster than the non-streaming iteration on 1.3.0, while the streaming iteration on 1.3.0 showed the slowdown.

So, it might be worth filing a bug report with ruby-pg, as you suggested. Personally, as long as sequel_pg uses the optimized streaming that avoids the performance regression, I'm happy. Thanks for coming up with a fix 🙏🏻

jeremyevans · 2022-03-04T19:00:48Z

jeremyevans
Mar 4, 2022
Maintainer

Here's the diff to ruby-pg and sequel_pg if you want to play around with it:

ruby-pg:

diff --git a/ext/pg_result.c b/ext/pg_result.c
index 8306be1..c996be5 100644
--- a/ext/pg_result.c
+++ b/ext/pg_result.c
@@ -1383,7 +1383,7 @@ pgresult_type_map_get(VALUE self)


 static void
-yield_hash(VALUE self, int ntuples, int nfields)
+yield_hash(VALUE self, int ntuples, int nfields, void *data)
 {
        int tuple_num;
        t_pg_result *this = pgresult_get_this(self);
@@ -1397,7 +1397,7 @@ yield_hash(VALUE self, int ntuples, int nfields)
 }

 static void
-yield_array(VALUE self, int ntuples, int nfields)
+yield_array(VALUE self, int ntuples, int nfields, void *data)
 {
        int row;
        t_pg_result *this = pgresult_get_this(self);
@@ -1417,7 +1417,7 @@ yield_array(VALUE self, int ntuples, int nfields)
 }

 static void
-yield_tuple(VALUE self, int ntuples, int nfields)
+yield_tuple(VALUE self, int ntuples, int nfields, void *data)
 {
        int tuple_num;
        t_pg_result *this = pgresult_get_this(self);
@@ -1436,8 +1436,8 @@ yield_tuple(VALUE self, int ntuples, int nfields)
        }
 }

-static VALUE
-pgresult_stream_any(VALUE self, void (*yielder)(VALUE, int, int))
+VALUE
+pgresult_stream_any(VALUE self, void (*yielder)(VALUE, int, int, void*), void* data)
 {
        t_pg_result *this;
        int nfields;
@@ -1465,7 +1465,7 @@ pgresult_stream_any(VALUE self, void (*yielder)(VALUE, int, int))
                                pg_result_check( self );
                }

-               yielder( self, ntuples, nfields );
+               yielder( self, ntuples, nfields, data );

                pgresult = gvl_PQgetResult(pgconn);
                if( pgresult == NULL )
@@ -1516,7 +1516,7 @@ pgresult_stream_any(VALUE self, void (*yielder)(VALUE, int, int))
 static VALUE
 pgresult_stream_each(VALUE self)
 {
-       return pgresult_stream_any(self, yield_hash);
+       return pgresult_stream_any(self, yield_hash, NULL);
 }

 /*
@@ -1532,7 +1532,7 @@ pgresult_stream_each(VALUE self)
 static VALUE
 pgresult_stream_each_row(VALUE self)
 {
-       return pgresult_stream_any(self, yield_array);
+       return pgresult_stream_any(self, yield_array, NULL);
 }

 /*
@@ -1549,7 +1549,7 @@ pgresult_stream_each_tuple(VALUE self)
        /* allocate VALUEs that are shared between all streamed tuples */
        ensure_init_for_tuple(self);

-       return pgresult_stream_any(self, yield_tuple);
+       return pgresult_stream_any(self, yield_tuple, NULL);
 }

 /*

sequel_pg:

index d6c1503..41dfdd3 100644
--- a/ext/sequel_pg/sequel_pg.c
+++ b/ext/sequel_pg/sequel_pg.c
@@ -70,6 +70,7 @@
 PGconn* pg_get_pgconn(VALUE);
 PGresult* pgresult_get(VALUE);
 int pg_get_result_enc_idx(VALUE);
+VALUE pgresult_stream_any(VALUE self, void (*yielder)(VALUE, int, int, void*), void* data);

 static int spg_use_ipaddr_alloc;
 static int spg_use_pg_get_result_enc_idx;
@@ -1659,6 +1660,39 @@ static VALUE spg_set_single_row_mode(VALUE self) {
   return Qnil;
 }

+struct spg__yield_each_row_stream_data {
+  VALUE self;
+  VALUE *colsyms;
+  VALUE *colconvert;
+  VALUE pg_value;
+  PGresult *res;
+  int enc_index;
+  char type;
+};
+
+static void spg__yield_each_row_stream(VALUE rres, int ntuples, int nfields, void *rdata) {
+  struct spg__yield_each_row_stream_data* data = (struct spg__yield_each_row_stream_data *)rdata;
+  VALUE h = rb_hash_new();
+  VALUE self = data->self;
+  VALUE *colsyms = data->colsyms;
+  VALUE *colconvert= data->colconvert;
+  PGresult *res = data->res;
+  int enc_index = data->enc_index;
+  long j;
+
+  for(j=0; j<nfields; j++) {
+    rb_hash_aset(h, colsyms[j], spg__col_value(self, res, 0, j, colconvert , enc_index));
+  }
+
+  if(data->type == SPG_YIELD_MODEL) {
+    VALUE model = rb_obj_alloc(data->pg_value);
+    rb_ivar_set(model, spg_id_values, h);
+    rb_yield(model);
+  } else {
+    rb_yield(h);
+  }
+}
+
 static VALUE spg__yield_each_row_internal(VALUE self, VALUE rconn, VALUE rres, PGresult *res, int enc_index, VALUE *colsyms, VALUE *colconvert) {
   long nfields;
   long j;
@@ -1667,6 +1701,7 @@ static VALUE spg__yield_each_row_internal(VALUE self, VALUE rconn, VALUE rres, P
   VALUE pg_type;
   VALUE pg_value = Qnil;
   char type = SPG_YIELD_NORMAL;
+  struct spg__yield_each_row_stream_data data;

   nfields = PQnfields(res);

@@ -1684,6 +1719,17 @@ static VALUE spg__yield_each_row_internal(VALUE self, VALUE rconn, VALUE rres, P

   spg_set_column_info(self, res, colsyms, colconvert, enc_index);

+  data.self = self;
+  data.colsyms = colsyms;
+  data.colconvert = colconvert;
+  data.pg_value = pg_value;
+  data.res = res;
+  data.enc_index = enc_index;
+  data.type = type;
+
+  pgresult_stream_any(rres, spg__yield_each_row_stream, &data);
+  return self;
+
   while (PQntuples(res) != 0) {
     h = rb_hash_new();
     for(j=0; j<nfields; j++) {

1 reply

janko Mar 5, 2022
Author

Hmm, somehow I'm not able to figure out how to apply these diffs. I cloned both repositories, copied each diff to the clipboard, and ran pbpaste | git apply in the project root, but in both cases I get error: corrupt patch at line 77 (always the last line).

jeremyevans · 2022-03-04T19:15:32Z

jeremyevans
Mar 4, 2022
Maintainer

I updated sequel_pg to not use streaming by default for paged_each: jeremyevans/sequel_pg@d48a7ee

0 replies

janko · 2022-03-19T11:06:01Z

janko
Mar 19, 2022
Author

Thank you for helping ruby-pg patch the performance regression, and for finding a way to utilize #stream_each 🤘🏻. I'm also glad that Active Record users can now utilize PG streaming as well 🙂

0 replies
