Jan Wieck JanWieck at Yahoo.com
Thu Nov 25 08:45:21 PST 2010
On 11/23/2010 11:53 AM, Christopher Browne wrote:
> Steve Singer <ssinger at ca.afilias.info> writes:
>>  On 10-11-23 09:48 AM, Vick Khera wrote:
>>>  On Tue, Nov 23, 2010 at 9:31 AM, Steve Singer <ssinger at ca.afilias.info> wrote:
>>>>  Slony can get into a state where it can't keep up/catch up with
>>>>  replication because the sl_log table is so large.
>>>>
>>>>
>>>>  Does this problem bite people often enough in the real world for us to
>>>>  devote effort to fixing?
>>>>
>>>
>>>  It used to happen to me a lot when I had my origin running on spinning
>>>  media.  Ever since I moved to an SSD, it doesn't really happen.  At
>>>  worst, when I do a large delete I fall behind by a few minutes, but it
>>>  catches up quickly.  For me, it didn't even require taking the DB down
>>>  for any extended period... just running a large update or delete that
>>>  touched many, many rows (i.e., generated a lot of events in sl_log) could
>>>  send the system into a tailspin that would take hours or possibly days
>>>  (until we hit a weekend) to recover from.
>>>
>>>  I am not sure it was caused by the log being too big... because
>>>  sometimes reindexing the tables on the replica would clear up the
>>>  backlog quickly too.  But I may be sniffing down the wrong trail.
>>>
>>
>>  The other place this will hit busy systems is during the initial sync.
>>  If your database is very large (or very busy), a lot of log rows can
>>  accumulate while that initial sync is going on.  OMIT_COPY doesn't help
>>  you because it requires an outage to get the master and slave in sync
>>  (just the loading time on a 1TB database is a while).
>>
>>  CLONE PREPARE/FINISH also doesn't help because a) it only works if
>>  you already have at least one subscriber set up, and b) after you do the
>>  CLONE PREPARE, any later transactions still need to be kept in sl_log
>>  until the new slave is up and running.
>
> I'm not sure that we gain much by splitting the logs into a bunch of
> pieces for that case.
>
> It's still the same huge backlog, and until it gets worked down, it's
> bloated, period.

Agreed.

>
> A different suggestion, that doesn't involve any changes to Slony...
>
> Initially, it might be a good idea to set up the new subscriber with
> FORWARD=no...

I don't think it is actually the new subscriber that is having a 
problem, but rather the fact that the origin and all forwarders keep the 
log and event data until every subscriber has confirmed it.

Let me ignore the problem of the initial COPY and catch-up of the first 
replica for now.

This fully redundant keeping of log and event data, until everyone has 
confirmed everything, is done so that any subscriber can be reconfigured 
to use another data provider at any point in time, willy nilly. It also 
keeps things simple for failover. But it does hurt in the cases 
described, like taking a leaf node down for a few days.
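
To make that concrete: the cleanup watermark on a node is effectively 
the minimum, over all receiving nodes, of the highest event each one has 
confirmed. A simplified sketch against the sl_confirm table (which lives 
in the cluster's schema; illustrative only, the actual cleanup function 
differs in detail):

    -- Highest event seqno per origin that EVERY node has confirmed.
    -- A single unconfirmed (or down) subscriber holds this watermark
    -- back for the whole cluster.
    SELECT con_origin,
           min(max_confirmed) AS safe_watermark
      FROM (SELECT con_origin, con_received,
                   max(con_seqno) AS max_confirmed
              FROM sl_confirm
             GROUP BY con_origin, con_received) AS per_receiver
     GROUP BY con_origin;

    -- Only sl_event and sl_log rows at or below safe_watermark for
    -- that origin are eligible for deletion.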

The cure for this would be to add configuration information so that 
certain nodes will be allowed to "assume" confirmation from another node 
and go ahead with their cleanup. This certainly complicates the 
configuration, but I "assume" power users are fine with that.
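
As a purely hypothetical example of what that configuration could look 
like (nothing like this exists in Slony today; the names are made up), a 
small catalog mapping a node to a stand-in whose confirmations may be 
assumed on its behalf:

    -- Hypothetical sketch: while ac_node is unreachable, treat
    -- confirmations from ac_proxy as if ac_node had made them, so
    -- cleanup can proceed past the offline leaf.
    CREATE TABLE sl_assume_confirm (
        ac_node   int4 PRIMARY KEY,  -- e.g. a leaf node taken down for days
        ac_proxy  int4 NOT NULL      -- node whose confirmations stand in
    );

The watermark computation above would then substitute the proxy's 
confirmations wherever the offline node's are missing or stale.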


Jan

-- 
Anyone who trades liberty for security deserves neither
liberty nor security. -- Benjamin Franklin

