<div dir="ltr"><div><div>Hello Jan,<br><br></div>The Master and Slave DBs talk through a firewall.<br></div><div>VIP IPs and SNAT IPs are used in pg_hba.conf.<br><br></div><div>The corresponding messages in the postgres server log:<br>
</div><div><br><span style="font-size:10.5pt;font-family:"Calibri","sans-serif";color:black">2013-06-13 09:46:21.224
GMT,,,6630,"<a href="http://10.4.2.2:42031">10.4.2.2:42031</a>",51b994ed.19e6,1,"",2013-06-13
09:46:21 GMT,,0,LOG,08P01,"incomplete startup
packet",,,,,,,,,""</span>
<br>2013-06-13 09:57:38.596 GMT,"postgres","db01",6634,"<ip address printed here>:53924",51b994f7.19ea,1,"idle",2013-06-13 09:46:31 GMT,28/0,0,LOG,08006,"could not receive data from client: Connection reset by peer",,,,,,,,,"slon.node_1_listen"<br>
2013-06-13 09:57:38.596 GMT,"postgres","db01",6634,"<ip address printed here>:53924",51b994f7.19ea,2,"idle",2013-06-13 09:46:31 GMT,28/0,0,LOG,08P01,"unexpected EOF on client connection",,,,,,,,,"slon.node_1_listen"<br>
2013-06-13 09:57:38.607 GMT,"postgres","db01",6637,"<ip address printed here>:53926",51b994f9.19ed,1,"idle",2013-06-13 09:46:33 GMT,32/0,0,LOG,08006,"could not receive data from client: Connection reset by peer",,,,,,,,,"slon.subscriber_1_provider_1"<br>
2013-06-13 09:57:38.607 GMT,"postgres","db01",6637,"<ip address printed here>:53926",51b994f9.19ed,2,"idle",2013-06-13 09:46:33 GMT,32/0,0,LOG,08P01,"unexpected EOF on client connection",,,,,,,,,"slon.subscriber_1_provider_1"<br>
2013-06-13 09:57:38.608 GMT,"postgres","db01",6635,"<ip address printed here>:53925",51b994f7.19eb,1,"idle",2013-06-13 09:46:31 GMT,31/0,0,LOG,08006,"could not receive data from client: Connection reset by peer",,,,,,,,,"slon.node_1_listen"<br>
2013-06-13 09:57:38.608 GMT,"postgres","db01",6635,"<ip address printed here>:53925",51b994f7.19eb,2,"idle",2013-06-13 09:46:31 GMT,31/0,0,LOG,08P01,"unexpected EOF on client connection",,,,,,,,,"slon.node_1_listen"<br>
<br></div><div>The client slon log contains:<br>2013-06-13 09:57:38 GMT FATAL cleanupThread: "begin;lock table "_xx_cluster".sl_config_lock;select "_xx_cluster".cleanupEvent('10 minutes'::interval);commit;" - server closed the connection unexpectedly<br>
This probably means the server terminated abnormally <br> before or while processing the request.<br><br><br></div><div>Thanks,<br></div><div>Sridevi<br></div><div><br><br><br></div></div><div class="gmail_extra">
<br><br><div class="gmail_quote">On Thu, Jun 13, 2013 at 12:02 AM, Jan Wieck <span dir="ltr"><<a href="mailto:JanWieck@yahoo.com" target="_blank">JanWieck@yahoo.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div class="im">On 06/12/13 10:17, Sridevi R wrote:<br>
> Jan,<br>
><br>
> Thanks for the reply.<br>
><br>
> The only errors in the slon log are failure of cleanupThread.<br>
> child process is restarting right after the cleanupThread Failure.<br>
> This occurs approximately every 10 minutes since cleanup_interval is set<br>
> to 10 minutes.<br>
><br>
> Here is a sample from the log again:<br>
><br>
> 2013-06-06 14:23:27 GMT FATAL cleanupThread: "begin;lock table<br>
> "_xx_cluster".sl_config_lock;select "_xx_cluster".cleanupEvent('10<br>
> minutes'::interval);commit;" - server closed the connection unexpectedly<br>
> This probably means the server terminated abnormally<br>
> before or while processing the request.<br>
> 2013-06-06 14:23:27 GMT CONFIG slon: child terminated signal: 9; pid:<br>
> 16135, current worker pid: 16135<br>
> 2013-06-06 14:23:27 GMT CONFIG slon: restart of worker in 10 seconds<br>
<br>
</div>"server closed the connection unexpectedly" ...<br>
<br>
Is this connection by any chance through some firewall or NAT gateway<br>
that will drop idle connections?<br>
<br>
What are the corresponding postmaster server log entries? Since slony<br>
reports an unexpected connection drop from the server, the server must<br>
have some message in its log too, because the client never sent the 'X'<br>
libpq protocol message.<br>
<br>
<br>
Jan<br>
<div class="im"><br>
<br>
><br>
> Thanks ,<br>
> Sridevi<br>
><br>
><br>
> On Wed, Jun 12, 2013 at 7:33 PM, Jan Wieck <<a href="mailto:JanWieck@yahoo.com">JanWieck@yahoo.com</a><br>
</div><div><div class="h5">> <mailto:<a href="mailto:JanWieck@yahoo.com">JanWieck@yahoo.com</a>>> wrote:<br>
><br>
> On 06/12/13 07:14, Sridevi R wrote:<br>
> > Hello,<br>
> ><br>
> > The slony logs are consistently posting this error:<br>
> ><br>
> > 2013-06-12 10:01:05 GMT FATAL cleanupThread: "begin;lock table<br>
> > "_xx_cluster".sl_config_lock;select "_xx_cluster".cleanupEvent('10<br>
> > minutes'::interval);commit;" - server closed the connection<br>
> unexpectedly<br>
> > 2013-06-12 10:12:24 GMT FATAL cleanupThread: "begin;lock table<br>
> > "_xx_cluster".sl_config_lock;select "_xx_cluster".cleanupEvent('10<br>
> > minutes'::interval);commit;" - server closed the connection<br>
> unexpectedly<br>
> ><br>
> > checked and found that sl_confirm table is not cleaned up. cleanup<br>
> event<br>
> > never succeeds.<br>
> > Additionally, the child processes terminates and restarts after each<br>
> > such cleanup failure.<br>
> ><br>
> > 2013-06-11 11:20:04 GMT CONFIG slon: child terminated signal: 9; pid:<br>
> > 20172, current worker pid: 20172<br>
> > 2013-06-11 11:20:04 GMT CONFIG slon: restart of worker in 10 seconds<br>
> ><br>
> > When cleanup is run manually, on the psql prompt it runs to completion<br>
> > without any issues and cleans up sl_event and sl_confirm tables<br>
> > "begin;lock table "_xx_cluster".sl_config_lock;select<br>
> > "_xx_cluster".cleanupEvent('10 minutes'::interval);commit;"<br>
> ><br>
> > Soln version: 2.1.2<br>
> ><br>
> > Any help/insight would be greatly appreciated.<br>
><br>
> Slon kills its worker(s) with signal 9 (SIGKILL) when it needs to<br>
> restart, like when there are errors in event processing or if it<br>
> receives certain signals. Are there any other errors in the slon log or<br>
> is something on the machine sending signals to slon?<br>
><br>
><br>
> Jan<br>
><br>
> ><br>
> > Thanks,<br>
> > Sridevi<br>
> ><br>
> ><br>
> ><br>
> > _______________________________________________<br>
> > Slony1-general mailing list<br>
> > <a href="mailto:Slony1-general@lists.slony.info">Slony1-general@lists.slony.info</a><br>
</div></div>> <mailto:<a href="mailto:Slony1-general@lists.slony.info">Slony1-general@lists.slony.info</a>><br>
<div class="HOEnZb"><div class="h5">> > <a href="http://lists.slony.info/mailman/listinfo/slony1-general" target="_blank">http://lists.slony.info/mailman/listinfo/slony1-general</a><br>
> ><br>
><br>
><br>
> --<br>
> Anyone who trades liberty for security deserves neither<br>
> liberty nor security. -- Benjamin Franklin<br>
><br>
><br>
<br>
<br>
--<br>
Anyone who trades liberty for security deserves neither<br>
liberty nor security. -- Benjamin Franklin<br>
</div></div></blockquote></div><br></div>