From: Shankar H. <har...@ya...> - 2012-07-09 21:32:15
I ran the same pgbench test after reconfiguring max_prepared_transactions to 100 (equal to the max_connections configured on either coordinator) just to see how it fares. Findings from the run are shared below.

Node 1 - Coord1, Datanode1, gtm-proxy1
Node 2 - Coord2, Datanode2, gtm-proxy2
Node 3 - Datanode3, gtm

This time I definitely did see a spike in numbers (compared to my last run, where max_prepared_transactions was at 10) but started seeing errors with 10 concurrent connections to Coord1. The errors seen were different, though.

Test results:

Clients | Threads | Duration | Transactions
1       | 1       | 100      | 5288
2       | 2       | 100      | 9012
4       | 4       | 100      | 13998
6       | 6       | 100      | 17451
8       | 8       | 100      | 20450
10      | 10      | 100      | 22766  -> 3% bump compared to last run
12      | 12      | 100      | 25694  -> 24% bump compared to last run

10 clients:
Client 9 aborted in state 12: ERROR: GTM error, could not obtain snapshot

12 clients:
Client 11 aborted in state 11: ERROR: GTM error, could not obtain snapshot
Client 8 aborted in state 11: ERROR: GTM error, could not obtain snapshot

14 clients:
The run was left hanging after a few GTM errors.

Question: these snapshot errors were seen on all 3 nodes' consoles. What could cause this error? Is the proxy a bottleneck now, due to all load being applied on node 1 instead of being split between node 1 and node 2?

Some more info from the nodes below.

node1 - coordinator1

postgres=# select * from pg_stat_activity;
 datid | datname  | procpid | usesysid | usename  | application_name | client_addr      | client_hostname | client_port | backend_start                 | xact_start                    | query_start                   | waiting | current_query
-------+----------+---------+----------+----------+------------------+------------------+-----------------+-------------+-------------------------------+-------------------------------+-------------------------------+---------+---------------------------------
 12804 | postgres |   10016 |       10 | postgres | psql             |                  |                 |          -1 | 2012-06-18 11:17:28.781838-05 |                               | 2012-07-05 15:28:45.498451-05 | f       | <IDLE>
 12804 | postgres |   22951 |       10 | postgres | psql             |                  |                 |          -1 | 2012-06-21 03:11:11.030994-05 | 2012-07-08 07:13:51.662961-05 | 2012-07-08 07:15:23.654176-05 | f       | select * from pg_stat_activity;
 12804 | postgres |   22472 |       10 | postgres |                  | <pgbench client> |                 |       57249 | 2012-06-21 02:52:29.436629-05 |                               | 2012-07-08 06:50:27.698791-05 | f       | <IDLE> in transaction (aborted)
 12804 | postgres |   22475 |       10 | postgres |                  | <pgbench client> |                 |       57252 | 2012-06-21 02:52:29.44397-05  |                               | 2012-07-08 06:50:27.694543-05 | f       | <IDLE> in transaction (aborted)
(4 rows)

node2 - coordinator2

postgres=# select * from pg_stat_activity;
 datid | datname  | procpid | usesysid | usename  | application_name | client_addr | client_hostname | client_port | backend_start                 | xact_start                    | query_start                   | waiting | current_query
-------+----------+---------+----------+----------+------------------+-------------+-----------------+-------------+-------------------------------+-------------------------------+-------------------------------+---------+---------------------------------
 12804 | postgres |   14601 |       10 | postgres | pgxc             | node1       |                 |       45669 | 2012-07-08 03:17:13.724456-05 |                               | 2012-07-08 06:24:21.055871-05 | f       | <IDLE>
 12804 | postgres |   17271 |       10 | postgres | psql             |             |                 |          -1 | 2012-07-08 07:08:28.768987-05 | 2012-07-08 07:14:04.887459-05 | 2012-07-08 07:15:18.278305-05 | f       | select * from pg_stat_activity;
(2 rows)

All 3 datanodes 1/2/3 had 16 idle connections, with about 14 originating from coord1 and just 1 from coord2. This, I am guessing, is because all traffic originated from coord1. Is that right?

thanks,
Shankar
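A quick way to watch two-phase-commit pressure during a run like this is to poll the standard pg_prepared_xacts view on each coordinator and datanode while pgbench is going, and compare it against the configured ceiling. A minimal sketch (plain PostgreSQL views, nothing XC-specific):

-- How close are we to the ceiling? pg_prepared_xacts lists transactions
-- that have been PREPAREd but not yet committed or rolled back.
SHOW max_prepared_transactions;

SELECT count(*) AS prepared_now, min(prepared) AS oldest_prepare_time
FROM pg_prepared_xacts;

-- Per-database breakdown, handy when several tests share a node.
SELECT database, count(*) FROM pg_prepared_xacts GROUP BY database ORDER BY 2 DESC;

If prepared_now sits at the limit while the GTM snapshot errors appear, the two problems would seem related; if it stays low, the snapshot errors point more toward the GTM/proxy path.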
________________________________
From: "pos...@li..." <pos...@li...>
To: pos...@li...
Sent: Monday, July 9, 2012 11:33 AM
Subject: Postgres-xc-developers Digest, Vol 25, Issue 23

Send Postgres-xc-developers mailing list submissions to pos...@li...

To subscribe or unsubscribe via the World Wide Web, visit https://lists.sourceforge.net/lists/listinfo/postgres-xc-developers or, via email, send a message with subject or body 'help' to pos...@li...

You can reach the person managing the list at pos...@li...

When replying, please edit your Subject line so it is more specific than "Re: Contents of Postgres-xc-developers digest..."

Today's Topics:

   1. Trigger support in XC (pramodh mereddy)
   2. Re: Question on gtm-proxy (Shankar Hariharan)

----------------------------------------------------------------------

Message: 1
Date: Mon, 9 Jul 2012 08:32:25 -0500
From: pramodh mereddy <pos...@gm...>
Subject: [Postgres-xc-developers] Trigger support in XC
To: pos...@li...
Message-ID: <CAK...@ma...>
Content-Type: text/plain; charset="iso-8859-1"

1) Are triggers fully supported in XC?
2) Can I set up slony on datanodes to replicate certain tables to a different postgres cluster?

Pramodh Mereddy

-------------- next part --------------
An HTML attachment was scrubbed...

------------------------------

Message: 2
Date: Mon, 9 Jul 2012 09:11:53 -0700 (PDT)
From: Shankar Hariharan <har...@ya...>
Subject: Re: [Postgres-xc-developers] Question on gtm-proxy
To: Ashutosh Bapat <ash...@en...>
Cc: "pos...@li..." <pos...@li...>
Message-ID: <134...@we...>
Content-Type: text/plain; charset="iso-8859-1"

Thanks Ashutosh. You are right: while running this test I just had pgbench running against one coordinator. Looks like pgbench by itself may not be an apt tool for this kind of testing; I will instead run pgbench's underlying sql script from the cmdline against either coordinator. Thanks for that tip.

I got a lot of input on my problem from a lot of folks on the list; the feedback is much appreciated. Thanks everybody! On max_prepared_transactions, I will factor in the number of coordinators and the max_connections on each coordinator while arriving at a figure. Will also try out Koichi Suzuki's suggestion to have multiple NICs on the GTM. I will post my findings here for the same cluster configuration as before.

thanks,
Shankar

________________________________
From: Ashutosh Bapat <ash...@en...>
To: Shankar Hariharan <har...@ya...>
Cc: "pos...@li..." <pos...@li...>
Sent: Sunday, July 8, 2012 11:02 PM
Subject: Re: [Postgres-xc-developers] Question on gtm-proxy

Hi Shankar,
You have got answers to the prepared transaction problem, I guess. I have something else below.

On Sat, Jul 7, 2012 at 1:44 AM, Shankar Hariharan <har...@ya...> wrote:

As planned I ran some tests using pgbench on this setup:
>
>Node 1 - Coord1, Datanode1, gtm-proxy1
>Node 2 - Coord2, Datanode2, gtm-proxy2
>Node 3 - Datanode3, gtm
>
>I was connecting via Coord1 for these tests:
>- scale factor of 30 used
>- tests run using the following input parameters for pgbench:

Try connecting to both the coordinators; it should give you better performance, especially when you are using distributed tables. With distributed tables, the coordinator gets involved in query execution more than in the case of replicated tables. So, balancing load across the two coordinators would help.

>
>Clients | Threads | Duration | Transactions
>1       | 1       | 100      | 6204
>2       | 2       | 100      | 9960
>4       | 4       | 100      | 12880
>6       | 6       | 100      | 16768
>8       | 8       | 100      | 19758
>10      | 10      | 100      | 21944
>12      | 12      | 100      | 20674
>
>The run went well until the 8 clients.
I started seeing errors from 10 clients onwards, and eventually the 14 client run has been hanging around for over an hour now. The errors I have been seeing on the console are the following:
>
>pgbench console:
>Client 8 aborted in state 12: ERROR: GTM error, could not obtain snapshot
>Client 0 aborted in state 13: ERROR: maximum number of prepared transactions reached
>Client 7 aborted in state 13: ERROR: maximum number of prepared transactions reached
>Client 11 aborted in state 13: ERROR: maximum number of prepared transactions reached
>Client 9 aborted in state 13: ERROR: maximum number of prepared transactions reached
>
>node console:
>ERROR: GTM error, could not obtain snapshot
>STATEMENT: INSERT INTO pgbench_history (tid, bid, aid, delta, mtime) VALUES (253, 26, 1888413, -817, CURRENT_TIMESTAMP);
>ERROR: maximum number of prepared transactions reached
>HINT: Increase max_prepared_transactions (currently 10).
>STATEMENT: PREPARE TRANSACTION 'T201428'
>ERROR: maximum number of prepared transactions reached
>STATEMENT: END;
>ERROR: maximum number of prepared transactions reached
>STATEMENT: END;
>ERROR: maximum number of prepared transactions reached
>STATEMENT: END;
>ERROR: maximum number of prepared transactions reached
>STATEMENT: END;
>ERROR: GTM error, could not obtain snapshot
>STATEMENT: INSERT INTO pgbench_history (tid, bid, aid, delta, mtime) VALUES (140, 29, 2416403, -4192, CURRENT_TIMESTAMP);
>
>I was also watching the processes on each node and see the following for the 14 client run:
>
>Node1:
>postgres 25571 10511  0 04:41 ?        00:00:02 postgres: postgres postgres ::1(33481) TRUNCATE TABLE waiting
>postgres 25620 11694  0 04:46 ?        00:00:00 postgres: postgres postgres pgbench-address (50388) TRUNCATE TABLE
>
>Node2:
>postgres 10979  9631  0 Jul05 ?        00:00:42 postgres: postgres postgres coord1-address(57357) idle in transaction
>
>Node3:
>postgres 20264  9911  0 08:35 ?        00:00:05 postgres: postgres postgres coord1-address(51406) TRUNCATE TABLE waiting
>
>I was going to restart the processes on all nodes and start over, but did not want to lose this data as it could be useful information.
>
>Any explanation of the above issue is much appreciated. I will try the next run with a higher value set for max_prepared_transactions. Any recommendations for a good value on this front?
>
>thanks,
>Shankar
>
>________________________________
> From: Shankar Hariharan <har...@ya...>
>To: Ashutosh Bapat <ash...@en...>
>Cc: "pos...@li..." <pos...@li...>
>Sent: Friday, July 6, 2012 8:22 AM
>Subject: Re: [Postgres-xc-developers] Question on gtm-proxy
>
>Hi Ashutosh,
>I was trying to size the load on a server and was wondering if a GTM could be shared w/o much performance overhead between a small number of datanodes and coordinators. I will post my findings here.
>thanks,
>Shankar
>
>________________________________
> From: Ashutosh Bapat <ash...@en...>
>To: Shankar Hariharan <har...@ya...>
>Cc: "pos...@li..." <pos...@li...>
>Sent: Friday, July 6, 2012 12:25 AM
>Subject: Re: [Postgres-xc-developers] Question on gtm-proxy
>
>Hi Shankar,
>Running gtm-proxy has been shown to improve performance, because it lessens the load on the GTM by serving requests locally. Why do you want the coordinators to connect directly to the GTM? Are you seeing any performance improvement from doing that?
>
>On Fri, Jul 6, 2012 at 10:08 AM, Shankar Hariharan <har...@ya...> wrote:
>
>>Follow up to earlier email.
>>In the setup described below, can I avoid using a gtm-proxy? That is, can I just simply point the coordinators to the one gtm running on node 3?
>>My initial plan was to just run the gtm on node 3; then I thought I could try a datanode without a local coordinator, which was why I put these two together on node 3.
>>thanks,
>>Shankar
>>
>>________________________________
>> From: Shankar Hariharan <har...@ya...>
>>To: "pos...@li..." <pos...@li...>
>>Sent: Thursday, July 5, 2012 11:35 PM
>>Subject: Question on multiple coordinators
>>
>>Hello,
>>
>>Am trying out XC 1.0 in the following configuration.
>>Node 1 - Coord1, Datanode1, gtm-proxy1
>>Node 2 - Coord2, Datanode2, gtm-proxy2
>>Node 3 - Datanode3, gtm
>>
>>I set up all nodes but forgot to add Coord1 to Coord2 and vice versa. In addition, I missed the pg_hba edit as well. So the first table T1 that I created for distribution from Coord1 was not "visible" from Coord2, but was on all the data nodes.
>>I tried to get Coord2 back into business in various ways, but the first table I created refused to show up on Coord2:
>>- edit pg_hba and add the node on both coord1 and 2, then run select pgxc_pool_reload();
>>- restart coord 1 and 2
>>- drop node c2 from c1 and c1 from c2 and add them back, followed by select pgxc_pool_reload();
>>
>>So I tried to create the same table T1 from Coord2 to observe the behavior, and it clearly did not like it, as all the nodes it "wrote" to reported that the table already existed, which was good. At this point I could understand that Coord2 and Coord1 were not talking properly, so I created a new table from coord1 with replication. This table was visible from both now.
>>
>>Question is: should I expect to see the first table, let me call it T1, after a while from Coord2 also?
>>
>>thanks,
>>Shankar
>
>--
>Best Wishes,
>Ashutosh Bapat
>EnterpriseDB Corporation
>The Enterprise Postgres Company

--
Best Wishes,
Ashutosh Bapat
EnterpriseDB Corporation
The Enterprise Postgres Company

-------------- next part --------------
An HTML attachment was scrubbed...

------------------------------

End of Postgres-xc-developers Digest, Vol 25, Issue 23
******************************************************
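For anyone hitting the same coordinator-registration problem: the usual sequence is to register each coordinator on the other and then refresh the connection pool. A minimal sketch, assuming XC 1.0's node DDL; the node names, hosts, and port here are illustrative placeholders:

-- On Coord1: tell it about Coord2 (host and port are placeholders).
CREATE NODE coord2 WITH (TYPE = 'coordinator', HOST = 'node2', PORT = 5432);
SELECT pgxc_pool_reload();

-- On Coord2: the mirror-image registration.
CREATE NODE coord1 WITH (TYPE = 'coordinator', HOST = 'node1', PORT = 5432);
SELECT pgxc_pool_reload();

As the thread suggests, DDL issued while a coordinator was unregistered is not replayed retroactively, so a table like T1 would still need to be created on the catching-up coordinator by hand.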
From: Nikhil S. <ni...@st...> - 2012-07-09 20:50:26
>>>> There are many trailing white spaces in the patch. Please fix those; they unnecessarily fail the automatic merges sometimes. You can do that when you commit the patch.
>>>
>>> Oh OK, I didn't notice. Do you have some places particularly in mind?
>>
>> Apply your patch on a clean repository using git apply and it will show you.
>
> I'll make a test.

In your .gitconfig in your home dir, if you set the below, it will typically show the trailing blanks in red:

[color]
        ui = auto

Pretty useful before submitting a patch. There are other options like -b, -w or --ignore-space-at-eol that should be able to help, but I am happy with the ui setting above.

HTH,
Nikhils

>>>> Code
>>>> ====
>>>> 1. There is a lot of code which refactors existing code and renames functions, and which is not necessarily related to the redistribution work. Can you please provide separate patches for this refactoring? We should commit them separately. For example, build_subcluster_data() has been renamed (for good, maybe), but it makes sense if we do it separately. Someone looking at the ALTER TABLE commit should not get overwhelmed by the extraneous changes.
>>>
>>> OK. The problem with the functions currently on master was that their names were not really generic and sometimes did not reflect their real functionality. So as the plan is now to use them in a more general way, I think their names are not going to change anymore.
>>>
>>>> 2. Same is the case with the grammar changes. Please separate the grammar changes related to pgxc_nodelist etc. into a separate patch, although it's because of ALTER TABLE that you need to do those changes.
>>>
>>> OK, understood.
>>>
>>>> Please get these patches reviewed as well, since I haven't looked at the changes proper.
>>>
>>> Understood. I'll make those 2 patches tomorrow morning, not a big deal.
>>>
>>>> Tests
>>>> =====
>>>> 1. There is no need to test with huge data; that slows down regression. For performance testing, you can create a separate test (not to be included in regression), if you want.
>>>
>>> That may be an idea. However, you are right, I'll limit the number of rows tested.
>>>
>>>> 2. We need tests which will test the plan cache (in)validation upon redistribution of data, and tests for existing views working after the redistribution. Please take a look at the PG alter table test for more such scenarios.
>>>
>>> OK, I'll add those scenarios. They will be included in xc_alter_table.
>>>
>>>> If you happen to add some performance tests, it would also be good to test the sanity of concurrent transactions accessing the object/s being redistributed. It's vital considering that such a redistribution would run for longer.
>>>
>>> Yes, it would be nice to.
>>>
>>>> 3. Instead of relying on count(*) to show the sanity of the redistributed data, you may use better aggregates like array_agg or sum(), avg() and count(). I would prefer array_agg over the others, since you can list all the data values there. You will need the aggregate's ORDER BY clause (not that of the SELECT).
>>>> 4. In the case of redistribution of a table with an index, you will need to check the sanity of the index after the redistribution by some means.
>>>
>>> Do you have an idea of how to do that? Pick up some tests from postgres?
>>
>> Good question. But I don't have an answer (specifically for XC, since the indexes are on datanodes).
>
> OK, I'll figure something out myself.
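Points 3 and 4 above lend themselves to a small, deterministic sanity recipe. A sketch, assuming a hypothetical table t_dist with an index on its val column, and using the provisional ALTER TABLE redistribution syntax under review in this thread:

-- Baseline before redistribution; the aggregate's ORDER BY makes the
-- output deterministic no matter which datanode returns rows first.
SELECT count(*), sum(val), array_agg(val ORDER BY val) FROM t_dist;

-- Redistribute (syntax provisional, per the patch being reviewed).
ALTER TABLE t_dist DISTRIBUTE BY REPLICATION;

-- The same aggregates must be unchanged afterwards.
SELECT count(*), sum(val), array_agg(val ORDER BY val) FROM t_dist;

-- For point 4, one way to exercise the rebuilt index: force an index
-- scan and compare against the ordered baseline.
SET enable_seqscan = off;
SELECT val FROM t_dist WHERE val BETWEEN 10 AND 20 ORDER BY val;
RESET enable_seqscan;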
>>>> 5. I did not understand the significance of the tests where you add and drop a column and redistribute the data. The SELECT after the redistribution is not testing anything specific for the added/dropped column.
>>>
>>> The internal, let's say default, layer of the distribution mechanism uses an internal COPY, and it is important to do this check and correctly bypass the columns that are dropped. The SELECT is just here to check that the data has been redistributed correctly.
>>>
>>>> 6. There are no testcases which would change the distribution type and node list at the same time. Please add those. (I am assuming that these two operations are possible together.)
>>>
>>> Yeah, sorry, I have been working on that today and added some additional tests that can do that. They are in the bucket; I just didn't send the absolutely latest version.
>>>
>>>> 7. Negative testcases need to be improved.
>>>
>>> What are the negative test cases? It would be cool if you could be more precise.
>>
>> Tests which do negative testing (http://www.sqatester.com/methodology/PositiveandNegativeTesting.htm)
>
> So you mean that I need to reformat and refactor my test cases??!
>
>>>> Additional feature
>>>> ==================
>>>> It will be helpful to add the distribution information to the output of the \d command for tables. It will be a good tool for tests to check whether the catalogs have been updated correctly or not. Please add this feature before we complete ALTER TABLE. It shouldn't take much time. Please provide this as a separate patch.
>>>
>>> +1.
>>> This is a good idea, and I recall we had this discussion a couple of months ago. However, it is not directly related to redistribution, so it should be provided after committing the redistribution work, I believe.
>>
>> It will help in testing the feature. For example, you can just do \d on the redistributed table to see if the catalogs have been updated correctly or not. So it's better to do it before this ALTER TABLE, so that you can use it in the tests. It should have been done when the work related to the subcluster was done, even before, when XC was started :). Anyway, the earlier the better.
>>
>>> Also, I think we shouldn't use \d as it will impact other applications like pgadmin, for instance. We should use an extension of \d, for example \dZ. This is just a suggestion; I don't know which commands are still not in use.
>>
>> \d is for describing a relation at bare minimum. In XC the distribution strategy becomes an integral part of a relation, and thus should be part of the \d output. Applications using \d will need a change, but how many applications connect via psql to fire commands (very few, I guess), so we are not in much trouble.
>
> We are not sure about that, honestly! Just to be safe, we should use another command. It also impacts XC transparency with postgres. I also believe it is an add-on, which is not part of the redistribution core.
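Until such a psql change lands, the same information is already queryable from the catalog. A minimal sketch, assuming the pgxc_class catalog of XC 1.0 (column names per the 1.0 catalogs; treat them as an assumption):

-- Distribution strategy per table: pclocatortype is typically
-- 'R' = replicated, 'H' = hash, 'M' = modulo, 'N' = round robin.
SELECT x.pcrelid::regclass AS table_name,
       x.pclocatortype    AS locator_type,
       x.pcattnum         AS distribution_column_attnum
FROM pgxc_class x;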
>
> --
> Michael Paquier
> http://michael.otacoo.com

--
StormDB - http://www.stormdb.com
The Database Cloud
From: Abbas B. <abb...@en...> - 2012-07-09 20:04:11
Here is the update on the issue.

It was decided that the changes done by the data nodes to the command id should be communicated back to the coordinator, and that the coordinator should choose the largest of all the received values as the next command id.

It was suggested that we should check that a skipped value of command id does not create a problem for subsequent operations on the table. I have verified this both by studying the code and by actually changing the function CommandCounterIncrement to increment the command id by 3 and running regression. It worked fine, so a hole in the command id sequence is not a problem.

Next it was suggested that we should use the mechanism currently in place for sending the # of tuples affected by a statement to communicate the changed command id to the coordinator. Please refer to this link in the documentation: http://www.postgresql.org/docs/9.1/static/protocol-message-formats.html

Note that no message format exists in the current over-the-wire protocol for communicating the # of tuples affected by a statement. The libpq functions that we might suspect of doing so are PQntuples and PQcmdTuples. PQntuples simply returns the ntups member of PGresult, whereas PQcmdTuples extracts the # of tuples affected from the CommandComplete 'C' message string. We cannot use these mechanisms for our purpose.

I evaluated the use of NoticeResponse 'N' for sending the changed command id, but the message format of NoticeResponse mandates the use of certain fields which would make our messages unnecessarily bulky and would consume network bandwidth for no reason.

I therefore suggest that we use a new message for communicating XC-specific information from data node to coordinator. Currently we will use it for the command id, but we will design the message format to be flexible enough to accommodate future XC requirements. Whenever the data node increments the command id, we will send the information to the coordinator, and the handle_response function in execRemote.c will be changed to accommodate the new message. Since coordinators will never forward the new message to clients, existing clients do not need to bother.

Comments or suggestions are welcome.

Regards
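That a hole in the command id sequence is harmless can also be seen from plain SQL, using the same trick this thread uses elsewhere: in vanilla PostgreSQL, every statement in an explicit transaction consumes a command id even when it touches zero rows. A minimal sketch against the tt1 table used later in this thread:

BEGIN;
INSERT INTO tt1 VALUES (1);      -- stored with cmin 0
DELETE FROM tt1 WHERE false;     -- consumes command id 1, writes nothing
INSERT INTO tt1 VALUES (2);      -- stored with cmin 2: a hole at 1
SELECT xmin, cmin, f1 FROM tt1;  -- both rows visible despite the gap
COMMIT;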
On Wed, Jul 4, 2012 at 8:35 AM, Abbas Butt <abb...@en...> wrote:

> While fixing the regression failures resulting from the changes done by the patch, I was able to fix all except this test case:
>
> set enforce_two_phase_commit = off;
>
> CREATE TEMP TABLE users (
>   id INT PRIMARY KEY,
>   name VARCHAR NOT NULL
> ) DISTRIBUTE BY REPLICATION;
>
> INSERT INTO users VALUES (1, 'Jozko');
> INSERT INTO users VALUES (2, 'Ferko');
> INSERT INTO users VALUES (3, 'Samko');
>
> CREATE TEMP TABLE tasks (
>   id INT PRIMARY KEY,
>   owner INT REFERENCES users ON UPDATE CASCADE ON DELETE SET NULL,
>   worker INT REFERENCES users ON UPDATE CASCADE ON DELETE SET NULL,
>   checked_by INT REFERENCES users ON UPDATE CASCADE ON DELETE SET NULL
> ) DISTRIBUTE BY REPLICATION;
>
> INSERT INTO tasks VALUES (1,1,NULL,NULL);
> INSERT INTO tasks VALUES (2,2,2,NULL);
> INSERT INTO tasks VALUES (3,3,3,3);
>
> BEGIN;
> UPDATE tasks set id=id WHERE id=2;
> SELECT * FROM tasks;
> DELETE FROM users WHERE id = 2;
> SELECT * FROM tasks;
> COMMIT;
>
> The output obtained from the last select statement is
>
>  id | owner | worker | checked_by
> ----+-------+--------+------------
>   1 |     1 |        |
>   3 |     3 |      3 |          3
>   2 |     2 |      2 |
> (3 rows)
>
> whereas the expected output is
>
>  id | owner | worker | checked_by
> ----+-------+--------+------------
>   1 |     1 |        |
>   3 |     3 |      3 |          3
>   2 |       |        |
> (3 rows)
>
> Note that owner and worker have been set to null due to "ON DELETE SET NULL".
>
> Here is the reason why this does not work properly. Consider the last transaction:
>
> BEGIN;
> UPDATE tasks set id=id WHERE id=2;
> SELECT * FROM tasks;
> DELETE FROM users WHERE id = 2;
> SELECT * FROM tasks;
> COMMIT;
>
> Here are the command id values the coordinator sends to the data node:
>
> 0 for the first update, which gets incremented to 1 because this is a DML and needs to consume a command id;
> 1 for the first select, which remains 1 since it is not required to be consumed;
> 1 for the delete statement, which gets incremented to 2 because it is a DML;
> and 2 for the last select.
>
> Now this is what happens on the data node:
>
> When the data node receives the first update with command id 0, it increments it once due to the update itself and once due to the update run because of "ON UPDATE CASCADE". Hence the command id at the end of the update on the data node is 2.
> The first select comes to the data node with command id 1, which is incorrect. The user's intention is to see data after the update, and its command id should be 2.
> Now the delete comes with command id 1, and the data node increments it once due to the delete itself and once due to the update run because of "ON DELETE SET NULL"; hence the command id at the end of the delete is 3.
> The coordinator now sends the last select with command id 2, which is again incorrect, since the user's intention is to see data after the delete, and the select should have been sent to the data node with command id 3 or 4.
>
> Every time the data node increments the command id due to statements run implicitly, either because of constraints or triggers, this scheme of sending command ids from the coordinator to the data node to solve fetch problems will fail.
>
> A datanode can have a trigger, e.g. one inserting rows thrice on every single insert, and would increment the command id on every insert. Therefore this design cannot work.
>
> Either we have to synchronize command ids between datanode and coordinator through GTM,
> OR
> we will have to send the DECLARE CURSOR down to the datanode.
In this case > however we will not be able to send the cursor query as it is because the > query might contain a join on two tables which exist on a disjoint set of > data nodes. > > Comments or suggestions are welcome. > > > > On Tue, Jun 19, 2012 at 2:43 PM, Abbas Butt <abb...@en...>wrote: > >> Thanks for your comments. >> >> On Tue, Jun 19, 2012 at 1:54 PM, Ashutosh Bapat < >> ash...@en...> wrote: >> >>> Hi Abbas, >>> I have few comments to make >>> 1. With this patch there are two variables for having command Id, that >>> is going to cause confusion and will be a maintenance burden, might be >>> error prone. Is it possible to use a single variable instead of two? >> >> >> Are you talking about receivedCommandId and currentCommandId? If yes, I >> would prefer not having a packet received from coordinator overwrite the >> currentCommandId at data node, because I am not 100% sure about the life >> time of currentCommandId, I might overwrite it before time. It would be >> safe to let currentCommandId as is unless we are compelled to get the next >> command ID, and have the received command id take priority at that time. >> >> >>> Right now there is some code which is specific to cursors in your patch. >>> If you can plug the coordinator command id somehow into currentCommandId, >>> you won't need that code and any other code which needs coordinator command >>> ID will be automatically taken care of. >>> >> >> That code is required to solve a problem. Consider this case when a >> coordinator received this transaction >> >> >> BEGIN; >> insert into tt1 values(1); >> declare c50 cursor for select * from tt1; >> insert into tt1 values(2); >> fetch all from c50; >> COMMIT; >> >> While sending select to the data node in response to a fetch we need to >> know what was the command ID of the declare cursor statement and we need to >> send that command ID to the data node for this particular fetch. This is >> the main idea behind this solution. >> >> The first insert goes to the data node with command id 0, the second >> insert goes with 2. Command ID 1 is consumed by declare cursor. When >> coordinator sees fetch it needs to send select to the data node with >> command ID 1 rather than 3. >> >> >> >>> 2. A non-transaction on coordinator can spawn tranasactions on datanode >>> or subtransactions (if there is already a transaction running). Does your >>> patch handle that case? >> >> >> No and it does not need to, because that case has no known problems that >> we need to solve. I don't think my patch would impact any such case but I >> will analyze any failures that I may get in regressions. >> >> >>> Should we do more thorough research in the transaction management, esp. >>> to see the impact of getting same command id for two commands on the >>> datanode? >>> >> >> If we issue two commands with the same command ID then we will definitely >> have visibility issues according to the rules I have already explained. But >> we will not have two commands sent to the data node with same command id. >> >> >>> >>> >>> On Tue, Jun 19, 2012 at 1:56 PM, Abbas Butt <abb...@en... >>> > wrote: >>> >>>> Hi Ashutosh, >>>> Here are the results with the val column, Thanks. 
>>>> >>>> test=# drop table mvcc_demo; >>>> DROP TABLE >>>> test=# >>>> test=# create table mvcc_demo (val int); >>>> CREATE TABLE >>>> test=# >>>> test=# TRUNCATE mvcc_demo; >>>> TRUNCATE TABLE >>>> test=# >>>> test=# BEGIN; >>>> BEGIN >>>> test=# DELETE FROM mvcc_demo; -- increment command id to show that >>>> combo id would be different >>>> DELETE 0 >>>> test=# DELETE FROM mvcc_demo; >>>> DELETE 0 >>>> test=# DELETE FROM mvcc_demo; >>>> DELETE 0 >>>> test=# INSERT INTO mvcc_demo VALUES (1); >>>> INSERT 0 1 >>>> test=# INSERT INTO mvcc_demo VALUES (2); >>>> INSERT 0 1 >>>> test=# INSERT INTO mvcc_demo VALUES (3); >>>> INSERT 0 1 >>>> test=# SELECT t_xmin AS xmin, >>>> test-# t_xmax::text::int8 AS xmax, >>>> test-# t_field3::text::int8 AS cmin_cmax, >>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>> is_combocid >>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>> test-# ORDER BY 2 DESC, 3; >>>> xmin | xmax | cmin_cmax | is_combocid >>>> -------+------+-----------+------------- >>>> 80689 | 0 | 3 | f >>>> 80689 | 0 | 4 | f >>>> 80689 | 0 | 5 | f >>>> (3 rows) >>>> >>>> test=# >>>> test=# select xmin,xmax,cmin,cmax,* from mvcc_demo order by val; >>>> xmin | xmax | cmin | cmax | val >>>> -------+------+------+------+----- >>>> 80689 | 0 | 3 | 3 | 1 >>>> 80689 | 0 | 4 | 4 | 2 >>>> 80689 | 0 | 5 | 5 | 3 >>>> >>>> (3 rows) >>>> >>>> test=# >>>> test=# DELETE FROM mvcc_demo; >>>> DELETE 3 >>>> test=# SELECT t_xmin AS xmin, >>>> test-# t_xmax::text::int8 AS xmax, >>>> test-# t_field3::text::int8 AS cmin_cmax, >>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>> is_combocid >>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>> test-# ORDER BY 2 DESC, 3; >>>> xmin | xmax | cmin_cmax | is_combocid >>>> -------+-------+-----------+------------- >>>> 80689 | 80689 | 0 | t >>>> 80689 | 80689 | 1 | t >>>> 80689 | 80689 | 2 | t >>>> (3 rows) >>>> >>>> test=# >>>> test=# select xmin,xmax,cmin,cmax,* from mvcc_demo order by val; >>>> xmin | xmax | cmin | cmax | val >>>> ------+------+------+------+----- >>>> (0 rows) >>>> >>>> >>>> test=# >>>> test=# END; >>>> COMMIT >>>> test=# >>>> test=# >>>> test=# TRUNCATE mvcc_demo; >>>> TRUNCATE TABLE >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> test=# BEGIN; >>>> BEGIN >>>> test=# INSERT INTO mvcc_demo VALUES (1); >>>> INSERT 0 1 >>>> test=# INSERT INTO mvcc_demo VALUES (2); >>>> INSERT 0 1 >>>> test=# INSERT INTO mvcc_demo VALUES (3); >>>> INSERT 0 1 >>>> test=# SELECT t_xmin AS xmin, >>>> test-# t_xmax::text::int8 AS xmax, >>>> test-# t_field3::text::int8 AS cmin_cmax, >>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>> is_combocid >>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>> test-# ORDER BY 2 DESC, 3; >>>> xmin | xmax | cmin_cmax | is_combocid >>>> -------+------+-----------+------------- >>>> 80693 | 0 | 0 | f >>>> 80693 | 0 | 1 | f >>>> 80693 | 0 | 2 | f >>>> (3 rows) >>>> >>>> test=# >>>> test=# select xmin,xmax,cmin,cmax,* from mvcc_demo order by val; >>>> xmin | xmax | cmin | cmax | val >>>> -------+------+------+------+----- >>>> 80693 | 0 | 0 | 0 | 1 >>>> 80693 | 0 | 1 | 1 | 2 >>>> 80693 | 0 | 2 | 2 | 3 >>>> (3 rows) >>>> >>>> test=# >>>> test=# UPDATE mvcc_demo SET val = 10; >>>> >>>> UPDATE 3 >>>> test=# >>>> test=# SELECT t_xmin AS xmin, >>>> test-# t_xmax::text::int8 AS xmax, >>>> test-# t_field3::text::int8 AS cmin_cmax, >>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>> is_combocid >>>> test-# FROM 
heap_page_items(get_raw_page('mvcc_demo', 0)) >>>> test-# ORDER BY 2 DESC, 3; >>>> xmin | xmax | cmin_cmax | is_combocid >>>> -------+-------+-----------+------------- >>>> 80693 | 80693 | 0 | t >>>> 80693 | 80693 | 1 | t >>>> 80693 | 80693 | 2 | t >>>> 80693 | 0 | 3 | f >>>> 80693 | 0 | 3 | f >>>> 80693 | 0 | 3 | f >>>> (6 rows) >>>> >>>> test=# >>>> test=# select xmin,xmax,cmin,cmax,* from mvcc_demo order by val; >>>> xmin | xmax | cmin | cmax | val >>>> -------+------+------+------+----- >>>> 80693 | 0 | 3 | 3 | 10 >>>> 80693 | 0 | 3 | 3 | 10 >>>> 80693 | 0 | 3 | 3 | 10 >>>> (3 rows) >>>> >>>> >>>> test=# >>>> test=# END; >>>> COMMIT >>>> test=# >>>> test=# TRUNCATE mvcc_demo; >>>> TRUNCATE TABLE >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> -- From one psql issue >>>> test=# INSERT INTO mvcc_demo VALUES (1); >>>> INSERT 0 1 >>>> test=# SELECT t_xmin AS xmin, >>>> test-# t_xmax::text::int8 AS xmax, >>>> test-# t_field3::text::int8 AS cmin_cmax, >>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>> is_combocid >>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>> test-# ORDER BY 2 DESC, 3; >>>> xmin | xmax | cmin_cmax | is_combocid >>>> -------+------+-----------+------------- >>>> 80699 | 0 | 0 | f >>>> (1 row) >>>> >>>> test=# >>>> test=# select xmin,xmax,cmin,cmax,* from mvcc_demo order by val; >>>> xmin | xmax | cmin | cmax | val >>>> -------+------+------+------+----- >>>> 80699 | 0 | 0 | 0 | 1 >>>> (1 row) >>>> >>>> >>>> >>>> >>>> >>>> test=# -- From another issue >>>> test=# BEGIN; >>>> BEGIN >>>> test=# INSERT INTO mvcc_demo VALUES (2); >>>> INSERT 0 1 >>>> test=# INSERT INTO mvcc_demo VALUES (3); >>>> INSERT 0 1 >>>> test=# INSERT INTO mvcc_demo VALUES (4); >>>> INSERT 0 1 >>>> test=# SELECT t_xmin AS xmin, >>>> test-# t_xmax::text::int8 AS xmax, >>>> test-# t_field3::text::int8 AS cmin_cmax, >>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>> is_combocid >>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>> test-# ORDER BY 2 DESC, 3; >>>> xmin | xmax | cmin_cmax | is_combocid >>>> -------+------+-----------+------------- >>>> 80699 | 0 | 0 | f >>>> 80700 | 0 | 0 | f >>>> 80700 | 0 | 1 | f >>>> 80700 | 0 | 2 | f >>>> (4 rows) >>>> >>>> test=# >>>> test=# select xmin,xmax,cmin,cmax,* from mvcc_demo order by val; >>>> xmin | xmax | cmin | cmax | val >>>> -------+------+------+------+----- >>>> 80699 | 0 | 0 | 0 | 1 >>>> 80700 | 0 | 0 | 0 | 2 >>>> 80700 | 0 | 1 | 1 | 3 >>>> 80700 | 0 | 2 | 2 | 4 >>>> (4 rows) >>>> >>>> test=# >>>> test=# UPDATE mvcc_demo SET val = 10; >>>> >>>> UPDATE 4 >>>> test=# >>>> test=# SELECT t_xmin AS xmin, >>>> test-# t_xmax::text::int8 AS xmax, >>>> test-# t_field3::text::int8 AS cmin_cmax, >>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>> is_combocid >>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>> test-# ORDER BY 2 DESC, 3; >>>> xmin | xmax | cmin_cmax | is_combocid >>>> -------+-------+-----------+------------- >>>> 80700 | 80700 | 0 | t >>>> 80700 | 80700 | 1 | t >>>> 80700 | 80700 | 2 | t >>>> 80699 | 80700 | 3 | f >>>> 80700 | 0 | 3 | f >>>> 80700 | 0 | 3 | f >>>> 80700 | 0 | 3 | f >>>> 80700 | 0 | 3 | f >>>> (8 rows) >>>> >>>> test=# >>>> test=# select xmin,xmax,cmin,cmax,* from mvcc_demo order by val; >>>> xmin | xmax | cmin | cmax | val >>>> -------+------+------+------+----- >>>> 80700 | 0 | 3 | 3 | 10 >>>> 80700 | 0 | 3 | 3 | 10 >>>> 80700 | 0 | 3 | 3 | 10 >>>> 80700 | 0 | 3 | 3 | 10 >>>> (4 rows) >>>> >>>> >>>> >>>> >>>> test=# -- Before 
finishing this, issue these from the first psql >>>> test=# SELECT t_xmin AS xmin, >>>> test-# t_xmax::text::int8 AS xmax, >>>> test-# t_field3::text::int8 AS cmin_cmax, >>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>> is_combocid >>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>> test-# ORDER BY 2 DESC, 3; >>>> xmin | xmax | cmin_cmax | is_combocid >>>> -------+-------+-----------+------------- >>>> 80700 | 80700 | 0 | t >>>> 80700 | 80700 | 1 | t >>>> 80700 | 80700 | 2 | t >>>> 80699 | 80700 | 3 | f >>>> 80700 | 0 | 3 | f >>>> 80700 | 0 | 3 | f >>>> 80700 | 0 | 3 | f >>>> 80700 | 0 | 3 | f >>>> (8 rows) >>>> >>>> test=# >>>> test=# select xmin,xmax,cmin,cmax,* from mvcc_demo order by val; >>>> xmin | xmax | cmin | cmax | val >>>> -------+-------+------+------+----- >>>> 80699 | 80700 | 3 | 3 | 1 >>>> (1 row) >>>> >>>> test=# end; >>>> COMMIT >>>> >>>> >>>> On Tue, Jun 19, 2012 at 10:26 AM, Michael Paquier < >>>> mic...@gm...> wrote: >>>> >>>>> Hi, >>>>> >>>>> I expect pgxc_node_send_cmd_id to have some impact on performance, so >>>>> be sure to send it to remote Datanodes really only if necessary. >>>>> You should put more severe conditions blocking this function cid can >>>>> easily get incremented in Postgres. >>>>> >>>>> Regards, >>>>> >>>>> On Tue, Jun 19, 2012 at 5:31 AM, Abbas Butt < >>>>> abb...@en...> wrote: >>>>> >>>>>> PFA a WIP patch implementing the design presented earlier. >>>>>> The patch is WIP because it still has and FIXME and it shows some >>>>>> regression failures that need to be fixed, but other than that it confirms >>>>>> that the suggested design would work fine. The following test cases now >>>>>> work fine >>>>>> >>>>>> drop table tt1; >>>>>> create table tt1(f1 int) distribute by replication; >>>>>> >>>>>> >>>>>> BEGIN; >>>>>> insert into tt1 values(1); >>>>>> declare c50 cursor for select * from tt1; >>>>>> insert into tt1 values(2); >>>>>> fetch all from c50; >>>>>> COMMIT; >>>>>> truncate table tt1; >>>>>> >>>>>> BEGIN; >>>>>> >>>>>> declare c50 cursor for select * from tt1; >>>>>> insert into tt1 values(1); >>>>>> >>>>>> insert into tt1 values(2); >>>>>> fetch all from c50; >>>>>> COMMIT; >>>>>> truncate table tt1; >>>>>> >>>>>> >>>>>> BEGIN; >>>>>> insert into tt1 values(1); >>>>>> insert into tt1 values(2); >>>>>> >>>>>> declare c50 cursor for select * from tt1; >>>>>> insert into tt1 values(3); >>>>>> >>>>>> fetch all from c50; >>>>>> COMMIT; >>>>>> truncate table tt1; >>>>>> >>>>>> >>>>>> BEGIN; >>>>>> insert into tt1 values(1); >>>>>> declare c50 cursor for select * from tt1; >>>>>> insert into tt1 values(2); >>>>>> declare c51 cursor for select * from tt1; >>>>>> insert into tt1 values(3); >>>>>> fetch all from c50; >>>>>> fetch all from c51; >>>>>> COMMIT; >>>>>> truncate table tt1; >>>>>> >>>>>> >>>>>> BEGIN; >>>>>> insert into tt1 values(1); >>>>>> declare c50 cursor for select * from tt1; >>>>>> declare c51 cursor for select * from tt1; >>>>>> insert into tt1 values(2); >>>>>> insert into tt1 values(3); >>>>>> fetch all from c50; >>>>>> fetch all from c51; >>>>>> COMMIT; >>>>>> truncate table tt1; >>>>>> >>>>>> >>>>>> On Fri, Jun 15, 2012 at 8:07 AM, Abbas Butt < >>>>>> abb...@en...> wrote: >>>>>> >>>>>>> Hi, >>>>>>> >>>>>>> In a multi-statement transaction each statement is given a command >>>>>>> identifier >>>>>>> starting from zero and incrementing for each statement. 
>>>>>>> These command indentifers are required for extra tracking because >>>>>>> each >>>>>>> statement has its own visibility rules with in the transaction. >>>>>>> For example, a cursor’s contents must remain unchanged even if later >>>>>>> statements in the >>>>>>> same transaction modify rows. Such tracking is implemented using >>>>>>> system command id >>>>>>> columns cmin/cmax, which is internally actually is a single column. >>>>>>> >>>>>>> cmin/cmax come into play in case of multi-statement transactions >>>>>>> only, >>>>>>> they are both zero otherwise. >>>>>>> >>>>>>> cmin "The command identifier of the statement within the inserting >>>>>>> transaction." >>>>>>> cmax "The command identifier of the statement within the deleting >>>>>>> transaction." >>>>>>> >>>>>>> Here are the visibility rules (taken from comments of tqual.c) >>>>>>> >>>>>>> ( // A heap tuple is valid >>>>>>> "now" iff >>>>>>> Xmin == my-transaction && // inserted by the current >>>>>>> transaction >>>>>>> Cmin < my-command && // before this command, and >>>>>>> ( >>>>>>> Xmax is null || // the row has not been >>>>>>> deleted, or >>>>>>> ( >>>>>>> Xmax == my-transaction && // it was deleted by the >>>>>>> current transaction >>>>>>> Cmax >= my-command // but not before this >>>>>>> command, >>>>>>> ) >>>>>>> ) >>>>>>> ) >>>>>>> || // or >>>>>>> ( >>>>>>> Xmin is committed && // the row was inserted by >>>>>>> a committed transaction, and >>>>>>> ( >>>>>>> Xmax is null || // the row has not been >>>>>>> deleted, or >>>>>>> ( >>>>>>> Xmax == my-transaction && // the row is being deleted >>>>>>> by this transaction >>>>>>> Cmax >= my-command) || // but it's not deleted >>>>>>> "yet", or >>>>>>> ( >>>>>>> Xmax != my-transaction && // the row was deleted by >>>>>>> another transaction >>>>>>> Xmax is not committed // that has not been >>>>>>> committed >>>>>>> ) >>>>>>> ) >>>>>>> ) >>>>>>> ) >>>>>>> >>>>>>> Because cmin and cmax are internally a single system column, >>>>>>> it is therefore not possible to simply record the status of a row >>>>>>> that is created and expired in the same multi-statement transaction. >>>>>>> For that reason, a special combo command id is created that >>>>>>> references >>>>>>> a local memory hash that contains the actual cmin and cmax values. >>>>>>> It means that if combo id is being used the number we are seeing >>>>>>> would not be the cmin or cmax it will be an index into a local >>>>>>> array that contains a structure with has the actual cmin and cmax >>>>>>> values. >>>>>>> >>>>>>> The following queries (taken mostly from >>>>>>> http://momjian.us/main/writings/pgsql/mvcc.pdf) >>>>>>> use the contrib module pageinspect, which allows >>>>>>> visibility of internal heap page structures and all stored rows, >>>>>>> including those not visible in the current snapshot. >>>>>>> (Bit 0x0020 is defined as HEAP_COMBOCID.) 
>>>>>>> >>>>>>> We are exploring 3 examples here: >>>>>>> 1) INSERT & DELETE in a single transaction >>>>>>> 2) INSERT & UPDATE in a single transaction >>>>>>> 3) INSERT from two different transactions & UPDATE from one >>>>>>> >>>>>>> test=# drop table mvcc_demo; >>>>>>> DROP TABLE >>>>>>> test=# >>>>>>> test=# create table mvcc_demo (val int); >>>>>>> CREATE TABLE >>>>>>> test=# >>>>>>> test=# TRUNCATE mvcc_demo; >>>>>>> TRUNCATE TABLE >>>>>>> test=# >>>>>>> test=# BEGIN; >>>>>>> BEGIN >>>>>>> test=# DELETE FROM mvcc_demo; -- increment command id to show that >>>>>>> combo id would be different >>>>>>> DELETE 0 >>>>>>> test=# DELETE FROM mvcc_demo; >>>>>>> DELETE 0 >>>>>>> test=# DELETE FROM mvcc_demo; >>>>>>> DELETE 0 >>>>>>> test=# INSERT INTO mvcc_demo VALUES (1); >>>>>>> INSERT 0 1 >>>>>>> test=# INSERT INTO mvcc_demo VALUES (2); >>>>>>> INSERT 0 1 >>>>>>> test=# INSERT INTO mvcc_demo VALUES (3); >>>>>>> INSERT 0 1 >>>>>>> test=# SELECT t_xmin AS xmin, >>>>>>> test-# t_xmax::text::int8 AS xmax, >>>>>>> test-# t_field3::text::int8 AS cmin_cmax, >>>>>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>>>>> is_combocid >>>>>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>>>>> test-# ORDER BY 2 DESC, 3; >>>>>>> xmin | xmax | cmin_cmax | is_combocid >>>>>>> -------+------+-----------+------------- >>>>>>> 80685 | 0 | 3 | f >>>>>>> 80685 | 0 | 4 | f >>>>>>> 80685 | 0 | 5 | f >>>>>>> (3 rows) >>>>>>> >>>>>>> test=# >>>>>>> test=# DELETE FROM mvcc_demo; >>>>>>> DELETE 3 >>>>>>> test=# SELECT t_xmin AS xmin, >>>>>>> test-# t_xmax::text::int8 AS xmax, >>>>>>> test-# t_field3::text::int8 AS cmin_cmax, >>>>>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>>>>> is_combocid >>>>>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>>>>> test-# ORDER BY 2 DESC, 3; >>>>>>> xmin | xmax | cmin_cmax | is_combocid >>>>>>> -------+-------+-----------+------------- >>>>>>> 80685 | 80685 | 0 | t >>>>>>> 80685 | 80685 | 1 | t >>>>>>> 80685 | 80685 | 2 | t >>>>>>> (3 rows) >>>>>>> >>>>>>> Note that since is_combocid is true the numbers are not cmin/cmax >>>>>>> they are actually >>>>>>> the indexes of the internal array already explained above. 
>>>>>>> combo id index 0 would contain cmin 3, cmax 6 >>>>>>> combo id index 1 would contain cmin 4, cmax 6 >>>>>>> combo id index 2 would contain cmin 5, cmax 6 >>>>>>> >>>>>>> test=# >>>>>>> test=# END; >>>>>>> COMMIT >>>>>>> test=# >>>>>>> test=# >>>>>>> test=# TRUNCATE mvcc_demo; >>>>>>> TRUNCATE TABLE >>>>>>> test=# >>>>>>> test=# >>>>>>> test=# >>>>>>> test=# BEGIN; >>>>>>> BEGIN >>>>>>> test=# INSERT INTO mvcc_demo VALUES (1); >>>>>>> INSERT 0 1 >>>>>>> test=# INSERT INTO mvcc_demo VALUES (2); >>>>>>> INSERT 0 1 >>>>>>> test=# INSERT INTO mvcc_demo VALUES (3); >>>>>>> INSERT 0 1 >>>>>>> test=# SELECT t_xmin AS xmin, >>>>>>> test-# t_xmax::text::int8 AS xmax, >>>>>>> test-# t_field3::text::int8 AS cmin_cmax, >>>>>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>>>>> is_combocid >>>>>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>>>>> test-# ORDER BY 2 DESC, 3; >>>>>>> xmin | xmax | cmin_cmax | is_combocid >>>>>>> -------+------+-----------+------------- >>>>>>> 80675 | 0 | 0 | f >>>>>>> 80675 | 0 | 1 | f >>>>>>> 80675 | 0 | 2 | f >>>>>>> (3 rows) >>>>>>> >>>>>>> test=# >>>>>>> test=# UPDATE mvcc_demo SET val = val * 10; >>>>>>> UPDATE 3 >>>>>>> test=# >>>>>>> test=# SELECT t_xmin AS xmin, >>>>>>> test-# t_xmax::text::int8 AS xmax, >>>>>>> test-# t_field3::text::int8 AS cmin_cmax, >>>>>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>>>>> is_combocid >>>>>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>>>>> test-# ORDER BY 2 DESC, 3; >>>>>>> xmin | xmax | cmin_cmax | is_combocid >>>>>>> -------+-------+-----------+------------- >>>>>>> 80675 | 80675 | 0 | t >>>>>>> 80675 | 80675 | 1 | t >>>>>>> 80675 | 80675 | 2 | t >>>>>>> 80675 | 0 | 3 | f >>>>>>> 80675 | 0 | 3 | f >>>>>>> 80675 | 0 | 3 | f >>>>>>> (6 rows) >>>>>>> >>>>>>> test=# >>>>>>> test=# END; >>>>>>> COMMIT >>>>>>> test=# >>>>>>> test=# >>>>>>> test=# TRUNCATE mvcc_demo; >>>>>>> TRUNCATE TABLE >>>>>>> test=# >>>>>>> >>>>>>> -- From one psql issue >>>>>>> test=# INSERT INTO mvcc_demo VALUES (1); >>>>>>> INSERT 0 1 >>>>>>> test=# SELECT t_xmin AS xmin, >>>>>>> test-# t_xmax::text::int8 AS xmax, >>>>>>> test-# t_field3::text::int8 AS cmin_cmax, >>>>>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>>>>> is_combocid >>>>>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>>>>> test-# ORDER BY 2 DESC, 3; >>>>>>> xmin | xmax | cmin_cmax | is_combocid >>>>>>> -------+------+-----------+------------- >>>>>>> 80677 | 0 | 0 | f >>>>>>> (1 row) >>>>>>> >>>>>>> >>>>>>> test=# -- From another issue >>>>>>> test=# BEGIN; >>>>>>> BEGIN >>>>>>> test=# INSERT INTO mvcc_demo VALUES (2); >>>>>>> INSERT 0 1 >>>>>>> test=# INSERT INTO mvcc_demo VALUES (3); >>>>>>> INSERT 0 1 >>>>>>> test=# INSERT INTO mvcc_demo VALUES (4); >>>>>>> INSERT 0 1 >>>>>>> test=# SELECT t_xmin AS xmin, >>>>>>> test-# t_xmax::text::int8 AS xmax, >>>>>>> test-# t_field3::text::int8 AS cmin_cmax, >>>>>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>>>>> is_combocid >>>>>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>>>>> test-# ORDER BY 2 DESC, 3; >>>>>>> xmin | xmax | cmin_cmax | is_combocid >>>>>>> -------+------+-----------+------------- >>>>>>> 80677 | 0 | 0 | f >>>>>>> 80678 | 0 | 0 | f >>>>>>> 80678 | 0 | 1 | f >>>>>>> 80678 | 0 | 2 | f >>>>>>> (4 rows) >>>>>>> >>>>>>> test=# >>>>>>> test=# UPDATE mvcc_demo SET val = val * 10; >>>>>>> UPDATE 4 >>>>>>> test=# SELECT t_xmin AS xmin, >>>>>>> test-# t_xmax::text::int8 AS xmax, >>>>>>> test-# 
t_field3::text::int8 AS cmin_cmax, >>>>>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>>>>> is_combocid >>>>>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>>>>> test-# ORDER BY 2 DESC, 3; >>>>>>> xmin | xmax | cmin_cmax | is_combocid >>>>>>> -------+-------+-----------+------------- >>>>>>> 80678 | 80678 | 0 | t >>>>>>> 80678 | 80678 | 1 | t >>>>>>> 80678 | 80678 | 2 | t >>>>>>> 80677 | 80678 | 3 | f >>>>>>> 80678 | 0 | 3 | f >>>>>>> 80678 | 0 | 3 | f >>>>>>> 80678 | 0 | 3 | f >>>>>>> 80678 | 0 | 3 | f >>>>>>> (8 rows) >>>>>>> >>>>>>> test=# >>>>>>> >>>>>>> test=# -- Before finishing this, issue these from the first psql >>>>>>> test=# SELECT t_xmin AS xmin, >>>>>>> test-# t_xmax::text::int8 AS xmax, >>>>>>> test-# t_field3::text::int8 AS cmin_cmax, >>>>>>> test-# (t_infomask::integer & X'0020'::integer)::bool AS >>>>>>> is_combocid >>>>>>> test-# FROM heap_page_items(get_raw_page('mvcc_demo', 0)) >>>>>>> test-# ORDER BY 2 DESC, 3; >>>>>>> xmin | xmax | cmin_cmax | is_combocid >>>>>>> -------+-------+-----------+------------- >>>>>>> 80678 | 80678 | 0 | t >>>>>>> 80678 | 80678 | 1 | t >>>>>>> 80678 | 80678 | 2 | t >>>>>>> 80677 | 80678 | 3 | f >>>>>>> 80678 | 0 | 3 | f >>>>>>> 80678 | 0 | 3 | f >>>>>>> 80678 | 0 | 3 | f >>>>>>> 80678 | 0 | 3 | f >>>>>>> (8 rows) >>>>>>> >>>>>>> test=# END; >>>>>>> COMMIT >>>>>>> >>>>>>> >>>>>>> Now consider the case we are trying to solve >>>>>>> >>>>>>> drop table tt1; >>>>>>> create table tt1(f1 int); >>>>>>> >>>>>>> BEGIN; >>>>>>> insert into tt1 values(1); >>>>>>> declare c50 cursor for select * from tt1; -- should show one row >>>>>>> only >>>>>>> insert into tt1 values(2); >>>>>>> fetch all from c50; >>>>>>> COMMIT; >>>>>>> >>>>>>> >>>>>>> Consider Data node 1 log >>>>>>> >>>>>>> (a) [exec_simple_query][1026][START TRANSACTION ISOLATION LEVEL read >>>>>>> committed READ WRITE] >>>>>>> (b) [exec_simple_query][1026][drop table tt1;] >>>>>>> (c) [exec_simple_query][1026][PREPARE TRANSACTION 'T21075'] >>>>>>> (d) [exec_simple_query][1026][COMMIT PREPARED 'T21075'] >>>>>>> (e) [exec_simple_query][1026][START TRANSACTION ISOLATION LEVEL read >>>>>>> committed READ WRITE] >>>>>>> (f) [exec_simple_query][1026][create table tt1(f1 int);] >>>>>>> (g) [exec_simple_query][1026][PREPARE TRANSACTION 'T21077'] >>>>>>> (h) [exec_simple_query][1026][COMMIT PREPARED 'T21077'] >>>>>>> (i) [exec_simple_query][1026][START TRANSACTION ISOLATION LEVEL read >>>>>>> committed READ WRITE] >>>>>>> (j) [exec_simple_query][1026][INSERT INTO tt1 (f1) VALUES (1)] >>>>>>> (k) [exec_simple_query][1026][INSERT INTO tt1 (f1) VALUES (2)] >>>>>>> (l) [PostgresMain][4155][SELECT tt1.f1, tt1.ctid, pgxc_node_str() >>>>>>> FROM tt1] >>>>>>> (m) [exec_simple_query][1026][COMMIT TRANSACTION] >>>>>>> >>>>>>> The cursor currently shows both inserted rows because command id at >>>>>>> data node in >>>>>>> step (j) is 0 >>>>>>> step (k) is 1 & >>>>>>> step (l) is 2 >>>>>>> >>>>>>> Where as we need command ids to be >>>>>>> >>>>>>> step (j) should be 0 >>>>>>> step (k) should be 2 & >>>>>>> step (l) should be 1 >>>>>>> >>>>>>> This will solve the cursor visibility problem. >>>>>>> >>>>>>> To implement this I suggest we send command IDs to data nodes from >>>>>>> the coordinator >>>>>>> like we send gxid. The only difference will be that we do not need >>>>>>> to take command IDs >>>>>>> from GTM since they are only valid with in the transaction. 
>>>>>>> >>>>>>> See this example >>>>>>> >>>>>>> test=# select xmin,xmax,cmin,cmax,* from tt1; >>>>>>> xmin | xmax | cmin | cmax | f1 >>>>>>> ------+------+------+------+---- >>>>>>> (0 rows) >>>>>>> >>>>>>> test=# begin; >>>>>>> BEGIN >>>>>>> test=# insert into tt1 values(1); >>>>>>> INSERT 0 1 >>>>>>> test=# select xmin,xmax,cmin,cmax,* from tt1; >>>>>>> xmin | xmax | cmin | cmax | f1 >>>>>>> -------+------+------+------+---- >>>>>>> 80615 | 0 | 0 | 0 | 1 >>>>>>> (1 row) >>>>>>> >>>>>>> test=# insert into tt1 values(2); >>>>>>> INSERT 0 1 >>>>>>> test=# select xmin,xmax,cmin,cmax,* from tt1; >>>>>>> xmin | xmax | cmin | cmax | f1 >>>>>>> -------+------+------+------+---- >>>>>>> 80615 | 0 | 0 | 0 | 1 >>>>>>> 80615 | 0 | 1 | 1 | 2 >>>>>>> (2 rows) >>>>>>> >>>>>>> test=# insert into tt1 values(3); >>>>>>> INSERT 0 1 >>>>>>> test=# select xmin,xmax,cmin,cmax,* from tt1; >>>>>>> xmin | xmax | cmin | cmax | f1 >>>>>>> -------+------+------+------+---- >>>>>>> 80615 | 0 | 0 | 0 | 1 >>>>>>> 80615 | 0 | 1 | 1 | 2 >>>>>>> 80615 | 0 | 2 | 2 | 3 >>>>>>> (3 rows) >>>>>>> >>>>>>> test=# insert into tt1 values(4); >>>>>>> INSERT 0 1 >>>>>>> test=# select xmin,xmax,cmin,cmax,* from tt1; >>>>>>> xmin | xmax | cmin | cmax | f1 >>>>>>> -------+------+------+------+---- >>>>>>> 80615 | 0 | 0 | 0 | 1 >>>>>>> 80615 | 0 | 1 | 1 | 2 >>>>>>> 80615 | 0 | 2 | 2 | 3 >>>>>>> 80615 | 0 | 3 | 3 | 4 >>>>>>> (4 rows) >>>>>>> >>>>>>> test=# end; >>>>>>> COMMIT >>>>>>> test=# >>>>>>> test=# >>>>>>> test=# select xmin,xmax,cmin,cmax,* from tt1; >>>>>>> xmin | xmax | cmin | cmax | f1 >>>>>>> -------+------+------+------+---- >>>>>>> 80615 | 0 | 0 | 0 | 1 >>>>>>> 80615 | 0 | 1 | 1 | 2 >>>>>>> 80615 | 0 | 2 | 2 | 3 >>>>>>> 80615 | 0 | 3 | 3 | 4 >>>>>>> (4 rows) >>>>>>> >>>>>>> test=# insert into tt1 values(5); >>>>>>> INSERT 0 1 >>>>>>> test=# select xmin,xmax,cmin,cmax,* from tt1; >>>>>>> xmin | xmax | cmin | cmax | f1 >>>>>>> -------+------+------+------+---- >>>>>>> 80615 | 0 | 0 | 0 | 1 >>>>>>> 80615 | 0 | 1 | 1 | 2 >>>>>>> 80615 | 0 | 2 | 2 | 3 >>>>>>> 80615 | 0 | 3 | 3 | 4 >>>>>>> 80616 | 0 | 0 | 0 | 5 >>>>>>> (5 rows) >>>>>>> >>>>>>> test=# insert into tt1 values(6); >>>>>>> INSERT 0 1 >>>>>>> test=# >>>>>>> test=# >>>>>>> test=# select xmin,xmax,cmin,cmax,* from tt1; >>>>>>> xmin | xmax | cmin | cmax | f1 >>>>>>> -------+------+------+------+---- >>>>>>> 80615 | 0 | 0 | 0 | 1 >>>>>>> 80615 | 0 | 1 | 1 | 2 >>>>>>> 80615 | 0 | 2 | 2 | 3 >>>>>>> 80615 | 0 | 3 | 3 | 4 >>>>>>> 80616 | 0 | 0 | 0 | 5 >>>>>>> 80617 | 0 | 0 | 0 | 6 >>>>>>> (6 rows) >>>>>>> >>>>>>> Note that at the end of the multi-statement transaction the command >>>>>>> id gets reset to zero. >>>>>>> >>>>>>> -- >>>>>>> Abbas >>>>>>> Architect >>>>>>> EnterpriseDB Corporation >>>>>>> The Enterprise PostgreSQL Company >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> -- >>>>>> Abbas >>>>>> Architect >>>>>> EnterpriseDB Corporation >>>>>> The Enterprise PostgreSQL Company >>>>>> >>>>>> Phone: 92-334-5100153 >>>>>> >>>>>> Website: www.enterprisedb.com >>>>>> EnterpriseDB Blog: http://blogs.enterprisedb.com/ >>>>>> Follow us on Twitter: http://www.twitter.com/enterprisedb >>>>>> >>>>>> This e-mail message (and any attachment) is intended for the use of >>>>>> the individual or entity to whom it is addressed. This message >>>>>> contains information from EnterpriseDB Corporation that may be >>>>>> privileged, confidential, or exempt from disclosure under applicable >>>>>> law. 
>>>>> -- >>>>> Michael Paquier >>>>> http://michael.otacoo.com >>>> >>>> -- >>>> Abbas >>>> Architect >>>> EnterpriseDB Corporation >>>> The Enterprise PostgreSQL Company
>>> -- >>> Best Wishes, >>> Ashutosh Bapat >>> EntepriseDB Corporation >>> The Enterprise Postgres Company >> >> -- >> Abbas >> Architect >> EnterpriseDB Corporation >> The Enterprise PostgreSQL Company > > -- > Abbas > Architect > EnterpriseDB Corporation > The Enterprise PostgreSQL Company -- Abbas Architect EnterpriseDB Corporation The Enterprise PostgreSQL Company |
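A side note on the cursor example above: the behavior the command-ID fix is chasing can be checked on a single vanilla PostgreSQL node. A minimal psql sketch, reusing the tt1 table from the thread:

    BEGIN;
    INSERT INTO tt1 VALUES (1);
    DECLARE c50 CURSOR FOR SELECT * FROM tt1;
    INSERT INTO tt1 VALUES (2);
    FETCH ALL FROM c50;   -- returns only the row (1)
    COMMIT;

The FETCH returns a single row because a row inserted by the current transaction is visible to the cursor only if its cmin is lower than the command id at which the cursor's snapshot was taken. That is exactly the invariant that shipping coordinator-assigned command ids down to the datanodes is meant to restore.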
From: Shankar H. <har...@ya...> - 2012-07-09 16:12:04
|
Thanks Ashutosh. You are right, while running this test I just had pgbench running against one coordinator. Looks like pgbench by itself may not be an apt tool for this kind of testing, so I will instead run pgbench's underlying SQL script from the command line against both coordinators. Thanks for that tip. I got a lot of input on my problem from a lot of folks on the list, and the feedback is much appreciated. Thanks everybody! On max_prepared_transactions, I will factor in the number of coordinators and the max_connections on each coordinator while arriving at a figure. I will also try out Koichi Suzuki's suggestion to have multiple NICs on the GTM. I will post my findings here for the same cluster configuration as before. thanks, Shankar ________________________________ From: Ashutosh Bapat <ash...@en...> To: Shankar Hariharan <har...@ya...> Cc: "pos...@li..." <pos...@li...> Sent: Sunday, July 8, 2012 11:02 PM Subject: Re: [Postgres-xc-developers] Question on gtm-proxy Hi Shankar, You have got answers to the prepared transaction problem, I guess. I have something else below. On Sat, Jul 7, 2012 at 1:44 AM, Shankar Hariharan <har...@ya...> wrote: As planned I ran some tests using PGBench on this setup : > > >Node 1 - Coord1, Datanode1, gtm-proxy1 >Node 2- Coord2, Datanode2, gtm-proxy2 >Node 3- Datanode3, gtm > >I was connecting via Coord1 for these tests: >- scale factor of 30 used >- tests run using the following input parameters for pgbench: Try connecting to both the coordinators; it should give you better performance, especially when you are using distributed tables. With distributed tables, the coordinator gets involved in query execution more than in the case of replicated tables. So, balancing load across the two coordinators would help. >
>Clients  Threads  Duration  Transactions
>1        1        100       6204
>2        2        100       9960
>4        4        100       12880
>6        6        100       16768
>8        8        100       19758
>10       10       100       21944
>12       12       100       20674
> > >The run went well until 8 clients. I started seeing errors from 10 clients onwards, and eventually the 14-client run has been hanging around for over an hour now. The errors I have been seeing on the console are the following : > > >pgbench console : >Client 8 aborted in state 12: ERROR: GTM error, could not obtain snapshot > >Client 0 aborted in state 13: ERROR: maximum number of prepared transactions reached >Client 7 aborted in state 13: ERROR: maximum number of prepared transactions reached >Client 11 aborted in state 13: ERROR: maximum number of prepared transactions reached >Client 9 aborted in state 13: ERROR: maximum number of prepared transactions reached > > >node console: >ERROR: GTM error, could not obtain snapshot >STATEMENT: INSERT INTO pgbench_history (tid, bid, aid, delta, mtime) VALUES (253, 26, 1888413, -817, CURRENT_TIMESTAMP); >ERROR: maximum number of prepared transactions reached >HINT: Increase max_prepared_transactions (currently 10). >STATEMENT: PREPARE TRANSACTION 'T201428' >ERROR: maximum number of prepared transactions reached >STATEMENT: END; >ERROR: maximum number of prepared transactions reached >STATEMENT: END; >ERROR: maximum number of prepared transactions reached >STATEMENT: END; >ERROR: maximum number of prepared transactions reached >STATEMENT: END; >ERROR: GTM error, could not obtain snapshot >STATEMENT: INSERT INTO pgbench_history (tid, bid, aid, delta, mtime) VALUES (140, 29, 2416403, -4192, CURRENT_TIMESTAMP); > > >I was also watching the processes on each node and see the following for the 14-client run: > > > > >Node1 : >postgres 25571 10511 0 04:41 ? 
00:00:02 postgres: postgres postgres ::1(33481) TRUNCATE TABLE waiting >postgres 25620 11694 0 04:46 ? 00:00:00 postgres: postgres postgres pgbench-address (50388) TRUNCATE TABLE > > >Node2: >postgres 10979 9631 0 Jul05 ? 00:00:42 postgres: postgres postgres coord1-address(57357) idle in transaction > > > >Node3: > >postgres 20264 9911 0 08:35 ? 00:00:05 postgres: postgres postgres coord1-address(51406) TRUNCATE TABLE waiting > > > > > >I was going to restart the processes on all nodes and start over but did not want to lose this data as it could be useful information. > > >Any explanation on the above issue is much appreciated. I will try the next run with a higher value set for max_prepared_transactions. Any recommendations for a good value on this front? > > > >thanks, >Shankar > > > > > >________________________________ > From: Shankar Hariharan <har...@ya...> >To: Ashutosh Bapat <ash...@en...> >Cc: "pos...@li..." <pos...@li...> >Sent: Friday, July 6, 2012 8:22 AM > >Subject: Re: [Postgres-xc-developers] Question on gtm-proxy > > > >Hi Ashutosh, >I was trying to size the load on a server and was wondering if a GTM could be shared w/o much performance overhead between a small number of datanodes and coordinators. I will post my findings here. >thanks, >Shankar > > > >________________________________ > From: Ashutosh Bapat <ash...@en...> >To: Shankar Hariharan <har...@ya...> >Cc: "pos...@li..." <pos...@li...> >Sent: Friday, July 6, 2012 12:25 AM >Subject: Re: [Postgres-xc-developers] Question on gtm-proxy > > >Hi Shankar, >Running gtm-proxy has shown to improve the performance, because it lessens the load on GTM, by serving requests locally. Why do you want the coordinators to connect directly to the GTM? Are you seeing any performance improvement from doing that? > > >On Fri, Jul 6, 2012 at 10:08 AM, Shankar Hariharan <har...@ya...> wrote: > >Follow up to earlier email. In the setup described below, can I avoid using a gtm-proxy? That is, can I just simply point coordinators to the one gtm running on node 3 ? >>My initial plan was to just run the gtm on node 3 then I thought I could try a datanode without a local coordinator which was why I put these two together on node 3. >>thanks, >>Shankar >> >> >> >>________________________________ >> From: Shankar Hariharan <har...@ya...> >>To: "pos...@li..." <pos...@li...> >>Sent: Thursday, July 5, 2012 11:35 PM >>Subject: Question on multiple coordinators >> >> >>Hello, >> >> >>Am trying out XC 1.0 in the following configuraiton. >>Node 1 - Coord1, Datanode1, gtm-proxy1 >>Node 2- Coord2, Datanode2, gtm-proxy2 >>Node 3- Datanode3, gtm >> >> >>I setup all nodes but forgot to add Coord1 to Coord2 and vice versa. In addition I missed the pg_hba edit as well. So the first table T1 that I created for distribution from Coord1 was not "visible| from Coord2 but was on all the data nodes. >>I tried to get Coord2 backinto business in various ways but the first table I created refused to show up on Coord2 : >>- edit pg_hba and add node on both coord1 and 2. Then run select pgxc_pool_reload(); >>- restart coord 1 and 2 >>- drop node c2 from c1 and c1 from c2 and add them back followed by select pgxc_pool_reload(); >> >> >>So I tried to create the same table T1 from Coord2 to observe behavior and it did not like it clearly as all nodes it "wrote" to reported that the table already existed which was good. At this point I could understand that Coord2 and Coord1 are not talking alright so I created a new table from coord1 with replication. 
This table was visible from both now. >> >> >>Question is should I expect to see the first table, let me call it T1 after a while from Coord2 also? >> >>thanks, >>Shankar > >-- >Best Wishes, >Ashutosh Bapat >EntepriseDB Corporation >The Enterprise Postgres Company -- Best Wishes, Ashutosh Bapat EntepriseDB Corporation The Enterprise Postgres Company |
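Regarding the tip to connect to both coordinators: one simple way to balance a pgbench run, without changing the tool itself, is to start two instances in parallel and split the clients between them. A rough sketch, assuming coordinators on hosts node1 and node2, port 5432, and the pgbench database named postgres (host names and numbers are illustrative):

    pgbench -h node1 -p 5432 -c 6 -j 6 -T 100 postgres &
    pgbench -h node2 -p 5432 -c 6 -j 6 -T 100 postgres &
    wait

The two reported TPS figures then add up to the cluster-wide throughput for the combined 12 clients.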
From: pramodh m. <pos...@gm...> - 2012-07-09 13:32:36
|
1) Are triggers fully supported in XC? 2) Can I set up Slony on the datanodes to replicate certain tables to a different Postgres cluster? Pramodh Mereddy |
From: Ashutosh B. <ash...@en...> - 2012-07-09 12:46:08
|
On Mon, Jul 9, 2012 at 5:37 PM, Michael Paquier <mic...@gm...>wrote: > > > On Mon, Jul 9, 2012 at 7:56 PM, Ashutosh Bapat < > ash...@en...> wrote: > >> Hi Michael, >> I had a look at the patch. I mainly focused on the overall content of the >> patch and importantly tests. Before I look at the redistribution code >> thoroughly, I have few comments. >> >> There are many trailing white spaces in the patch. Please fix those, they >> unnecessarily fail the automatic merges sometimes. You can do that when you >> commit the patch. >> > Oh OK, I didn't notice. Do you have some places particularly in mind? > Apply your patch on clean repository using git apply and it will show you. > > >> >> Code >> ==== >> 1. There is a lot of code, which is refactoring existing code, renaming >> functions, which is not necessarily related to redistribution work. Can you >> please provide separate patches for this refactoring? We should commit them >> separately. For example build_subcluster_data() has been renamed (for good >> may be), but it makes sense if we do it separately. Someone looking at the >> ALTER TABLE commit should not get overwhelmed by the extraneous changes. >> > OK. The problem with the functions currently on master was that there name > was not really generic and sometimes did not reflect their real > functionality. So as now the plan is to use them in a more general way, I > think there name is not going to change anymore. > > >> >> 2. Same is the case with the grammar changes. Please separate the grammar >> changes related to pgxc_nodelist etc. into separate patch, although it's >> because of ALTER TABLE you need to do those changes. >> > OK understood. > > >> >> Please get these patches reviewed as well, since I haven't looked at the >> changes proper. >> > Understood, I'll make those 2 patches on tomorrow morning, not a big deal. > > >> >> Tests >> ===== >> 1. There is no need to test with huge data, that slows down regression. >> For performance testing, you can create a separate test (not to be included >> in regression), if you want. >> > That may be an idea. However you are right I'll limit the number of rows > tested. > > > >> 2. We need tests, which will test the plan cache (in)validation upon >> redistribution of data, tests for testing existing views working after the >> redistribution. Please take a look at the PG alter table test for more such >> scenarios. > > OK I'll add those scenarios. They will be included in xc_alter_table. > > >> If you happen to add some performance tests, it would be also good to >> test the sanity of concurrent transactions accessing the object/s being >> redistributed. It's vital considering that such redistribution would run >> for longer. >> > Yes, it would be nice to > > > >> 3. Instead of relying on count(*) to show sanity of the redistributed >> data, you may use better aggregates like array_agg or sum(), avg() and >> count(). I would prefer array_agg over others, since you can list all the >> data values there. You will need aggregate's order by clause (Not that of >> the SELECT). >> 4. In the case of redistribution of table with index, you will need to >> check the sanity of index after the redistribution by some means. >> > Do you have an idea of how to do that? Pick up some tests from postgres? > Good question. But I don't have an answer (specifically for XC, since the indexes are on datanodes). > > >> 5. I did not understand the significance of the tests where you add and >> drop column and redistribute the data. 
The SELECT after the redistribution >> is not testing anything specific for the added/dropped column. >> > The internal, let's say default layer, of distribution mechanism uses an > internal COPY and it is important to do this check and correctly bypass the > columns that are dropped. The SELECT is just here to check that data has > been redistributed correctly. > > > >> 6. There are no testcases which would change the distribution type and >> node list at the same time. Please add those. (I am assuming that these two >> operations are possible together). >> > Yeah sorry, I have been working on that today and added some additional > tests that can do that. > They are in the bucket, just I didn't send the absolutely latest version. > > >> 7. Negative testcases need to improved. >> > What are the negative test cases? It would be cool if you could precise. > Tests which do negative testing ( http://www.sqatester.com/methodology/PositiveandNegativeTesting.htm) > > >> >> Additional feature >> ================== >> It will be helpful to add the distribution information in the output of >> \d command for tables. It will be good tool for tests to check whether the >> catalogs have been updated correctly or not. Please add this feature before >> we complete ALTER TABLE. It shouldn't take much time. Please provide this >> as a separate patch. >> > +1. > This is a good idea, and I recall we had this discussion a couple of > months ago. However it is not directly related with redistribution. So it > should be provided after committing the redistribution work I believe. > It will help in testing the feature. For example, you can just do \d on the redistributed table, to see if catalogs have been updated correctly or not. So, it's better to do it before this ALTER TABLE, so that you can use it in the tests. It should been done when the work related to the subcluster was done, even before when XC was started :). Anyway, earlier the better. > Also, I think we shouldn't use ¥d as it will impact other applications > like pgadmin for instance. We should use an extension of ¥d like for > example ¥dZ. This is just a suggestion, I don't know what are the commands > still not in use. > \d is for describing a relation at bare minimum. In XC distribution strategy becomes an integral part of a relation, and thus should be part of the \d output. Applications using \d will need a change, but how many applications connect via psql to fire commands (very less, I guess), so we are not in much trouble. If one compares changing grammar of say CREATE TABLE after the first release, would be more problematic that this one. > -- > Michael Paquier > http://michael.otacoo.com > -- Best Wishes, Ashutosh Bapat EntepriseDB Corporation The Enterprise Postgres Company |
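On point 7 of the review, negative tests for redistribution would exercise the error paths rather than the success paths. A hedged sketch, borrowing the ALTER TABLE ... TO NODE grammar mentioned elsewhere in this thread (table, column, and node names are placeholders, and the exact error texts are not asserted):

    -- redistribution to a node that does not exist must fail cleanly
    ALTER TABLE t1 TO NODE (no_such_node);
    -- redistributing by a column that does not exist must also fail
    ALTER TABLE t1 DISTRIBUTE BY HASH (no_such_column);

After each failure, a SELECT on t1 should still return the original data, showing that a failed redistribution leaves the table intact.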
From: Michael P. <mic...@gm...> - 2012-07-09 12:27:04
|
On Mon, Jul 9, 2012 at 9:22 PM, Ashutosh Bapat < ash...@en...> wrote: > > > On Mon, Jul 9, 2012 at 5:37 PM, Michael Paquier <mic...@gm... > > wrote: > >> >> >> On Mon, Jul 9, 2012 at 7:56 PM, Ashutosh Bapat < >> ash...@en...> wrote: >> >>> Hi Michael, >>> I had a look at the patch. I mainly focused on the overall content of >>> the patch and importantly tests. Before I look at the redistribution code >>> thoroughly, I have few comments. >>> >>> There are many trailing white spaces in the patch. Please fix those, >>> they unnecessarily fail the automatic merges sometimes. You can do that >>> when you commit the patch. >>> >> Oh OK, I didn't notice. Do you have some places particularly in mind? >> > > Apply your patch on clean repository using git apply and it will show you. > I'll make a test. > > >> >> >>> >>> Code >>> ==== >>> 1. There is a lot of code, which is refactoring existing code, renaming >>> functions, which is not necessarily related to redistribution work. Can you >>> please provide separate patches for this refactoring? We should commit them >>> separately. For example build_subcluster_data() has been renamed (for good >>> may be), but it makes sense if we do it separately. Someone looking at the >>> ALTER TABLE commit should not get overwhelmed by the extraneous changes. >>> >> OK. The problem with the functions currently on master was that there >> name was not really generic and sometimes did not reflect their real >> functionality. So as now the plan is to use them in a more general way, I >> think there name is not going to change anymore. >> >> >>> >>> 2. Same is the case with the grammar changes. Please separate the >>> grammar changes related to pgxc_nodelist etc. into separate patch, although >>> it's because of ALTER TABLE you need to do those changes. >>> >> OK understood. >> >> >>> >>> Please get these patches reviewed as well, since I haven't looked at the >>> changes proper. >>> >> Understood, I'll make those 2 patches on tomorrow morning, not a big deal. >> >> >>> >>> Tests >>> ===== >>> 1. There is no need to test with huge data, that slows down regression. >>> For performance testing, you can create a separate test (not to be included >>> in regression), if you want. >>> >> That may be an idea. However you are right I'll limit the number of rows >> tested. >> >> >> >>> 2. We need tests, which will test the plan cache (in)validation upon >>> redistribution of data, tests for testing existing views working after the >>> redistribution. Please take a look at the PG alter table test for more such >>> scenarios. >> >> OK I'll add those scenarios. They will be included in xc_alter_table. >> >> >>> If you happen to add some performance tests, it would be also good to >>> test the sanity of concurrent transactions accessing the object/s being >>> redistributed. It's vital considering that such redistribution would run >>> for longer. >>> >> Yes, it would be nice to >> >> >> >>> 3. Instead of relying on count(*) to show sanity of the redistributed >>> data, you may use better aggregates like array_agg or sum(), avg() and >>> count(). I would prefer array_agg over others, since you can list all the >>> data values there. You will need aggregate's order by clause (Not that of >>> the SELECT). >>> 4. In the case of redistribution of table with index, you will need to >>> check the sanity of index after the redistribution by some means. >>> >> Do you have an idea of how to do that? Pick up some tests from postgres? >> > > Good question. 
But I don't have an answer (specifically for XC, since the > indexes are on datanodes). > OK I'll figure out myself smth. > > >> >> >>> 5. I did not understand the significance of the tests where you add and >>> drop column and redistribute the data. The SELECT after the redistribution >>> is not testing anything specific for the added/dropped column. >>> >> The internal, let's say default layer, of distribution mechanism uses an >> internal COPY and it is important to do this check and correctly bypass the >> columns that are dropped. The SELECT is just here to check that data has >> been redistributed correctly. >> >> >> >>> 6. There are no testcases which would change the distribution type and >>> node list at the same time. Please add those. (I am assuming that these two >>> operations are possible together). >>> >> Yeah sorry, I have been working on that today and added some additional >> tests that can do that. >> They are in the bucket, just I didn't send the absolutely latest version. >> >> >>> 7. Negative testcases need to improved. >>> >> What are the negative test cases? It would be cool if you could precise. >> > > Tests which do negative testing ( > http://www.sqatester.com/methodology/PositiveandNegativeTesting.htm) > So you mean that I need to reformat and refactor my test cases??! > > >> >> >>> >>> Additional feature >>> ================== >>> It will be helpful to add the distribution information in the output of >>> \d command for tables. It will be good tool for tests to check whether the >>> catalogs have been updated correctly or not. Please add this feature before >>> we complete ALTER TABLE. It shouldn't take much time. Please provide this >>> as a separate patch. >>> >> +1. >> This is a good idea, and I recall we had this discussion a couple of >> months ago. However it is not directly related with redistribution. So it >> should be provided after committing the redistribution work I believe. >> > > It will help in testing the feature. For example, you can just do \d on > the redistributed table, to see if catalogs have been updated correctly or > not. So, it's better to do it before this ALTER TABLE, so that you can use > it in the tests. It should been done when the work related to the > subcluster was done, even before when XC was started :). Anyway, earlier > the better. > > >> Also, I think we shouldn't use ¥d as it will impact other applications >> like pgadmin for instance. We should use an extension of ¥d like for >> example ¥dZ. This is just a suggestion, I don't know what are the commands >> still not in use. >> > > \d is for describing a relation at bare minimum. In XC distribution > strategy becomes an integral part of a relation, and thus should be part of > the \d output. Applications using \d will need a change, but how many > applications connect via psql to fire commands (very less, I guess), so we > are not in much trouble. > We are not sure about that, honestly! Just for security's sake, we should use another command. It also impacts XC transparency with postgres. I also believe it is an add-on, which is not part of the redistribution core. -- Michael Paquier http://michael.otacoo.com |
From: Michael P. <mic...@gm...> - 2012-07-09 12:07:48
|
On Mon, Jul 9, 2012 at 7:56 PM, Ashutosh Bapat < ash...@en...> wrote: > Hi Michael, > I had a look at the patch. I mainly focused on the overall content of the > patch and importantly tests. Before I look at the redistribution code > thoroughly, I have few comments. > > There are many trailing white spaces in the patch. Please fix those, they > unnecessarily fail the automatic merges sometimes. You can do that when you > commit the patch. > Oh OK, I didn't notice. Do you have some places particularly in mind? > > Code > ==== > 1. There is a lot of code, which is refactoring existing code, renaming > functions, which is not necessarily related to redistribution work. Can you > please provide separate patches for this refactoring? We should commit them > separately. For example build_subcluster_data() has been renamed (for good > may be), but it makes sense if we do it separately. Someone looking at the > ALTER TABLE commit should not get overwhelmed by the extraneous changes. > OK. The problem with the functions currently on master was that there name was not really generic and sometimes did not reflect their real functionality. So as now the plan is to use them in a more general way, I think there name is not going to change anymore. > > 2. Same is the case with the grammar changes. Please separate the grammar > changes related to pgxc_nodelist etc. into separate patch, although it's > because of ALTER TABLE you need to do those changes. > OK understood. > > Please get these patches reviewed as well, since I haven't looked at the > changes proper. > Understood, I'll make those 2 patches on tomorrow morning, not a big deal. > > Tests > ===== > 1. There is no need to test with huge data, that slows down regression. > For performance testing, you can create a separate test (not to be included > in regression), if you want. > That may be an idea. However you are right I'll limit the number of rows tested. > 2. We need tests, which will test the plan cache (in)validation upon > redistribution of data, tests for testing existing views working after the > redistribution. Please take a look at the PG alter table test for more such > scenarios. OK I'll add those scenarios. They will be included in xc_alter_table. > If you happen to add some performance tests, it would be also good to test > the sanity of concurrent transactions accessing the object/s being > redistributed. It's vital considering that such redistribution would run > for longer. > Yes, it would be nice to > 3. Instead of relying on count(*) to show sanity of the redistributed > data, you may use better aggregates like array_agg or sum(), avg() and > count(). I would prefer array_agg over others, since you can list all the > data values there. You will need aggregate's order by clause (Not that of > the SELECT). > 4. In the case of redistribution of table with index, you will need to > check the sanity of index after the redistribution by some means. > Do you have an idea of how to do that? Pick up some tests from postgres? > 5. I did not understand the significance of the tests where you add and > drop column and redistribute the data. The SELECT after the redistribution > is not testing anything specific for the added/dropped column. > The internal, let's say default layer, of distribution mechanism uses an internal COPY and it is important to do this check and correctly bypass the columns that are dropped. The SELECT is just here to check that data has been redistributed correctly. > 6. 
There are no testcases which would change the distribution type and > node list at the same time. Please add those. (I am assuming that these two > operations are possible together). > Yeah sorry, I have been working on that today and added some additional tests that can do that. They are in the bucket, I just didn't send the absolutely latest version. > >> 7. Negative testcases need to be improved. > What are the negative test cases? It would be cool if you could be more precise. > > Additional feature > ================== > It will be helpful to add the distribution information in the output of \d > command for tables. It will be a good tool for tests to check whether the > catalogs have been updated correctly or not. Please add this feature before > we complete ALTER TABLE. It shouldn't take much time. Please provide this > as a separate patch. > +1. This is a good idea, and I recall we had this discussion a couple of months ago. However, it is not directly related to redistribution. So it should be provided after committing the redistribution work, I believe. Also, I think we shouldn't use \d as it will impact other applications like pgadmin, for instance. We should use an extension of \d, for example \dZ. This is just a suggestion; I don't know which commands are still not in use. -- Michael Paquier http://michael.otacoo.com |
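Until \d (or a \dZ-style variant) shows the distribution, the catalog can be queried directly to confirm that a redistribution updated it. A hedged sketch; it assumes the pgxc_class catalog exposes pcrelid, pclocatortype, and nodeoids as in the current sources, so verify the column names against your tree:

    SELECT pclocatortype, nodeoids
    FROM pgxc_class
    WHERE pcrelid = 't1'::regclass;   -- 't1' is a placeholder table name

pclocatortype is a one-character code for the distribution strategy (for example 'H' for hash and 'R' for replication), and nodeoids lists the datanodes the table lives on.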
From: Ashutosh B. <ash...@en...> - 2012-07-09 10:56:28
|
Hi Michael, I had a look at the patch. I mainly focused on the overall content of the patch and importantly tests. Before I look at the redistribution code thoroughly, I have few comments. There are many trailing white spaces in the patch. Please fix those, they unnecessarily fail the automatic merges sometimes. You can do that when you commit the patch. Code ==== 1. There is a lot of code, which is refactoring existing code, renaming functions, which is not necessarily related to redistribution work. Can you please provide separate patches for this refactoring? We should commit them separately. For example build_subcluster_data() has been renamed (for good may be), but it makes sense if we do it separately. Someone looking at the ALTER TABLE commit should not get overwhelmed by the extraneous changes. 2. Same is the case with the grammar changes. Please separate the grammar changes related to pgxc_nodelist etc. into separate patch, although it's because of ALTER TABLE you need to do those changes. Please get these patches reviewed as well, since I haven't looked at the changes proper. Tests ===== 1. There is no need to test with huge data, that slows down regression. For performance testing, you can create a separate test (not to be included in regression), if you want. 2. We need tests, which will test the plan cache (in)validation upon redistribution of data, tests for testing existing views working after the redistribution. Please take a look at the PG alter table test for more such scenarios. If you happen to add some performance tests, it would be also good to test the sanity of concurrent transactions accessing the object/s being redistributed. It's vital considering that such redistribution would run for longer. 3. Instead of relying on count(*) to show sanity of the redistributed data, you may use better aggregates like array_agg or sum(), avg() and count(). I would prefer array_agg over others, since you can list all the data values there. You will need aggregate's order by clause (Not that of the SELECT). 4. In the case of redistribution of table with index, you will need to check the sanity of index after the redistribution by some means. 5. I did not understand the significance of the tests where you add and drop column and redistribute the data. The SELECT after the redistribution is not testing anything specific for the added/dropped column. 6. There are no testcases which would change the distribution type and node list at the same time. Please add those. (I am assuming that these two operations are possible together). 7. Negative testcases need to improved. Additional feature ================== It will be helpful to add the distribution information in the output of \d command for tables. It will be good tool for tests to check whether the catalogs have been updated correctly or not. Please add this feature before we complete ALTER TABLE. It shouldn't take much time. Please provide this as a separate patch. On Mon, Jul 9, 2012 at 6:51 AM, Michael Paquier <mic...@gm...>wrote: > Please find attached an updated version of the patches for redistribution. > The only modification is the addition of deeper regression tests to check > redistribution of a table by adding and deleting nodes on it. > This is done with a plpgsql function called alter_table_change_nodes I > created myself for this purpose, making transparent ALTER TABLE > TO/ADD/DELETE NODE whatever the cluster configuration used with regressions. 
> > Regards, > > > On Wed, Jul 4, 2012 at 11:02 AM, Michael Paquier < > mic...@gm...> wrote: > >> Please find attached a new version of the 2nd patch. >> This version corrects some bugs related to table columns being dropped >> and added. >> It also contains new regression cases to cover those problems. >> >> Thanks. >> >> >> On Tue, Jul 3, 2012 at 12:15 PM, Michael Paquier < >> mic...@gm...> wrote: >> >>> Hi all, >>> >>> Please find attached 2 patches: 20120703_remotecopy.patch and >>> 20120703_altertable_distrib.patch. >>> 20120703_remotecopy.patch is a lightly modified version of a patch that >>> has already reviewed by Amit where the COPY protocol used by XC code in >>> copy.c is extracted into an external file. >>> This cleans copy.c with a lot of code and simplifies the comprehension >>> of the protocol used. The only part modified is in >>> RemoteCopy_GetRelationLoc where we scan all the attributes of a relation to >>> find the distribution column of a table in case the list of attribute >>> numbers is not specified. This patch has already been reviewed and can be >>> already committed I think. >>> >>> Now the real part, online data redistribution is managed by the second >>> patch: 20120703_altertable_distrib.patch. >>> I am not coming back to the design of the feature that has been chosen. >>> The main modification that is introduced by this patch is the use of a >>> tuple store to store the tuples that need to be redistributed in the >>> cluster. This patch also contains new features that allow to materialize in >>> a tuple slot raw data received by COPY protocol on Coordinator to be able >>> to redirect to correct node a tuple if the new distribution type is hash or >>> modulo. This new mechanism can also be used not only for data >>> redistribution but also to facilitate the exchange of data between nodes >>> (direct consequences on triggers and global constraints). >>> The reverse transformation (from tuple slot to raw data) is also >>> included in this patch. >>> >>> This patch is something like 2000 lines, and does not yet contain the >>> following features which will be added by other patches once this is >>> committed: >>> - No need to materialize in tuple slot if new distribution is replication >>> - No optimization when a replicated table subcluster is reduced (need >>> only to send TRUNCATE to correct nodes) >>> - No optimization when a replicated table subcluster is increased (need >>> only to send tuple to new nodes after fetching it from old nodes) >>> However, I wrote the patch in a way such as those optimizations are easy >>> to implement in the current infrastructure. >>> >>> Please note that this patch contains all the documentation and >>> regression tests. >>> So, any guy has the courage to provide comments to it? >>> -- >>> Michael Paquier >>> http://michael.otacoo.com >>> >> >> >> >> -- >> Michael Paquier >> http://michael.otacoo.com >> > > > > -- > Michael Paquier > http://michael.otacoo.com > > > ------------------------------------------------------------------------------ > Live Security Virtual Conference > Exclusive live event will cover all the ways today's security and > threat landscape has changed and how IT managers can respond. Discussions > will include endpoint security, mobile security and the latest in malware > threats. 
http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ > _______________________________________________ > Postgres-xc-developers mailing list > Pos...@li... > https://lists.sourceforge.net/lists/listinfo/postgres-xc-developers > > -- Best Wishes, Ashutosh Bapat EntepriseDB Corporation The Enterprise Postgres Company |
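To illustrate point 3 of the review above: a redistribution test can take an order-insensitive fingerprint of the table before and after the ALTER TABLE, rather than relying on count(*) alone. A minimal sketch for a hypothetical distributed table t1(f1 int); the ORDER BY inside the aggregate keeps the output stable no matter which datanode returns its rows first:

    -- before redistribution
    SELECT count(*), sum(f1), array_agg(f1 ORDER BY f1) FROM t1;
    -- ... run the redistribution command under test here ...
    -- after redistribution: all three values must be identical
    SELECT count(*), sum(f1), array_agg(f1 ORDER BY f1) FROM t1;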
From: Koichi S. <koi...@gm...> - 2012-07-09 04:58:00
|
In the case of DBT-1 benchmark, we had up to 12,000 TPS, with ten coordinators and ten datanodes. In this situation, GTM read/wrote about 60MB/sec, which is about 50% of the network max capacity. (sorry again, I'm not quire sure about the exact figure now, but this is very good estimation). Proxies reduced this amount to about a half. As you suggested, yes, there could be a chance to extend GTM scalability a bit more if GTM is equipped with more than one NIC and each NIC is connected to different servers via L2 switch. This will reduce the communication overhead for each communication channel. I've not tried this yet but this sounds interesting to me. Regards; ---------- Koichi Suzuki 2012/7/9 Nikhil Sontakke <ni...@st...>: > Yeah, thanks for bringing out the networking (more important) aspect > here. So it might help to have the GTM traffic happening on a > different private network (separate from the client traffic) for > performance. > >> Of course, GTM proxy reduces the number of GTM threads, which reduces >> the chance of lock conflicts but I've not evaluated this yet. Also, >> I've not evaluated how much GTM cpu is saved by GTM proxy yet. >> > > I did some runs on a 4 node setup of mine and having proxies does seem > to help in reducing the CPU overhead on the GTM node. I had not maxed > out the GTM or anything though, but having proxies around seemed to > help. > > Regards, > Nikhils > >> Regards; >> ---------- >> Koichi Suzuki >> >> >> 2012/7/7 Nikhil Sontakke <ni...@st...>: >>> Hi Shankar, >>> >>> Yeah, the GTM might be able to scale a bit to some level, but after >>> that having the proxies around on each node makes much more sense. It >>> also helps reduce the direct CPU load on the GTM node. And the proxies >>> shouldn't consume that much CPU by themselves too. Unless you are >>> trying a CPU intensive benchmark, but most benchmarks try to churn up >>> IO.. >>> >>> Regards, >>> Nikhils >>> >>> On Fri, Jul 6, 2012 at 9:22 AM, Shankar Hariharan >>> <har...@ya...> wrote: >>>> Hi Ashutosh, >>>> I was trying to size the load on a server and was wondering if a GTM could >>>> be shared w/o much performance overhead between a small number of datanodes >>>> and coordinators. I will post my findings here. >>>> thanks, >>>> Shankar >>>> >>>> ________________________________ >>>> From: Ashutosh Bapat <ash...@en...> >>>> To: Shankar Hariharan <har...@ya...> >>>> Cc: "pos...@li..." >>>> <pos...@li...> >>>> Sent: Friday, July 6, 2012 12:25 AM >>>> Subject: Re: [Postgres-xc-developers] Question on gtm-proxy >>>> >>>> Hi Shankar, >>>> Running gtm-proxy has shown to improve the performance, because it lessens >>>> the load on GTM, by serving requests locally. Why do you want the >>>> coordinators to connect directly to the GTM? Are you seeing any performance >>>> improvement from doing that? >>>> >>>> On Fri, Jul 6, 2012 at 10:08 AM, Shankar Hariharan >>>> <har...@ya...> wrote: >>>> >>>> Follow up to earlier email. In the setup described below, can I avoid using >>>> a gtm-proxy? That is, can I just simply point coordinators to the one gtm >>>> running on node 3 ? >>>> My initial plan was to just run the gtm on node 3 then I thought I could try >>>> a datanode without a local coordinator which was why I put these two >>>> together on node 3. >>>> thanks, >>>> Shankar >>>> >>>> ________________________________ >>>> From: Shankar Hariharan <har...@ya...> >>>> To: "pos...@li..." 
>>>> <pos...@li...> >>>> Sent: Thursday, July 5, 2012 11:35 PM >>>> Subject: Question on multiple coordinators >>>> >>>> Hello, >>>> >>>> Am trying out XC 1.0 in the following configuraiton. >>>> Node 1 - Coord1, Datanode1, gtm-proxy1 >>>> Node 2- Coord2, Datanode2, gtm-proxy2 >>>> Node 3- Datanode3, gtm >>>> >>>> I setup all nodes but forgot to add Coord1 to Coord2 and vice versa. In >>>> addition I missed the pg_hba edit as well. So the first table T1 that I >>>> created for distribution from Coord1 was not "visible| from Coord2 but was >>>> on all the data nodes. >>>> I tried to get Coord2 backinto business in various ways but the first table >>>> I created refused to show up on Coord2 : >>>> - edit pg_hba and add node on both coord1 and 2. Then run select >>>> pgxc_pool_reload(); >>>> - restart coord 1 and 2 >>>> - drop node c2 from c1 and c1 from c2 and add them back followed by select >>>> pgxc_pool_reload(); >>>> >>>> So I tried to create the same table T1 from Coord2 to observe behavior and >>>> it did not like it clearly as all nodes it "wrote" to reported that the >>>> table already existed which was good. At this point I could understand that >>>> Coord2 and Coord1 are not talking alright so I created a new table from >>>> coord1 with replication. This table was visible from both now. >>>> >>>> Question is should I expect to see the first table, let me call it T1 after >>>> a while from Coord2 also? >>>> >>>> >>>> thanks, >>>> Shankar >>>> >>>> >>>> >>>> ------------------------------------------------------------------------------ >>>> Live Security Virtual Conference >>>> Exclusive live event will cover all the ways today's security and >>>> threat landscape has changed and how IT managers can respond. Discussions >>>> will include endpoint security, mobile security and the latest in malware >>>> threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ >>>> _______________________________________________ >>>> Postgres-xc-developers mailing list >>>> Pos...@li... >>>> https://lists.sourceforge.net/lists/listinfo/postgres-xc-developers >>>> >>>> >>>> >>>> >>>> -- >>>> Best Wishes, >>>> Ashutosh Bapat >>>> EntepriseDB Corporation >>>> The Enterprise Postgres Company >>>> >>>> >>>> >>>> >>>> ------------------------------------------------------------------------------ >>>> Live Security Virtual Conference >>>> Exclusive live event will cover all the ways today's security and >>>> threat landscape has changed and how IT managers can respond. Discussions >>>> will include endpoint security, mobile security and the latest in malware >>>> threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ >>>> _______________________________________________ >>>> Postgres-xc-developers mailing list >>>> Pos...@li... >>>> https://lists.sourceforge.net/lists/listinfo/postgres-xc-developers >>>> >>> >>> >>> >>> -- >>> StormDB - http://www.stormdb.com >>> The Database Cloud >>> >>> ------------------------------------------------------------------------------ >>> Live Security Virtual Conference >>> Exclusive live event will cover all the ways today's security and >>> threat landscape has changed and how IT managers can respond. Discussions >>> will include endpoint security, mobile security and the latest in malware >>> threats. 
http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ >>> _______________________________________________ >>> Postgres-xc-developers mailing list >>> Pos...@li... >>> https://lists.sourceforge.net/lists/listinfo/postgres-xc-developers > > > > -- > StormDB - http://www.stormdb.com > The Database Cloud |
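A quick back-of-the-envelope check on Koichi's figures: 60 MB/s of GTM traffic at roughly 12,000 TPS works out to about 60,000,000 / 12,000 ≈ 5 KB of GXID/snapshot traffic per transaction, and 60 MB/s is indeed around half of what a single gigabit NIC can carry (~120 MB/s). Both remedies mentioned here attack the same bottleneck: proxies batch requests and cut the traffic roughly in half, while a second NIC raises the ceiling.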
From: Ashutosh B. <ash...@en...> - 2012-07-09 04:30:46
|
Hi Shankar, You have got answers to the prepared transaction problem, I guess. I have something else below. On Sat, Jul 7, 2012 at 1:44 AM, Shankar Hariharan < har...@ya...> wrote: > As planned I ran some tests using PGBench on this setup : > > Node 1 - Coord1, Datanode1, gtm-proxy1 > Node 2- Coord2, Datanode2, gtm-proxy2 > Node 3- Datanode3, gtm > > I was connecting via Coord1 for these tests: > - scale factor of 30 used > - tests run using the following input parameters for pgbench: > Try connecting to both the coordinators, it should give you better performance, esp, when you are using distributed tables. With distributed tables, coordinator gets involved in query execution more than that in the case of replicated tables. So, balancing load across two coordinators would help. > > Clients Threads Duration Transactions > 1 1 100 6204 > 2 2 100 9960 > 4 4 100 12880 > 6 6 100 1676 > > 8 > 8 8 100 19758 > 10 10 100 21944 > 12 12 100 20674 > > The run went well until the 8 clients. I started seeing errors on 10 > clients onwards and eventually the 14 client run has been hanging around > for over an hour now. The errors I have been seeing on console are the > following : > > pgbench console : > Client 8 aborted in state 12: ERROR: GTM error, could not obtain snapshot > Client 0 aborted in state 13: ERROR: maximum number of prepared > transactions reached > Client 7 aborted in state 13: ERROR: maximum number of prepared > transactions reached > Client 11 aborted in state 13: ERROR: maximum number of prepared > transactions reached > Client 9 aborted in state 13: ERROR: maximum number of prepared > transactions reached > > node console: > ERROR: GTM error, could not obtain snapshot > STATEMENT: INSERT INTO pgbench_history (tid, bid, aid, delta, mtime) > VALUES (253, 26, 1888413, -817, CURRENT_TIMESTAMP); > ERROR: maximum number of prepared transactions reached > HINT: Increase max_prepared_transactions (currently 10). > STATEMENT: PREPARE TRANSACTION 'T201428' > ERROR: maximum number of prepared transactions reached > STATEMENT: END; > ERROR: maximum number of prepared transactions reached > STATEMENT: END; > ERROR: maximum number of prepared transactions reached > STATEMENT: END; > ERROR: maximum number of prepared transactions reached > STATEMENT: END; > ERROR: GTM error, could not obtain snapshot > STATEMENT: INSERT INTO pgbench_history (tid, bid, aid, delta, mtime) > VALUES (140, 29, 2416403, -4192, CURRENT_TIMESTAMP); > > I was also watching the processes on each node and see the following for > the 14 client run: > > > Node1 : > postgres 25571 10511 0 04:41 ? 00:00:02 postgres: postgres > postgres ::1(33481) TRUNCATE TABLE waiting > postgres 25620 11694 0 04:46 ? 00:00:00 postgres: postgres > postgres pgbench-address (50388) TRUNCATE TABLE > > Node2: > postgres 10979 9631 0 Jul05 ? 00:00:42 postgres: postgres > postgres coord1-address(57357) idle in transaction > > Node3: > postgres 20264 9911 0 08:35 ? 00:00:05 postgres: postgres > postgres coord1-address(51406) TRUNCATE TABLE waiting > > > I was going to restart the processes on all nodes and start over but did > not want to lose this data as it could be useful information. > > Any explanation on the above issue is much appreciated. I will try the > next run with a higher value set for max_prepared_transactions. Any > recommendations for a good value on this front? > > thanks, > Shankar > > > ------------------------------ > *From:* Shankar Hariharan <har...@ya...> > *To:* Ashutosh Bapat <ash...@en...> > *Cc:* "pos...@li..." 
< > pos...@li...> > *Sent:* Friday, July 6, 2012 8:22 AM > > *Subject:* Re: [Postgres-xc-developers] Question on gtm-proxy > > Hi Ashutosh, > I was trying to size the load on a server and was wondering if a GTM > could be shared w/o much performance overhead between a small number of > datanodes and coordinators. I will post my findings here. > thanks, > Shankar > > ------------------------------ > *From:* Ashutosh Bapat <ash...@en...> > *To:* Shankar Hariharan <har...@ya...> > *Cc:* "pos...@li..." < > pos...@li...> > *Sent:* Friday, July 6, 2012 12:25 AM > *Subject:* Re: [Postgres-xc-developers] Question on gtm-proxy > > Hi Shankar, > Running gtm-proxy has shown to improve the performance, because it lessens > the load on GTM, by serving requests locally. Why do you want the > coordinators to connect directly to the GTM? Are you seeing any performance > improvement from doing that? > > On Fri, Jul 6, 2012 at 10:08 AM, Shankar Hariharan < > har...@ya...> wrote: > > Follow up to earlier email. In the setup described below, can I avoid > using a gtm-proxy? That is, can I just simply point coordinators to the one > gtm running on node 3 ? > My initial plan was to just run the gtm on node 3 then I thought I could > try a datanode without a local coordinator which was why I put these two > together on node 3. > thanks, > Shankar > > ------------------------------ > *From:* Shankar Hariharan <har...@ya...> > *To:* "pos...@li..." < > pos...@li...> > *Sent:* Thursday, July 5, 2012 11:35 PM > *Subject:* Question on multiple coordinators > > Hello, > > Am trying out XC 1.0 in the following configuraiton. > Node 1 - Coord1, Datanode1, gtm-proxy1 > Node 2- Coord2, Datanode2, gtm-proxy2 > Node 3- Datanode3, gtm > > I setup all nodes but forgot to add Coord1 to Coord2 and vice versa. In > addition I missed the pg_hba edit as well. So the first table T1 that I > created for distribution from Coord1 was not "visible| from Coord2 but > was on all the data nodes. > I tried to get Coord2 backinto business in various ways but the first > table I created refused to show up on Coord2 : > - edit pg_hba and add node on both coord1 and 2. Then run select > pgxc_pool_reload(); > - restart coord 1 and 2 > - drop node c2 from c1 and c1 from c2 and add them back followed by select > pgxc_pool_reload(); > > So I tried to create the same table T1 from Coord2 to observe behavior > and it did not like it clearly as all nodes it "wrote" to reported that the > table already existed which was good. At this point I could understand that > Coord2 and Coord1 are not talking alright so I created a new table from > coord1 with replication. This table was visible from both now. > > Question is should I expect to see the first table, let me call it T1 > after a while from Coord2 also? > > > thanks, > Shankar > > > > > ------------------------------------------------------------------------------ > Live Security Virtual Conference > Exclusive live event will cover all the ways today's security and > threat landscape has changed and how IT managers can respond. Discussions > will include endpoint security, mobile security and the latest in malware > threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ > _______________________________________________ > Postgres-xc-developers mailing list > Pos...@li... 
> https://lists.sourceforge.net/lists/listinfo/postgres-xc-developers > > > > > -- > Best Wishes, > Ashutosh Bapat > EntepriseDB Corporation > The Enterprise Postgres Company > > > > > > -- Best Wishes, Ashutosh Bapat EntepriseDB Corporation The Enterprise Postgres Company |
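On the distributed-versus-replicated point above: with pgbench's schema, the distribution strategy is chosen per table at creation time using XC's DISTRIBUTE BY clause. A sketch for two of the tables (column layouts follow stock pgbench; the choice of which table to replicate and which to hash is illustrative, not a tested recommendation):

    CREATE TABLE pgbench_branches (bid int PRIMARY KEY, bbalance int, filler char(88))
        DISTRIBUTE BY REPLICATION;
    CREATE TABLE pgbench_accounts (aid int PRIMARY KEY, bid int, abalance int, filler char(84))
        DISTRIBUTE BY HASH (aid);

Hash-distributing the large accounts table spreads its updates across the datanodes, while a small replicated table can be read locally on any node.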
From: Michael P. <mic...@gm...> - 2012-07-09 04:19:47
|
If you want to have fun: http://sourceforge.net/search/index.php?group_id=311227&type_of_search=mlists&limit=25&q=redistribution On Mon, Jul 9, 2012 at 1:12 PM, Nikhil Sontakke <ni...@st...> wrote: > > Patch, design docs, and discussion threads are here, so it is just > necessary > > to follow the events. > > > > Umm, sorry but I was not able to find the design docs in the thread > here.. Can you post them again please? > > Regards, > Nikhils > -- > StormDB - http://www.stormdb.com > The Database Cloud > -- Michael Paquier http://michael.otacoo.com |
From: Nikhil S. <ni...@st...> - 2012-07-09 04:12:41
|
> Patch, design docs, and discussion threads are here, so it is just necessary > to follow the events. > Umm, sorry, but I was not able to find the design docs in the thread here. Can you post them again, please? Regards, Nikhils -- StormDB - http://www.stormdb.com The Database Cloud |
From: Michael P. <mic...@gm...> - 2012-07-09 04:02:42
|
On Mon, Jul 9, 2012 at 12:46 PM, Nikhil Sontakke <ni...@st...> wrote: > > Hence, the maximum value of max_prepared_transactions with which you > will be > > sure that this error will not come out is the sum of the max_connections > of > > all the Coordinators of your cluster. However you are more or less sure > that > > you won't have a 2PC occurring on a Datanode at the same time for all the > > backends of Coordinators, so usually max_prepared_transactions could be > set > > safely at 30%~40% of the sum of Coordinators' max_connections. Check with > > your application. > > > > Why is that 30-40% value safe? > This estimation is based on my long experience testing XC with several benchmarks. > Why are we sure that we won't have a 2PC occurring on a datanode at the > same time for the coordinator backends? > It is possible; it depends on the application. > > He is using PGBENCH which is TPC-B and it's 100% updates. Now we don't > know how his tables are laid out, but I am guessing that it will > surely cause lots of 2PCs. So you are right, the value of > max_prepared_transactions on datanodes will have to consider typical > concurrent connections that might come from all the coordinator nodes. > I would imagine that this pgbench run is going to be slow on XC, as there will be a lot of concurrent updates. So yeah, here you perhaps need a higher value; I just mean that this setting depends on the network, the machines, the cluster configuration (Co/Dn together/separated) and the application used. -- Michael Paquier http://michael.otacoo.com |
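In numbers: for a cluster like the one in this thread, with two coordinators and, say, max_connections = 100 on each, the sizing discussed above works out as follows in every datanode's postgresql.conf (values illustrative, not prescriptive):

    # worst case: every coordinator backend holds a prepared transaction here
    # sum of coordinator max_connections = 100 + 100 = 200
    max_prepared_transactions = 200   # guaranteed never to hit the limit
    # or, per the 30%~40% rule of thumb: 0.3 * 200 = 60 up to 0.4 * 200 = 80
    # max_prepared_transactions = 80

A restart of the datanodes is needed for a new value to take effect, since max_prepared_transactions cannot be changed on a running server.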
From: Nikhil S. <ni...@st...> - 2012-07-09 03:58:11
|
Yeah, thanks for bringing out the networking (more important) aspect here. So it might help to have the GTM traffic happen on a separate private network (away from the client traffic) for performance. > Of course, GTM proxy reduces the number of GTM threads, which reduces > the chance of lock conflicts, but I've not evaluated this yet. Also, > I've not evaluated how much GTM cpu is saved by GTM proxy yet. I did some runs on a 4-node setup of mine, and having proxies does seem to help in reducing the CPU overhead on the GTM node. I had not maxed out the GTM or anything, but having the proxies around did seem to help. Regards, Nikhils -- StormDB - http://www.stormdb.com The Database Cloud |
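For reference, the private-network layout suggested here might look roughly like this in gtm_proxy.conf on node 1. This is only a sketch: the parameter names follow gtm_proxy.conf.sample, and the 10.0.0.x interconnect addresses are hypothetical.

# gtm_proxy.conf -- keep GTM traffic off the client network
nodename = 'gtm_proxy1'
listen_addresses = '*'
port = 6666
gtm_host = '10.0.0.3'    # GTM on node 3, reached over the private interconnect
gtm_port = 6666
worker_threads = 1       # each worker groups its backends' requests to GTM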
From: Nikhil S. <ni...@st...> - 2012-07-09 03:46:49
|
> Hence, the maximum value of max_prepared_transactions with which you will be > sure that this error will not come out is the sum of the max_connections of > all the Coordinators of your cluster. However you are more or less sure that > you won't have a 2PC occurring on a Datanode at the same time for all the > backends of Coordinators, so usually max_prepared_transactions can be set > safely at 30%~40% of the sum of Coordinators' max_connections. Check with > your application. Why is that 30-40% value safe? Why are we sure that we won't have a 2PC occurring on a datanode at the same time for the coordinator backends? He is using pgbench, which is TPC-B and 100% updates. Now we don't know how his tables are laid out, but I am guessing that it will surely cause lots of 2PCs. So you are right, the value of max_prepared_transactions on datanodes will have to consider the typical concurrent connections that might come from all the coordinator nodes. Regards, Nikhils -- StormDB - http://www.stormdb.com The Database Cloud |
From: Michael P. <mic...@gm...> - 2012-07-09 02:09:36
|
Hi all,

The first 2 patches I submitted to the MLs implement a global mechanism for redistribution. This mechanism might be slow and is really basic, but it has the following merits:
- implementation of a generic grammar (refactoring of gram.y at the same time)
- refactoring of XC-related CREATE TABLE code to use the same APIs as ALTER TABLE
- addition of a tuplestore mechanism to materialize data received through the COPY protocol and send it back in COPY format to the remote nodes after dematerializing it
- COPY code related to XC refactored and classified under the banner of "remote COPY"

This resulted in a total of 2500 lines. And I can already imagine you guys complaining about the performance, which is not as good as the DELETE ... RETURNING method proposed by Amit. However, I think we should commit those patches so that I have a good base to work on what I believe is the next step of the redistribution implementation. The method proposed by Amit is good, but only under certain circumstances and only for some redistribution conditions. And I think this is only the visible tip of the iceberg; 90% of it is still hidden from us.

For the last couple of days, I have been thinking about some generic ideas for implementing a solid base for redistribution operations that would allow us to include as many optimizations as we want, like Amit's. I noticed that redistribution is only a matter of repeated operations that may change depending on:
1) changing the distribution type of the table
2) reducing the nodes where data is located
3) increasing the nodes where data is located

Let's take a couple of examples:
1) A replicated table whose set of nodes is reduced. All we need to do is launch a TRUNCATE on the nodes where the data is no longer located.
2) A replicated table whose set of nodes is increased. We need to fetch the data on the Coordinator and then send it to the new nodes of the table.
3) A table changed from replicated to distributed on the same set of nodes. We only need to send a DELETE command to each remote node to delete the tuples that no longer belong there.
4) A table changed from distributed to replicated. We can use Amit's idea here: DELETE with RETURNING.
5) A table changed from distributed to distributed, with a modification of the distribution column. Worst case... But we can still use Amit's idea, the DELETE with RETURNING.

However, RETURNING is not supported yet, so for the time being we need to fall back to the basic method in cases 4 and 5. Those are not the only examples. Btw, my point is that data redistribution can be described as a succession of small and simple operations that need to be launched on a set of nodes, and the goal of the game here is to identify the list of operations to be done for a given redistribution.

Here is the point of my next implementation step: we need an algorithm that can classify all the necessary operations depending on the redistribution being done (like a planner, but for redistribution), and then launch them (like an executor). Once this basic planner/executor architecture is put in place for table redistribution, it is easy to plug in new operation types which are optimizations, and it will heavily reduce the implementation cost of new things (like the DELETE RETURNING tuning) and their maintenance. I am currently working on the implementation of this architecture, which will be a patch based on the 2 others I sent before.

Then the optimizations for cases 1/2/3/4/5 could be added one by one, with single patches that launch those operations after analyzing the redistribution operation. The addition of each optimization could also be done only up to a certain point for now (1/2/3 only?), and that will allow me to move on to the 9.2 merge. So here is my strategy. Comments? -- Michael Paquier http://michael.otacoo.com |
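To make the cases concrete, the per-node operations described above boil down to plain SQL along these lines. This is only an illustration with a hypothetical table t1, not the actual redistribution code, which drives equivalent steps internally:

-- Case 1: replicated table, node set reduced:
-- on each node removed from the set, drop the stale local copy.
TRUNCATE TABLE t1;

-- Case 2: replicated table, node set increased:
-- fetch the rows on the Coordinator from one existing node,
-- then replay them on each newly added node.
COPY t1 TO STDOUT;
COPY t1 FROM STDIN;

-- Cases 4/5: distributed -> replicated, or new distribution column:
-- collect each node's rows while deleting them (once RETURNING is
-- supported), then push the collected rows back out.
DELETE FROM t1 RETURNING *;
COPY t1 FROM STDIN;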
From: Michael P. <mic...@gm...> - 2012-07-09 01:07:58
|
Just to tell you guys that I fixed this issue. Here is the fix on master: https://github.com/postgres-xc/postgres-xc/commit/cc12ea8e67c46f9782804105915dcc90725a1f66 It has also been back-patched to 1.0 stable. Regards, On Sat, Jul 7, 2012 at 8:16 AM, Michael Paquier <mic...@gm...> wrote: > On 2012/07/07, at 0:03, Shankar Hariharan <har...@ya...> wrote: > Michael, read your message on incorrect copy. I was wondering if it is possible to both replicate and distribute a table. Just wondering if it can be used to gather all distributed data in one spot to work around data loss. > You cannot yet do distribution and replication of a table at the same time. And there is no data loss in my bug; we just do not select the correct node when running COPY in those particular circumstances. For reference, the original report: > While testing data redistribution I found this bug with COPY. It is reproducible with master, and very probably with 1.0 stable.
> postgres=# create table aa (a int) distribute by replication to node dn2;
> CREATE TABLE
> postgres=# insert into aa values (generate_series(1,10));
> INSERT 0 10
> postgres=# copy aa to stdout; -- no output here
> postgres=#
The bug is registered here: https://sourceforge.net/tracker/?func=detail&aid=3540784&group_id=311227&atid=1310232 -- Michael Paquier http://michael.otacoo.com |
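With the fix in, the same reproduction case should return the replicated rows; the output below is the expected behavior, not a captured session:

postgres=# copy aa to stdout;
1
2
3
4
5
6
7
8
9
10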
From: Koichi S. <koi...@gm...> - 2012-07-09 00:28:49
|
Most of the overhead associated with GTM is network communication. As you know, every transaction needs at least a GXID and a snapshot from GTM, which leads to an enormous number of network interactions, though not necessarily a large amount of data. If the transaction isolation mode is read committed, each statement needs a separate snapshot, which adds much more network workload to GTM. GTM proxy improves this: it groups multiple requests from backends into a single message, reducing the number of interactions. Moreover, multiple snapshot requests are reduced to a single one; one snapshot from GTM is copied to multiple backends, which reduces the message size. Of course, GTM proxy also reduces the number of GTM threads, which reduces the chance of lock conflicts, but I've not evaluated this yet. Also, I've not yet evaluated how much GTM CPU is saved by GTM proxy. Regards; ---------- Koichi Suzuki 2012/7/7 Nikhil Sontakke <ni...@st...>: > Hi Shankar, > Yeah, the GTM might be able to scale a bit to some level, but after > that having the proxies around on each node makes much more sense. It > also helps reduce the direct CPU load on the GTM node. And the proxies > shouldn't consume that much CPU by themselves either, unless you are > trying a CPU-intensive benchmark; most benchmarks try to churn up IO. > Regards, > Nikhils > -- > StormDB - http://www.stormdb.com > The Database Cloud |
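A practical corollary of the per-statement snapshots mentioned here: a repeatable-read transaction takes its snapshot once, so batching reads into one such transaction cuts the number of snapshot requests reaching GTM. This is plain PostgreSQL behavior rather than an XC-specific knob, shown with pgbench's table names:

BEGIN TRANSACTION ISOLATION LEVEL REPEATABLE READ;
SELECT count(*) FROM pgbench_accounts;      -- snapshot taken at the first statement
SELECT sum(abalance) FROM pgbench_accounts; -- reuses the same snapshot
COMMIT;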
From: Michael P. <mic...@gm...> - 2012-07-08 23:36:40
|
Just giving some precision here. In XC we use an internal 2PC process at transaction commit if the transaction involves 2 or more nodes in a write operation (DML or DDL). So each connection from your application to a Coordinator may end up running a 2PC transaction. Hence, the maximum value of max_prepared_transactions with which you will be sure that this error will not come out is the sum of the max_connections of all the Coordinators of your cluster. However, you are more or less sure that you won't have a 2PC occurring on a Datanode at the same time for all the backends of all Coordinators, so usually max_prepared_transactions can be set safely at 30%~40% of the sum of Coordinators' max_connections. Check with your application. On Sat, Jul 7, 2012 at 2:27 PM, Nikhil Sontakke <ni...@st...> wrote: > How many clients do you want to run with this eventually? That will > determine a decent value for max_prepared_transactions. Note that > max_prepared_transactions takes a wee bit more shared memory per > prepared transaction. But it's OK to set it high, proportionate to the > max_connections value. -- Michael Paquier http://michael.otacoo.com |
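Putting numbers on this for a cluster like the one in this thread (a sketch; the percentage is the rough guideline above, not a hard limit): two Coordinators with max_connections = 100 each give a worst case of 200 concurrent 2PC transactions per Datanode, so:

# postgresql.conf on each Datanode
# Fully safe value: sum of Coordinators' max_connections (2 x 100 = 200).
# Following the 30%~40% guideline instead:
max_prepared_transactions = 80    # ~40% of 200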
From: Mason S. <ma...@st...> - 2012-07-08 16:09:33
|
On Sat, Jul 7, 2012 at 7:52 PM, Shankar Hariharan <har...@ya...> wrote: > Thanks Nikhil. I have set both to 100 for my next run. I have another > question: if I create a table w/o specifying the distribution strategy, I > still see that the data is distributed across the nodes. What is the default > distribution strategy? It tries to use the first column of a primary key or unique index, if specified in the CREATE TABLE statement, or the first column of a foreign key. If neither is available, it uses the first column with a reasonable data type (i.e., not BYTEA, not BOOLEAN). > I did run some tests across 3 nodes and noticed that the data is not > always distributed equally. For instance, when I first inserted 10 > records (all integer values) I noticed that datanode 1 got just one record > while the other two nodes were almost equal. However, after 2 > more inserts of 10 records each, all 3 nodes were almost at the same load > level (w.r.t. number of records). Yes, I think there were just too few rows in your sample data set. As it gets big, it will even out. -- Mason Sharp StormDB - http://www.stormdb.com The Database Cloud |
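One way to check which strategy a table ended up with is the pgxc_class catalog. A sketch, assuming the XC 1.0 catalog layout, where pclocatortype 'H' means hash distribution and pcattnum is the attribute number of the distribution column; for the test table discussed here one would expect something like:

postgres=# select pclocatortype, pcattnum from pgxc_class where pcrelid = 'test'::regclass;
 pclocatortype | pcattnum
---------------+----------
 H             |        1
(1 row)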
From: Shankar H. <har...@ya...> - 2012-07-07 23:53:06
|
Thanks Nikhil. I have set both to 100 for my next run. I have another question: if I create a table w/o specifying the distribution strategy, I still see that the data is distributed across the nodes. What is the default distribution strategy? I did run some tests across 3 nodes and noticed that the data is not always distributed equally. For instance, when I first inserted 10 records (all integer values) I noticed that datanode 1 got just one record while the other two nodes were almost equal. However, after 2 more inserts of 10 records each, all 3 nodes were almost at the same load level (w.r.t. number of records). Table used for test:
 Column |  Type   | Modifiers
--------+---------+-------------------------------------------------------
 id     | integer | not null default nextval('test_id_seq'::regclass)
 num    | integer |
thanks, Shankar ________________________________ From: Nikhil Sontakke <ni...@st...> To: Shankar Hariharan <har...@ya...> Cc: Ashutosh Bapat <ash...@en...>; "pos...@li..." <pos...@li...> Sent: Saturday, July 7, 2012 12:27 AM Subject: Re: [Postgres-xc-developers] Question on gtm-proxy > Any explanation on the above issue is much appreciated. I will try the next > run with a higher value set for max_prepared_transactions. Any > recommendations for a good value on this front? > How many clients do you want to run with this eventually? That will determine a decent value for max_prepared_transactions. Note that max_prepared_transactions takes a wee bit more shared memory per prepared transaction. But it's OK to set it high, proportionate to the max_connections value. Regards, Nikhils |
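To see exactly how the rows landed, one can count them per node from a Coordinator with EXECUTE DIRECT. A sketch, assuming the datanodes were registered under the names datanode1/2/3:

EXECUTE DIRECT ON (datanode1) 'SELECT count(*) FROM test';
EXECUTE DIRECT ON (datanode2) 'SELECT count(*) FROM test';
EXECUTE DIRECT ON (datanode3) 'SELECT count(*) FROM test';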
From: Nikhil S. <ni...@st...> - 2012-07-07 05:27:48
|
> Any explanation on the above issue is much appreciated. I will try the next > run with a higher value set for max_prepared_transactions. Any > recommendations for a good value on this front? How many clients do you want to run with this eventually? That will determine a decent value for max_prepared_transactions. Note that max_prepared_transactions takes a wee bit more shared memory per prepared transaction. But it's OK to set it high, proportionate to the max_connections value. Regards, Nikhils -- StormDB - http://www.stormdb.com The Database Cloud |
From: Andrei M. <and...@gm...> - 2012-07-06 23:34:14
2012/7/6 Michael Paquier <mic...@gm...>

> On 2012/07/06, at 23:18, Andrei Martsinchyk <and...@gm...> wrote:
>
> Hi Shankar,
>
> You can safely clone Coordinators by plain copying of the data directory
> and adjusting the configuration afterwards. That approach may be used to
> fix your problem or to add a coordinator to an existing cluster.
>
> 1. Stop all cluster components.
> 2. Copy the coordinator database to the new location.
> 3. Start GTM, the GTM proxy if appropriate, and the Coordinator,
> specifying -D <new datadir location>. Make sure you are not running a
> coordinator on the master copy of the data directory; the two copies
> still share the same node name, and GTM would not allow that.
> 4. Connect psql or another client to the coordinator and create a record
> in the pgxc_node table for the future "self" entry using the CREATE NODE
> command. You may also need to adjust the connection info for the current
> "self", which will still point to the original coordinator, since the
> view may differ from the new location; for example, you may want to
> replace host = 'localhost' with host = '<IP address>'.
> 5. Adjust the configuration of the new coordinator. You must change
> pgxc_node_name so it is unique in the cluster; if the new location is on
> the same box, you may also need to change the port it listens on for
> client connections and the pooler port. The configuration should match
> the "self" entry you created in the previous step.
> 6. Restart the new coordinator, then start the other cluster components.
> 7. Connect to the old coordinators and use the CREATE NODE command to
> make them aware of the new coordinator.
> 8. Enjoy.
>
> The point here is that we shouldn't have to stop the cluster for a
> coordinator addition. You need to protect your cluster from DDL intrusion
> while copying catalog data to the new coordinator.

Agreed, but the master coordinator has to be stopped when the copy is
started for the first time, otherwise GTM would not allow it to connect.
There is a workaround with another temporary GTM for the copy, but if it
is not a production cluster it is simpler just to stop everything.

> 2012/7/6 Shankar Hariharan <har...@ya...>
>
>> Hello,
>>
>> Am trying out XC 1.0 in the following configuration.
>> Node 1 - Coord1, Datanode1, gtm-proxy1
>> Node 2 - Coord2, Datanode2, gtm-proxy2
>> Node 3 - Datanode3, gtm
>>
>> I set up all nodes but forgot to add Coord1 to Coord2 and vice versa. In
>> addition I missed the pg_hba edit as well. So the first table T1 that I
>> created for distribution from Coord1 was not "visible" from Coord2 but
>> was on all the data nodes.
>> I tried to get Coord2 back into business in various ways, but the first
>> table I created refused to show up on Coord2:
>> - edit pg_hba and add the node on both coord1 and 2, then run select
>> pgxc_pool_reload();
>> - restart coord1 and 2
>> - drop node c2 from c1 and c1 from c2 and add them back, followed by
>> select pgxc_pool_reload();
>>
>> So I tried to create the same table T1 from Coord2 to observe the
>> behavior, and it clearly did not like it: all the nodes it "wrote" to
>> reported that the table already existed, which was good. At this point I
>> could see that Coord2 and Coord1 were not talking properly, so I created
>> a new table from Coord1 with replication. That table was visible from
>> both.
>>
>> Question: should I expect to see the first table, call it T1, show up on
>> Coord2 after a while?
>>
>> thanks,
>> Shankar
>
> --
> Andrei Martsinchyk
>
> StormDB - http://www.stormdb.com
> The Database Cloud

--
Andrei Martsinchyk

StormDB - http://www.stormdb.com
The Database Cloud
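Read as a shell session, the eight steps above might look roughly like the
sketch below. Every path, host, port, and node name is a placeholder
invented for illustration; the config edits are folded in before the
clone's first start rather than applied at step 5, and the CREATE NODE,
DROP NODE, and pgxc_pool_reload() calls are the same commands used
elsewhere in this thread.

    # Placeholders throughout; GTM and the proxies are assumed to be
    # running already, per step 3.
    OLD=/data/coord1   # data directory of the existing coordinator
    NEW=/data/coord2   # target location for the clone

    pg_ctl stop -D "$OLD" -m fast   # step 1
    cp -a "$OLD" "$NEW"             # step 2

    # Step 5, done before the first start: unique node name and own ports
    # (assumes these keys are already present uncommented in the file).
    sed -i "s/^pgxc_node_name =.*/pgxc_node_name = 'coord2'/" "$NEW/postgresql.conf"
    sed -i "s/^port =.*/port = 5433/" "$NEW/postgresql.conf"
    sed -i "s/^pooler_port =.*/pooler_port = 6668/" "$NEW/postgresql.conf"

    pg_ctl start -Z coordinator -D "$NEW"   # steps 3 and 6
    # Step 4: register the clone's own "self" entry, then repoint the row
    # inherited from the original, which may still say host='localhost'.
    psql -p 5433 -d postgres -c "CREATE NODE coord2 WITH (TYPE = 'coordinator', HOST = '192.168.1.2', PORT = 5433)"
    psql -p 5433 -d postgres -c "DROP NODE coord1"
    psql -p 5433 -d postgres -c "CREATE NODE coord1 WITH (TYPE = 'coordinator', HOST = '192.168.1.1', PORT = 5432)"
    psql -p 5433 -d postgres -c "SELECT pgxc_pool_reload()"

    pg_ctl start -Z coordinator -D "$OLD"   # original back online
    # Step 7: make the old coordinator aware of the new one.
    psql -p 5432 -d postgres -c "CREATE NODE coord2 WITH (TYPE = 'coordinator', HOST = '192.168.1.2', PORT = 5433)"
    psql -p 5432 -d postgres -c "SELECT pgxc_pool_reload()"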
From: Michael P. <mic...@gm...> - 2012-07-06 23:19:08
On 2012/07/06, at 23:18, Andrei Martsinchyk <and...@gm...> wrote:

> Hi Shankar,
>
> You can safely clone Coordinators by plain copying of the data directory
> and adjusting the configuration afterwards. That approach may be used to
> fix your problem or to add a coordinator to an existing cluster.
>
> 1. Stop all cluster components.
> 2. Copy the coordinator database to the new location.
> 3. Start GTM, the GTM proxy if appropriate, and the Coordinator,
> specifying -D <new datadir location>. Make sure you are not running a
> coordinator on the master copy of the data directory; the two copies
> still share the same node name, and GTM would not allow that.
> 4. Connect psql or another client to the coordinator and create a record
> in the pgxc_node table for the future "self" entry using the CREATE NODE
> command. You may also need to adjust the connection info for the current
> "self", which will still point to the original coordinator, since the
> view may differ from the new location; for example, you may want to
> replace host = 'localhost' with host = '<IP address>'.
> 5. Adjust the configuration of the new coordinator. You must change
> pgxc_node_name so it is unique in the cluster; if the new location is on
> the same box, you may also need to change the port it listens on for
> client connections and the pooler port. The configuration should match
> the "self" entry you created in the previous step.
> 6. Restart the new coordinator, then start the other cluster components.
> 7. Connect to the old coordinators and use the CREATE NODE command to
> make them aware of the new coordinator.
> 8. Enjoy.

The point here is that we shouldn't have to stop the cluster for a
coordinator addition. You need to protect your cluster from DDL intrusion
while copying catalog data to the new coordinator.

> 2012/7/6 Shankar Hariharan <har...@ya...>
>
> Hello,
>
> Am trying out XC 1.0 in the following configuration.
> Node 1 - Coord1, Datanode1, gtm-proxy1
> Node 2 - Coord2, Datanode2, gtm-proxy2
> Node 3 - Datanode3, gtm
>
> I set up all nodes but forgot to add Coord1 to Coord2 and vice versa. In
> addition I missed the pg_hba edit as well. So the first table T1 that I
> created for distribution from Coord1 was not "visible" from Coord2 but
> was on all the data nodes.
> I tried to get Coord2 back into business in various ways, but the first
> table I created refused to show up on Coord2:
> - edit pg_hba and add the node on both coord1 and 2, then run select
> pgxc_pool_reload();
> - restart coord1 and 2
> - drop node c2 from c1 and c1 from c2 and add them back, followed by
> select pgxc_pool_reload();
>
> So I tried to create the same table T1 from Coord2 to observe the
> behavior, and it clearly did not like it: all the nodes it "wrote" to
> reported that the table already existed, which was good. At this point I
> could see that Coord2 and Coord1 were not talking properly, so I created
> a new table from Coord1 with replication. That table was visible from
> both.
>
> Question: should I expect to see the first table, call it T1, show up on
> Coord2 after a while?
>
> thanks,
> Shankar
>
> --
> Andrei Martsinchyk
>
> StormDB - http://www.stormdb.com
> The Database Cloud
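For the narrower problem that started the thread, Coord1 and Coord2 simply
not knowing about each other, the registration itself is just the commands
already quoted above; a hypothetical rendering, with node names, hosts,
and ports as placeholders:

    # On coord1, register coord2, then refresh the pooler; then the
    # mirror image on coord2. All names and addresses are placeholders.
    psql -h node1 -p 5432 -d postgres -c "CREATE NODE coord2 WITH (TYPE = 'coordinator', HOST = 'node2', PORT = 5432)"
    psql -h node1 -p 5432 -d postgres -c "SELECT pgxc_pool_reload()"
    psql -h node2 -p 5432 -d postgres -c "CREATE NODE coord1 WITH (TYPE = 'coordinator', HOST = 'node1', PORT = 5432)"
    psql -h node2 -p 5432 -d postgres -c "SELECT pgxc_pool_reload()"

Note that this only fixes the propagation of future DDL: a table created
while the coordinators were not registered with each other never reached
Coord2's catalog, and registering them afterwards does not backfill it.
That is why T1 will not simply show up on Coord2 later, and why the
data-directory clone described above is the practical repair.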