| Author |
Message |
![[Post New]](/templates/default/images/icon_minipost_new.gif) 23 Jun 2008 01:43:54
|
Venkat_B
Member
Joined: 16 Jun 2008 08:39:02
Messages: 45
Offline
|
Hi,
when executing the following query in GridSQL i'm getting below error, can anyone suggest what went wrong
GridSQL -> select count(*) from TDR_MSC_060807;
SQLException: ERROR: Failed To Get Results For ( SQL , NodeURL) : ( SELECT COUNT(*) AS XCOL1 FROM TDR_MSC_060807 ) eQS Node 3 has aborted execution, cause is: com.edb.gridsql.exception.XDBServerException : Can not send data to Nodes
Thankx
Venkat
|
|
|
 |
![[Post New]](/templates/default/images/icon_minipost_new.gif) 23 Jun 2008 08:31:33
|
Mason_S
Senior member
Joined: 1 Apr 2008 09:03:08
Messages: 380
Offline
|
Hi Venkat,
This is usually a permissions error, where one node cannot access another. You may have to change the pg_hba.conf file on each node and configuring .pgpass; please see the Admin Guide. You can test this out by logging in on one node and trying to access the other directly and see if you get prompted for a password. If you do get prompted, then the permissions are not set up properly.
Regards,
Mason
|
|
|
 |
![[Post New]](/templates/default/images/icon_minipost_new.gif) 23 Mar 2009 20:42:02
|
Vlad_O
Member
Joined: 22 Mar 2009 00:19:06
Messages: 10
Offline
|
Mason_S wrote:Hi Venkat,
This is usually a permissions error, where one node cannot access another. You may have to change the pg_hba.conf file on each node and configuring .pgpass; please see the Admin Guide. You can test this out by logging in on one node and trying to access the other directly and see if you get prompted for a password. If you do get prompted, then the permissions are not set up properly.
Regards,
Mason
Hi Mason,
I am having the same problem. I have all nodes configured on the same box. As a matter of fact, it looks like all queries execute fine, except for those that contain aggregate functions: those all fail with the above error.
Thanks,
Vlad.
|
|
|
 |
![[Post New]](/templates/default/images/icon_minipost_new.gif) 23 Mar 2009 21:16:38
|
Vlad_O
Member
Joined: 22 Mar 2009 00:19:06
Messages: 10
Offline
|
Vlad_O wrote:Hi Mason,
I am having the same problem. I have all nodes configured on the same box. As a matter of fact, it looks like all queries execute fine, except for those that contain aggregate functions: those all fail with the above error.
Thanks,
Vlad.
Sorry for the complaint, I figured it out Just FYI, perhaps it will help somebody else: the underlying database's log file contained "FATAL: sorry, too many clients already" error. So the problem was that the max_connections parameter in the postgresql.conf was not high enough. I ignored your recommendations regarding this parameter in the Admin Guide, sorry.
|
|
|
 |
![[Post New]](/templates/default/images/icon_minipost_new.gif) 3 Apr 2009 07:13:55
|
Venkat_B
Member
Joined: 16 Jun 2008 08:39:02
Messages: 45
Offline
|
Hi Mason,
I'm still getting the same problem. I'm able to connect to the node database from all other locations and servers.
May be permissions is one issue but coming to my scenario i think that's not problem with permissions.
And also there is no problem from SQL end. CREATE table AS is one situation where i'm getting this problem.
Thanks,
Venkat.
This message was edited 1 time. Last update was at 3 Apr 2009 07:15:17
|
|
|
 |
![[Post New]](/templates/default/images/icon_minipost_new.gif) 3 Apr 2009 09:36:08
|
Mason_S
Senior member
Joined: 1 Apr 2008 09:03:08
Messages: 380
Offline
|
Hi Venkat,
If you get the error everytime for queries like SELECT COUNT(*)..., then it is almost definitely a permissions problem.
If it just happens from time to time for really large queries, then it could be a resource issue.
Does it happen everytime for SELECT COUNT(*)? If so, try changing pg_hba.conf to allow connections from everywhere, and restarting Postgres/Advanced Server. You can always tighten up permissions later.
Thanks,
Mason
|
|
|
 |
![[Post New]](/templates/default/images/icon_minipost_new.gif) 6 Apr 2009 10:55:32
|
Venkat_B
Member
Joined: 16 Jun 2008 08:39:02
Messages: 45
Offline
|
Hi Mason,
I'm not getting the error everytime.
Most of the time i see the error with Create Table AS statement. As i have 20 to 30k records in my test tables, while fetching the data from those tables also i'm getting the same error. I'm trying to trace the problem but it's like once in a while. But still i'll just come with clear scenario where i'm facing this problem for maximum number of times.
Thanks,
Venkat.
|
|
|
 |
![[Post New]](/templates/default/images/icon_minipost_new.gif) 6 Apr 2009 15:42:54
|
Mason_S
Senior member
Joined: 1 Apr 2008 09:03:08
Messages: 380
Offline
|
Hi Venkat,
Which version are you using? Are you using the latest from CVS?
Thanks,
Mason
|
|
|
 |
![[Post New]](/templates/default/images/icon_minipost_new.gif) 7 Apr 2009 09:48:04
|
Venkat_B
Member
Joined: 16 Jun 2008 08:39:02
Messages: 45
Offline
|
Hi Mason,
Yes we are using the latest ones.
(PostgresPlus 8.3.3 GridSQL 1.0)
Thanks,
Venkat
|
|
|
 |
![[Post New]](/templates/default/images/icon_minipost_new.gif) 7 Apr 2009 17:03:56
|
Mason_S
Senior member
Joined: 1 Apr 2008 09:03:08
Messages: 380
Offline
|
Hi Venkat,
I think there was a subtle timing issue or two that were fixed in 1.1. Please try the latest code from CVS and see if it helps.
Thanks,
Mason
|
|
|
 |
![[Post New]](/templates/default/images/icon_minipost_new.gif) 9 Apr 2009 09:56:56
|
Venkat_B
Member
Joined: 16 Jun 2008 08:39:02
Messages: 45
Offline
|
Hi Mason,
We are using the very recent taken from CVS.
And here is one case where we got the same error:
GridSQL -> create table csdtn_test (col1 varchar(100),col2 varchar(20),col3 date,col4 NUMERIC(17,2),col5 integer,col6 VARCHAR(5)) partitioning key col1 on all tablespace tbs_grid;
OK
Response time: 0m 0s 164ms Total time: 0m 0s 164ms
GridSQL -> copy csdtn_test from '/home/postgress/test.csv' delimiter ',' CSV;
OK
Response time: 0m 0s 429ms Total time: 0m 0s 429ms
GridSQL -> select count(*) from csdtn_test;
+----------+
| count(*) |
+----------+
| 18000 |
+----------+
1 row(s).
Response time: 0m 0s 133ms Total time: 0m 0s 133ms
GridSQL -> create table test_csdtn partitioning key col1 on all tablespace tbs_chennai_grid as select col1,case when ltrim(rtrim(col6))='4' then 'ABC' else 'DEF' end ST,sum(case when col4=0 then 0 else col5 end) from csdtn_test where col2='NW' group by col1,case when ltrim(rtrim(col6))='4' then 'ABC' else 'DEF' end;
SQLException: ERROR: Failed To Get Results For ( SQL , NodeURL) : ( SELECT .....................
And in console log the error is:
eQS Node 6 has aborted execution, cause is: com.edb.gridsql.exception.XDBServerException : Can not send data to Nodes
at com.edb.gridsql.queryproc.QueryProcessor.executeQueryStep(Unknown Source)
at com.edb.gridsql.queryproc.QueryProcessor.executeQueryExecPlan(Unknown Source)
at com.edb.gridsql.queryproc.QueryProcessor.execute(Unknown Source)
at com.edb.gridsql.parser.SqlCreateTable.execute(Unknown Source)
at com.edb.gridsql.engine.ExecutableRequest.execute(Unknown Source)
at com.edb.gridsql.engine.XDBSessionContext.executeRequest(Unknown Source)
at com.edb.gridsql.engine.ServerStatement.execute(Unknown Source)
at com.edb.gridsql.engine.ServerStatement.describe(Unknown Source)
at com.edb.gridsql.engine.XDBSessionContext.describeStatement(Unknown Source)
at com.edb.gridsql.protocol.PgProtocolSession.run(Unknown Source)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:650)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:675)
at java.lang.Thread.run(Thread.java:595)
Caused by: com.edb.gridsql.exception.XDBMessageMonitorException: Node 6 has aborted execution, cause is: com.edb.gridsql.exception.XDBServerException : Can not send data to Nodes
at com.edb.gridsql.engine.MessageMonitor.checkMessages(Unknown Source)
at com.edb.gridsql.engine.MultinodeExecutor.executeStep(Unknown Source)
... 13 more
Caused by: com.edb.gridsql.exception.XDBWrappedException: com.edb.gridsql.exception.XDBServerException : Can not send data to Nodes
at com.edb.gridsql.engine.NodeProducerThread.processStep(Unknown Source)
at com.edb.gridsql.engine.NodeProducerThread.run(Unknown Source)
... 1 more
2009-04-09 18:57:38,763 - ERROR Throwing throwable:
com.edb.gridsql.exception.XDBServerException: Can not send data to Nodes
at com.edb.gridsql.engine.ProducerSender.sendToNodes(Unknown Source)
at com.edb.gridsql.engine.NodeProducerThread.processStep(Unknown Source)
at com.edb.gridsql.engine.NodeProducerThread.run(Unknown Source)
at java.lang.Thread.run(Thread.java:595)
Caused by: java.io.IOException: testdbN3@127.0.0.1: loader failed to load data: com.edb.util.PSQLException: SERRORCXX000Mcould not open relation with OID 11004649Fheapam.cL869Rrelation_open
at com.edb.gridsql.engine.loader.EDBWriter.stopEDBCopyThread(Unknown Source)
at com.edb.gridsql.engine.loader.EDBWriter.finish(Unknown Source)
at com.edb.gridsql.engine.loader.DataProcessorThread.finishLoad(Unknown Source)
at com.edb.gridsql.engine.loader.Loader.finishLoad(Unknown Source)
... 4 more
Thanks,
Venkat.
This message was edited 1 time. Last update was at 9 Apr 2009 09:58:51
|
|
|
 |
![[Post New]](/templates/default/images/icon_minipost_new.gif) 9 Apr 2009 15:53:17
|
Mason_S
Senior member
Joined: 1 Apr 2008 09:03:08
Messages: 380
Offline
|
Hi Venkat,
We are having problems reproducing this problem.
Please see this thread:
http://forums.enterprisedb.com/posts/list/615.page
That is consistent with the error you are reporting, especially when you have multiple logical nodes sharing the same database server instance.
For your error, it only happens occasionally?
What would be interesting to do would be to run a different postmaster on a different port for each logical node, to ensure that there are no schema change concurrency issues, and see if the problem disappears.
Mason
|
|
|
 |
|
|