A problem when executing Impala SQL through JDBC

tangjunliang

Blog category: impala

Tags: impala, hive

   Impala version: 1.1.1
   Hive version: 0.10

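   I recently ran into a problem when executing Impala SQL through JDBC. When an insert overwrite/into table ... select ... statement is executed through JDBC, the call reports success, but the data never appears in the target table. The query list at http://impala-node-hostname:25000/queries shows the statement in the Exception state, so it really did fail.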

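   The cause is a bug in the Hive JDBC driver. For a statement that returns no result set, execute() returns as soon as the statement has been submitted, without waiting for it to finish. If the session is closed while the statement is still running, Impala cancels the running query. The drivers shipped with Hive 0.10 and 0.11 both have this problem; it will probably be fixed in 0.12.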
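The race described above can be illustrated without a cluster. The following is only a toy model under stated assumptions: FakeSession is a hypothetical class, not part of Hive or Impala, and a background thread stands in for the running INSERT. Closing the session cancels whatever is still running, which is the behaviour the paragraph above attributes to Impala.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicBoolean;

/** Toy model of the race. Nothing here is a real Hive or Impala API. */
public class SessionRaceDemo {

  /** Hypothetical stand-in for a server-side session that cancels running work on close. */
  static final class FakeSession {
    private final ExecutorService worker = Executors.newSingleThreadExecutor();
    private final AtomicBoolean rowsWritten = new AtomicBoolean(false);
    private Future<?> running;

    /** Returns immediately, like execute() on a statement that has no result set. */
    void submitInsert(long workMillis) {
      running = worker.submit(() -> {
        try {
          Thread.sleep(workMillis);            // pretend to run the INSERT
          rowsWritten.set(true);
        } catch (InterruptedException e) {
          Thread.currentThread().interrupt();  // cancelled before it could finish
        }
      });
    }

    /** Blocks until the submitted work is done, which is what the fix adds. */
    void waitForCompletion() throws Exception {
      running.get();
    }

    /** Closing the session cancels whatever is still running. */
    void close() throws InterruptedException {
      worker.shutdownNow();
      worker.awaitTermination(5, TimeUnit.SECONDS);
    }

    boolean rowsWritten() {
      return rowsWritten.get();
    }
  }

  /** Runs one simulated insert and reports whether its rows were written. */
  public static boolean runInsert(boolean waitBeforeClose) {
    try {
      FakeSession session = new FakeSession();
      session.submitInsert(200);
      if (waitBeforeClose) {
        session.waitForCompletion();
      }
      session.close();
      return session.rowsWritten();
    } catch (Exception e) {
      throw new IllegalStateException(e);
    }
  }

  public static void main(String[] args) {
    System.out.println("close immediately: rows written = " + runInsert(false));
    System.out.println("wait, then close:  rows written = " + runInsert(true));
  }
}
```

In this model the first call reports that no rows were written, because the session is closed while the work is still in flight, and the second call reports that they were. The real driver and server are of course more involved; the sketch only shows why returning early from execute() is dangerous for statements without a result set.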

   Fix: in the directory /hive/src/jdbc/src/java/org/apache/hive/jdbc, open HiveStatement.java. The class has an execute method, and the SQL we submit is run through it. Its code is as follows:

  public boolean execute(String sql) throws SQLException {
    if (isClosed) {
      throw new SQLException("Can't execute after statement has been closed");
    }

    try {
      closeClientOperation();
      TExecuteStatementReq execReq = new TExecuteStatementReq(sessHandle, sql);
      execReq.setConfOverlay(sessConf);
      TExecuteStatementResp execResp = client.ExecuteStatement(execReq);
      if (execResp.getStatus().getStatusCode().equals(TStatusCode.STILL_EXECUTING_STATUS)) {
        warningChain = Utils.addWarning(warningChain, new SQLWarning("Query execuing asynchronously"));
      } else {
        Utils.verifySuccessWithInfo(execResp.getStatus());
      }
      stmtHandle = execResp.getOperationHandle();
    } catch (SQLException eS) {
      throw eS;
    } catch (Exception ex) {
      throw new SQLException(ex.toString(), "08S01", ex);
    }

    if (!stmtHandle.isHasResultSet()) {
      return false;
    }
    resultSet =  new HiveQueryResultSet.Builder().setClient(client).setSessionHandle(sessHandle)
        .setStmtHandle(stmtHandle).setMaxRows(maxRows).setFetchSize(fetchSize)
        .setScrollable(isScrollableResultset)
        .build();
    return true;
  }

Change this part of the code above:

    if (!stmtHandle.isHasResultSet()) {
      return false;
    }

The modified code is as follows:

  public boolean execute(String sql) throws SQLException {
    if (isClosed) {
      throw new SQLException("Can't execute after statement has been closed");
    }

    try {
      closeClientOperation();
      TExecuteStatementReq execReq = new TExecuteStatementReq(sessHandle, sql);
      execReq.setConfOverlay(sessConf);
      TExecuteStatementResp execResp = client.ExecuteStatement(execReq);
      if (execResp.getStatus().getStatusCode().equals(TStatusCode.STILL_EXECUTING_STATUS)) {
        warningChain = Utils.addWarning(warningChain, new SQLWarning("Query execuing asynchronously"));
      } else {
        Utils.verifySuccessWithInfo(execResp.getStatus());
      }
      stmtHandle = execResp.getOperationHandle();
    } catch (SQLException eS) {
      throw eS;
    } catch (Exception ex) {
      throw new SQLException(ex.toString(), "08S01", ex);
    }

    if (!stmtHandle.isHasResultSet()) {
      // Poll until the query has completed one way or another. DML queries will not return a result
      // set, but we should not return from this method until the query has completed to avoid
      // racing with possible subsequent session shutdown, or queries that depend on the results
      // materialised here.
      TGetOperationStatusReq statusReq = new TGetOperationStatusReq(stmtHandle);
      boolean requestComplete = false;
      while (!requestComplete) {
        try {
          TGetOperationStatusResp statusResp = client.GetOperationStatus(statusReq);
          Utils.verifySuccessWithInfo(statusResp.getStatus());
          if (statusResp.isSetOperationState()) {
            switch (statusResp.getOperationState()) {
            case CLOSED_STATE:
            case FINISHED_STATE:
              return false;
            case CANCELED_STATE:
              // 01000 -> warning
              throw new SQLException("Query was cancelled", "01000");
            case ERROR_STATE:
              // HY000 -> general error
              throw new SQLException("Query failed", "HY000");
            case UKNOWN_STATE:
              throw new SQLException("Unknown query", "HY000");
            case INITIALIZED_STATE:
            case RUNNING_STATE:
              break;
            }
          }
        } catch (SQLException eS) {
          // Rethrow as-is so the SQLState set above is not replaced by 08S01.
          throw eS;
        } catch (Exception ex) {
          throw new SQLException(ex.toString(), "08S01", ex);
        }
        try {
          Thread.sleep(100);
        } catch (InterruptedException ex) {
          // Restore the interrupt flag and stop waiting instead of swallowing the interrupt.
          Thread.currentThread().interrupt();
          throw new SQLException("Interrupted while waiting for query to complete", "HY000", ex);
        }
      }
      return false;
    }
    resultSet =  new HiveQueryResultSet.Builder().setClient(client).setSessionHandle(sessHandle)
        .setStmtHandle(stmtHandle).setMaxRows(maxRows).setFetchSize(fetchSize)
        .setScrollable(isScrollableResultset)
        .build();
    return true;
  }

With this change, execute() waits until the query has finished.
Next, rebuild Hive with ant and replace the driver jar with the newly built one.
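The patched loop waits for as long as the query keeps running. If an upper bound on the wait is wanted, the same polling idea can be written as a small helper with a timeout. This is a minimal sketch under assumptions, not part of the Hive driver: OperationPoller, its State enum and awaitCompletion are hypothetical names, and the status source is passed in as a Supplier so that the example runs without a Thrift client.

```java
import java.sql.SQLException;
import java.sql.SQLTimeoutException;
import java.util.concurrent.TimeUnit;
import java.util.function.Supplier;

/** Hypothetical polling helper. It only mirrors the shape of the patched loop. */
public class OperationPoller {

  /** Simplified stand-in for the operation states checked in the patch. */
  public enum State { INITIALIZED, RUNNING, FINISHED, CLOSED, CANCELED, ERROR, UNKNOWN }

  /**
   * Polls the status source until the operation reaches a terminal state.
   * Returns normally on FINISHED or CLOSED, throws SQLException otherwise.
   */
  public static void awaitCompletion(Supplier<State> status, long intervalMillis,
      long timeoutMillis) throws SQLException {
    long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(timeoutMillis);
    while (true) {
      switch (status.get()) {
      case FINISHED:
      case CLOSED:
        return;
      case CANCELED:
        throw new SQLException("Query was cancelled", "01000");
      case ERROR:
        throw new SQLException("Query failed", "HY000");
      case UNKNOWN:
        throw new SQLException("Unknown query", "HY000");
      default:
        break; // INITIALIZED or RUNNING: keep waiting
      }
      if (System.nanoTime() - deadline >= 0) {
        throw new SQLTimeoutException("Query did not complete within " + timeoutMillis + " ms");
      }
      try {
        Thread.sleep(intervalMillis);
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
        throw new SQLException("Interrupted while waiting for query", "HY000", e);
      }
    }
  }

  public static void main(String[] args) throws SQLException {
    // Simulated operation that reports RUNNING for roughly 300 ms, then FINISHED.
    long start = System.nanoTime();
    Supplier<State> fake = () ->
        System.nanoTime() - start < TimeUnit.MILLISECONDS.toNanos(300)
            ? State.RUNNING : State.FINISHED;
    awaitCompletion(fake, 100, 5000);
    System.out.println("operation finished");
  }
}
```

In a patched driver the supplier would presumably wrap the GetOperationStatus call shown earlier and map its result onto this enum. Whether a timeout is appropriate depends on how long the insert statements are expected to run.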

Hopefully Hive will fix this problem in version 0.12.


2013-09-11 11:34
Category: open-source software