package com.bfd.util;

import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.KeyValue;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.io.ImmutableBytesWritable;
import org.apache.hadoop.hbase.mapreduce.HFileOutputFormat;
import org.apache.hadoop.hbase.util.Bytes;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;

public class CopyOfGidAddCartTemp {

    public static final String TABLE_NAME = "_AddCart_TEMP";
    public static final String COLUMN_FAMILY = "ci";

    private static Configuration conf = null;

    static {
        conf = HBaseConfiguration.create();
        conf.set("hbase.zookeeper.quorum", Const.ZOOKEEPER_QUORAM);
        conf.set("zookeeper.znode.parent", Const.ZOOKEEPER_ZNODE_PARENT);
    }

    static class Mapper extends
            org.apache.hadoop.mapreduce.Mapper<LongWritable, Text, ImmutableBytesWritable, LongWritable> {

        private ImmutableBytesWritable outKey = new ImmutableBytesWritable();
        private LongWritable outValue = new LongWritable();

        @Override
        protected void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            // The original stub wrote empty ImmutableBytesWritable/LongWritable
            // objects, which fails at serialization time. The input line layout is
            // not shown in the source, so as an assumption the whole line is used
            // as the row key with a count of 1 as the value.
            outKey.set(Bytes.toBytes(value.toString()));
            outValue.set(1L);
            context.write(outKey, outValue);
        }
    }

    static class Reducer extends
            org.apache.hadoop.mapreduce.Reducer<ImmutableBytesWritable, LongWritable, ImmutableBytesWritable, KeyValue> {

        @Override
        public void reduce(ImmutableBytesWritable key, Iterable<LongWritable> values, Context context)
                throws IOException, InterruptedException {
            // Sum the counts for one row key and emit a single KeyValue in the "ci"
            // column family. The qualifier name "count" is an assumption; the
            // original stub emitted an empty KeyValue, which cannot be written as
            // an HFile cell.
            long sum = 0L;
            for (LongWritable v : values) {
                sum += v.get();
            }
            context.write(key, new KeyValue(key.get(), Bytes.toBytes(COLUMN_FAMILY),
                    Bytes.toBytes("count"), Bytes.toBytes(sum)));
        }
    }

    public static void main(String[] args)
            throws IOException, InterruptedException, ClassNotFoundException {
        Configuration conf = new Configuration();
        Job job = new Job(conf, "_AddCart_TEMP");
        job.setJarByClass(CopyOfGidAddCartTemp.class);

        job.setMapOutputKeyClass(ImmutableBytesWritable.class);
        job.setMapOutputValueClass(LongWritable.class);
        job.setOutputKeyClass(ImmutableBytesWritable.class);
        job.setOutputValueClass(KeyValue.class);

        job.setMapperClass(com.bfd.util.CopyOfGidAddCartTemp.Mapper.class);
        job.setReducerClass(com.bfd.util.CopyOfGidAddCartTemp.Reducer.class);
        job.setInputFormatClass(TextInputFormat.class);
        job.setOutputFormatClass(HFileOutputFormat.class);
        job.setNumReduceTasks(4);

        /* run locally */
        // ((JobConf) job.getConfiguration()).setJar(jarFile.toString());

        TextInputFormat.setInputPaths(job, Const.HDFS_BASE_INPUT + "/l_date=" + args[0] + "/*");
        HFileOutputFormat.setOutputPath(job, new Path(Const.HDFS_BASE_OUTPUT + "/addcart"));

        Configuration HBASE_CONFIG = new Configuration();
        HBASE_CONFIG.set("hbase.zookeeper.quorum", Const.ZOOKEEPER_QUORAM);
        HBASE_CONFIG.set("zookeeper.znode.parent", Const.ZOOKEEPER_ZNODE_PARENT);
        HBASE_CONFIG.set("date2", args[0]);
        Configuration cfg = HBaseConfiguration.create(HBASE_CONFIG);

        // configureIncrementalLoad sets up total-order partitioning so the generated
        // HFiles line up with the existing region boundaries of the target table.
        HTable htable = new HTable(cfg, TABLE_NAME);
        HFileOutputFormat.configureIncrementalLoad(job, htable);

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
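
The job above only writes HFiles under Const.HDFS_BASE_OUTPUT + "/addcart"; a separate bulk-load step is still needed to move them into the table. A minimal sketch, assuming the same HBase-era API as the code above; the class name AddCartBulkLoad is hypothetical and not part of the original source:

package com.bfd.util;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.mapreduce.LoadIncrementalHFiles;

// Hypothetical follow-up step: move the generated HFiles into the live table.
// The same thing can be done from the shell with:
//   hbase org.apache.hadoop.hbase.mapreduce.LoadIncrementalHFiles <hfile dir> <table>
public class AddCartBulkLoad {
    public static void main(String[] args) throws Exception {
        Configuration cfg = HBaseConfiguration.create();
        cfg.set("hbase.zookeeper.quorum", Const.ZOOKEEPER_QUORAM);
        cfg.set("zookeeper.znode.parent", Const.ZOOKEEPER_ZNODE_PARENT);

        HTable table = new HTable(cfg, CopyOfGidAddCartTemp.TABLE_NAME);
        // Directory written by HFileOutputFormat in the MapReduce job above.
        new LoadIncrementalHFiles(cfg).doBulkLoad(
                new Path(Const.HDFS_BASE_OUTPUT + "/addcart"), table);
        table.close();
    }
}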