package kpi;

import java.io.IOException;
import java.net.URI;
import java.net.URISyntaxException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.ArrayWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.io.Writable;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class ArrayWritableTest {

    public static void main(String[] args) throws IOException,
            ClassNotFoundException, InterruptedException, URISyntaxException {
        Configuration conf = new Configuration();

        // Remove the output directory if it already exists, otherwise the job fails on startup.
        FileSystem fileSystem = FileSystem.get(new URI("hdfs://hadoop:9000/"), conf);
        fileSystem.delete(new Path("/kpi__data_out_1"), true);

        Job job = new Job(conf, ArrayWritableTest.class.getName());
        job.setJarByClass(ArrayWritableTest.class);

        FileInputFormat.setInputPaths(job, new Path("hdfs://hadoop:9000/kpi_data"));
        FileOutputFormat.setOutputPath(job, new Path("hdfs://hadoop:9000/kpi__data_out_1"));

        job.setMapperClass(MyMapper.class);
        job.setReducerClass(MyReducer.class);

        job.setMapOutputKeyClass(Text.class);
        job.setMapOutputValueClass(LongArrayWritable.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(NullWritable.class);

        job.waitForCompletion(true);
    }

    static class MyMapper extends Mapper<LongWritable, Text, Text, LongArrayWritable> {
        private final Text key2 = new Text();

        @Override
        protected void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            // Each input line is tab separated; field 1 holds the phone number,
            // fields 6-9 hold the four traffic counters.
            String[] split = value.toString().split("\t");
            key2.set(split[1]);

            String[] traffic = new String[4];
            traffic[0] = split[6];
            traffic[1] = split[7];
            traffic[2] = split[8];
            traffic[3] = split[9];

            LongArrayWritable arrayWritable = new LongArrayWritable(traffic);
            context.write(key2, arrayWritable);
        }
    }

    static class MyReducer extends Reducer<Text, LongArrayWritable, Text, NullWritable> {
        private final Text key3 = new Text();

        @Override
        protected void reduce(Text key2, Iterable<LongArrayWritable> val2s, Context context)
                throws IOException, InterruptedException {
            long sum1 = 0;
            long sum2 = 0;
            long sum3 = 0;
            long sum4 = 0;

            // Accumulate the four traffic counters for this phone number.
            for (LongArrayWritable traffic : val2s) {
                Writable[] writables = traffic.get();
                sum1 += Long.parseLong(writables[0].toString());
                sum2 += Long.parseLong(writables[1].toString());
                sum3 += Long.parseLong(writables[2].toString());
                sum4 += Long.parseLong(writables[3].toString());
            }

            key3.set(key2 + " " + sum1 + " " + sum2 + " " + sum3 + " " + sum4);
            context.write(key3, NullWritable.get());
        }
    }

    // ArrayWritable has no no-arg constructor, but Hadoop instantiates value classes by
    // reflection during deserialization, so it must be subclassed with the element type
    // fixed to LongWritable.
    static class LongArrayWritable extends ArrayWritable {
        public LongArrayWritable() {
            super(LongWritable.class);
        }

        public LongArrayWritable(String[] strings) {
            super(LongWritable.class);
            LongWritable[] longs = new LongWritable[strings.length];
            for (int i = 0; i < longs.length; i++) {
                longs[i] = new LongWritable(Long.parseLong(strings[i]));
            }
            set(longs);
        }
    }
}
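A quick note on running this (a sketch, not verified against any particular cluster): the mapper assumes tab-separated log lines with the phone number at field index 1 and the four traffic counters at field indexes 6 through 9, and the reducer emits one line per phone number with the four summed totals. Assuming the class is packaged into a jar named kpi.jar (a hypothetical name) and the NameNode is reachable at hadoop:9000 as hard-coded above, the job could be submitted with:

hadoop jar kpi.jar kpi.ArrayWritableTest

The results then land in /kpi__data_out_1 on HDFS and can be inspected with hadoop fs -cat /kpi__data_out_1/part-r-00000.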