【甘道夫】Matrix Multiplication with MapReduce -- Implementation Code
I previously wrote an article analyzing the MapReduce matrix multiplication algorithm.
To give you a more concrete picture of how the program actually runs, this post provides the implementation code for reference.
Programming environment:
java version "1.7.0_40"
Eclipse Kepler
Windows7 x64
Ubuntu 12.04 LTS
Hadoop2.2.0
Vmware 9.0.0 build-812388
Input data:
Matrix A location: hdfs://singlehadoop:8020/workspace/dataguru/hadoopdev/week09/matrixmultiply/matrixA/matrixa
Matrix A contents:
3 4 6
4 0 8
The matrixa file has already been preprocessed into (x, y, value) format:
0 0 3
0 1 4
0 2 6
1 0 4
1 1 0
1 2 8
Matrix B location: hdfs://singlehadoop:8020/workspace/dataguru/hadoopdev/week09/matrixmultiply/matrixB/matrixb
Matrix B contents:
2 3
3 0
4 1
The matrixb file has already been preprocessed into (x, y, value) format:
0 0 2
0 1 3
1 0 3
1 1 0
2 0 4
2 1 1
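For reference, working the product out by hand gives the 2x2 result matrix C = A x B:
C[0][0] = 3*2 + 4*3 + 6*4 = 42    C[0][1] = 3*3 + 4*0 + 6*1 = 15
C[1][0] = 4*2 + 0*3 + 8*4 = 40    C[1][1] = 4*3 + 0*0 + 8*1 = 20
This is what the job is expected to write to the matrixC output directory.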
Implementation code:
There are three classes in total:
- Driver class: MMDriver
- Map class: MMMapper
- Reduce class: MMReducer
Feel free to merge them into a single class if that suits your personal style.
package dataguru.matrixmultiply;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class MMDriver {

    public static void main(String[] args) throws Exception {

        // set configuration
        Configuration conf = new Configuration();

        // create job
        Job job = Job.getInstance(conf, "MatrixMultiply");
        job.setJarByClass(dataguru.matrixmultiply.MMDriver.class);

        // specify Mapper & Reducer
        job.setMapperClass(dataguru.matrixmultiply.MMMapper.class);
        job.setReducerClass(dataguru.matrixmultiply.MMReducer.class);

        // specify output types of mapper and reducer
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(Text.class);
        job.setMapOutputKeyClass(Text.class);
        job.setMapOutputValueClass(Text.class);

        // specify input and output DIRECTORIES
        Path inPathA = new Path("hdfs://singlehadoop:8020/workspace/dataguru/hadoopdev/week09/matrixmultiply/matrixA");
        Path inPathB = new Path("hdfs://singlehadoop:8020/workspace/dataguru/hadoopdev/week09/matrixmultiply/matrixB");
        Path outPath = new Path("hdfs://singlehadoop:8020/workspace/dataguru/hadoopdev/week09/matrixmultiply/matrixC");
        FileInputFormat.addInputPath(job, inPathA);
        FileInputFormat.addInputPath(job, inPathB);
        FileOutputFormat.setOutputPath(job, outPath);

        // delete the output directory if it already exists,
        // otherwise the job will refuse to start
        try {
            FileSystem hdfs = outPath.getFileSystem(conf);
            if (hdfs.exists(outPath)) {
                hdfs.delete(outPath, true);  // recursive delete
            }
            hdfs.close();
        } catch (Exception e) {
            e.printStackTrace();
            return;
        }

        // run the job
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
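After packaging the three classes into a jar (the jar name below is only illustrative), the job can be submitted from the command line with something like:
hadoop jar matrixmultiply.jar dataguru.matrixmultiply.MMDriver
No arguments are needed because the input and output paths are hard-coded in the driver.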
package dataguru.matrixmultiply;

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.input.FileSplit;

public class MMMapper extends Mapper<Object, Text, Text, Text> {
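
    // A minimal sketch of the map() method, written to match the value format the
    // Reducer below expects ("a,k,value" / "b,k,value" under an "i,j" cell key).
    // ROWS_A and COLS_B are assumptions taken from the example matrices above
    // (A is 2x3, B is 3x2); in a general job they would come from the configuration.
    private static final int ROWS_A = 2;  // rows of A = rows of the result C
    private static final int COLS_B = 2;  // columns of B = columns of the result C

    @Override
    protected void map(Object key, Text value, Context context)
            throws IOException, InterruptedException {
        // the file name of the current split tells us which matrix this record belongs to
        String fileName = ((FileSplit) context.getInputSplit()).getPath().getName();

        // each input line is "row col value"
        StringTokenizer str = new StringTokenizer(value.toString());
        int row = Integer.parseInt(str.nextToken());
        int col = Integer.parseInt(str.nextToken());
        String cellValue = str.nextToken();

        if (fileName.startsWith("matrixa")) {
            // A[row][col] is needed by every result cell C[row][j]; tag it with its inner index col
            for (int j = 0; j < COLS_B; j++) {
                context.write(new Text(row + "," + j), new Text("a," + col + "," + cellValue));
            }
        } else {
            // B[row][col] is needed by every result cell C[i][col]; tag it with its inner index row
            for (int i = 0; i < ROWS_A; i++) {
                context.write(new Text(i + "," + col), new Text("b," + row + "," + cellValue));
            }
        }
    }
}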
package dataguru.matrixmultiply;

import java.io.IOException;
import java.util.HashMap;
import java.util.Map;
import java.util.StringTokenizer;

import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;

public class MMReducer extends Reducer<Text, Text, Text, Text> {

    @Override
    public void reduce(Text key, Iterable<Text> values, Context context)
            throws IOException, InterruptedException {

        // collect the A entries and B entries for this result cell,
        // each keyed by the shared inner (summation) index
        Map<String, String> matrixa = new HashMap<String, String>();
        Map<String, String> matrixb = new HashMap<String, String>();

        for (Text val : values) {  // values example: "b,0,2" or "a,0,4"
            StringTokenizer str = new StringTokenizer(val.toString(), ",");
            String sourceMatrix = str.nextToken();
            if ("a".equals(sourceMatrix)) {
                matrixa.put(str.nextToken(), str.nextToken());  // (0, 4)
            }
            if ("b".equals(sourceMatrix)) {
                matrixb.put(str.nextToken(), str.nextToken());  // (0, 2)
            }
        }

        // dot product over the inner index
        int result = 0;
        for (String mapkey : matrixa.keySet()) {
            result += Integer.parseInt(matrixa.get(mapkey)) * Integer.parseInt(matrixb.get(mapkey));
        }
        context.write(key, new Text(String.valueOf(result)));
    }
}
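As a concrete example of what reaches the reduce stage: with the mapper sketch above, the reducer is called for the cell key "0,0" with the values {"a,0,3", "a,1,4", "a,2,6", "b,0,2", "b,1,3", "b,2,4"}, from which it computes 3*2 + 4*3 + 6*4 = 42, the first line of the final output below.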
Final output (contents of matrixC):
0,0 42
0,1 15
1,0 40
1,1 20