MapReduce:通过数据具有爷孙关系的结果-快上网网站建设公司

MapReduce:通过数据具有爷孙关系的结果

1)启动环境

成都创新互联专注于孟村企业网站建设,成都响应式网站建设公司,商城网站建设。孟村网站建设公司,为孟村等地区提供建站服务。全流程按需网站建设，专业设计，全程项目跟踪，成都创新互联专业和态度为您提供的服务

start-all.sh

2)产看状态

jps

0613 NameNode

10733 DataNode

3455 NodeManager

15423 Jps

11082 ResourceManager

10913 SecondaryNameNode

3)利用Eclipse编写jar

1.编写 MapCal类

package com.mp;

import java.io.IOException;

import org.apache.hadoop.io.LongWritable;

import org.apache.hadoop.io.Text;

import org.apache.hadoop.mapreduce.Mapper;

public class MapCal extends Mapper {

@Override

protected void map(LongWritable lon, Text value, Context context)

throws IOException, InterruptedException {

String line = value.toString();

String[] peps = line.split("-");

// 键值对

context.write(new Text(peps[0]), new Text("s" + peps[1]));

context.write(new Text(peps[1]), new Text("g" + peps[0]));

}

2.编写ReduceCal类

public class ReduceCal extends Reducer {

@Override

protected void reduce(Text arg0, Iterable arg1, Context context)

throws IOException, InterruptedException {

ArrayList grands = new ArrayList();

ArrayList sons = new ArrayList();

// 把这些值写入集合

for (Text text : arg1) {

String str = text.toString();

if (str.startsWith("g")) {

grands.add(text);

} else {

sons.add(text);

}

// 输出

for (int i = 0; i < sons.size(); i++) {

for (int j = 0; j < grands.size(); j++) {

context.write(grands.get(i), sons.get(j));

}

3. 编写Jobrun类

public class RunJob {

// 全限定名

public static void main(String[] args) {

Configuration conf = new Configuration();

// 本地多线程模拟执行。

// conf.set("fs.defaultFS", "hdfs://node3:8020");

// conf.set("mapred.jar", "C:\\Users\\Administrator\\Desktop\\wc.jar");

try {

FileSystem fs = FileSystem.get(conf);

Job job = Job.getInstance(conf);

job.setJobName("wc");

job.setJarByClass(RunJob.class);

job.setMapperClass(WordCountMapper.class);

job.setReducerClass(WordCountReduce.class);

job.setMapOutputKeyClass(Text.class);

job.setMapOutputValueClass(IntWritable.class);

// job 输入数据和输出数据的目录

FileInputFormat.addInputPath(job, new Path("/word.txt"));

Path outPath = new Path("/output/wc2");// job执行结果存放的目录。该目录在执行前不能存在。

if (fs.exists(outPath)) {

fs.delete(outPath, true);

}

FileOutputFormat.setOutputPath(job, outPath);

boolean f = job.waitForCompletion(true);

if (f) {

System.out.println("任务执行成功！");

}

} catch (Exception e) {

e.printStackTrace();

}

4)导出jar包.

MapReduce :通过数据具有爷孙关系的结果

5)通过ftp上传jar到linux目录

6)运行jar包

hadoop jar shuju.jar com.mc.RunJob / /outg

7)如果map和reduce都100%

Shuffle Errors

BAD_ID=0

CONNECTION=0

IO_ERROR=0

WRONG_LENGTH=0

WRONG_MAP=0

WRONG_REDUCE=0

File Input Format Counters

Bytes Read=45

File Output Format Counters

Bytes Written=18

表示运行成功!!

8)产看结果

hadoop fs -tail /outg/part-r-00000

文章名称：MapReduce:通过数据具有爷孙关系的结果
转载来源：http://www.cdkjz.cn/article/pphohj.html

多年建站经验

多一份参考，总有益处

联系快上网，免费获得专属《策划方案》及报价

咨询相关问题或预约面谈，可以通过以下方式与我们联系

网站建设

网站推广

案例

方案

电商网站开发

微信小程序

我们

联系

精准传达 • 有效沟通

查看其它板块

MapReduce:通过数据具有爷孙关系的结果

多一份参考，总有益处

联系快上网，免费获得专属《策划方案》及报价

大客户专线成都：13518219792 座机：028-86922220

友情链接交换友情链接

网络推广

Network promotion

网站方案

Solution

电商网站开发

E-commerce & System

我们

About Us

联系

Contact Us

精准传达 • 有效沟通

查看其它板块

MapReduce:通过数据具有爷孙关系的结果

相关资讯

织梦发布文章内页不显示图片

网站策划内容有哪些，网站策划书包括哪些内容

云南百度建站官方,百度建站云南服务中心

dns域名系统认识,域名:dns是互联网一项服务的缩写

限制域名,域名被运营商限制

mac网络dns怎么设置,网络DNs设置指的?是什么

二次开发是什么意思，UG的二次开发是什么意思

用ps如何快速制作网页,快速制作网页的软件

多一份参考，总有益处

联系快上网，免费获得专属《策划方案》及报价

大客户专线 成都：13518219792 座机：028-86922220

友情链接 交换友情链接

大客户专线成都：13518219792 座机：028-86922220

友情链接交换友情链接