Storm Series (3): Creating a Maven Project to Package and Submit wordcount to a Storm Cluster

In the previous post, we used Storm.Net.Adapter to create wordcount, a Storm topology written in C#. This post covers how to write the Java side of the program and how to deploy it to a test Storm environment.

If you find this helpful, feel free to Star and Fork the project so that more people see it and help improve it.

STEP1: Clone storm-starter, the official Storm example project

 $ git clone https://github.com/apache/storm.git && cd storm/examples/storm-starter

STEP2: Add multilang support for C#:

Build the project completed in the previous post, "Creating Your First Storm Topology with C#", and copy the build output into the /multilang/resources/ folder.
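
Copying the executable there matters because of how multilang components run: ShellSpout and ShellBolt start the program as a child process and exchange JSON messages with it over stdin and stdout, each message followed by a line containing only end. The sketch below illustrates just that framing in plain Java. The names MultilangFraming, frame and readMessage are made up for illustration and are not part of Storm or Storm.Net.Adapter.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;

// Illustrative only: shows how one multilang message is delimited on the wire.
final class MultilangFraming {

    /** Frames one JSON message: the JSON text, then a line containing only "end". */
    static String frame(String json) {
        return json + "\nend\n";
    }

    /** Reads lines up to the "end" marker and returns the JSON text of one message. */
    static String readMessage(BufferedReader in) {
        try {
            StringBuilder sb = new StringBuilder();
            String line;
            while ((line = in.readLine()) != null && !line.equals("end")) {
                if (sb.length() > 0) {
                    sb.append('\n');
                }
                sb.append(line);
            }
            return sb.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // An "emit" message such as a bolt might send back to Storm.
        String emit = "{\"command\": \"emit\", \"tuple\": [\"storm\", 1]}";
        String wire = frame(emit);
        String back = readMessage(new BufferedReader(new StringReader(wire)));
        System.out.println(back.equals(emit)); // prints true
    }
}
```

Storm.Net.Adapter handles this exchange on the C# side, so the topology code only needs to name the command to launch.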

STEP3: Create the topology in Java:

Add WordCountTopologyCsharp.java under /src/jvm/storm/starter/:

/**
 * Licensed to the Apache Software Foundation (ASF) under one
 * or more contributor license agreements.  See the NOTICE file
 * distributed with this work for additional information
 * regarding copyright ownership.  The ASF licenses this file
 * to you under the Apache License, Version 2.0 (the
 * "License"); you may not use this file except in compliance
 * with the License.  You may obtain a copy of the License at
 *
 * http://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */
package storm.starter;

import backtype.storm.Config;
import backtype.storm.LocalCluster;
import backtype.storm.StormSubmitter;
import backtype.storm.spout.ShellSpout;
import backtype.storm.task.ShellBolt;
import backtype.storm.topology.IRichBolt;
import backtype.storm.topology.IRichSpout;
import backtype.storm.topology.OutputFieldsDeclarer;
import backtype.storm.topology.TopologyBuilder;
import backtype.storm.tuple.Fields;

import java.util.Map;

/**
 * This topology demonstrates Storm's stream groupings and multilang capabilities.
 */
public class WordCountTopologyCsharp {
    public static class Generator extends ShellSpout implements IRichSpout {

        public Generator() {
            // Launch the C# executable through cmd; the last argument names the component to run.
            super("cmd", "/k", "CALL", "StormSimple.exe", "generator");
        }

        @Override
        public void declareOutputFields(OutputFieldsDeclarer declarer) {
            declarer.declare(new Fields("word"));
        }

        @Override
        public Map<String, Object> getComponentConfiguration() {
            return null;
        }
    }    
    
    public static class Splitter extends ShellBolt implements IRichBolt {

        public Splitter() {
            super("cmd", "/k", "CALL", "StormSimple.exe", "splitter");
        }

        @Override
        public void declareOutputFields(OutputFieldsDeclarer declarer) {
            declarer.declare(new Fields("word", "count"));
        }

        @Override
        public Map<String, Object> getComponentConfiguration() {
            return null;
        }
    }
    
    public static class Counter extends ShellBolt implements IRichBolt {
        
        public Counter(){
            super("cmd", "/k", "CALL", "StormSimple.exe", "counter");
        }
        
        @Override
        public void declareOutputFields(OutputFieldsDeclarer declarer) {
            declarer.declare(new Fields("word", "count"));
        }

        @Override
        public Map<String, Object> getComponentConfiguration() {
            return null;
        }
    }
    

    public static void main(String[] args) throws Exception {

        TopologyBuilder builder = new TopologyBuilder();

        builder.setSpout("generator", new Generator(), 1);

        builder.setBolt("splitter", new Splitter(), 1).fieldsGrouping("generator",
                new Fields("word"));
        
        builder.setBolt("counter", new Counter(), 1).fieldsGrouping("splitter",
                new Fields("word", "count"));

        Config conf = new Config();
        conf.setDebug(true);

        if (args != null && args.length > 0) {
            // A topology name was passed on the command line: submit to the cluster.
            conf.setNumWorkers(3);

            StormSubmitter.submitTopologyWithProgressBar(args[0], conf,
                    builder.createTopology());
        } else {
            // No arguments: run in an in-process local cluster for 10 seconds, then shut down.
            conf.setMaxTaskParallelism(3);

            LocalCluster cluster = new LocalCluster();
            cluster.submitTopology("WordCount", conf, builder.createTopology());

            Thread.sleep(10000);

            cluster.shutdown();
        }
    }
}
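
The two fieldsGrouping calls decide which task receives each tuple: tuples whose grouping fields hold equal values are always routed to the same task. A minimal sketch of that idea follows, assuming a simple hash-and-modulo scheme. FieldsGroupingSketch and chooseTask are hypothetical names, and Storm's real routing code differs in detail.

```java
import java.util.Arrays;
import java.util.List;

// Illustrative only: not Storm's implementation, just the routing idea.
final class FieldsGroupingSketch {

    /** Picks a target task index from the values of the grouping fields. */
    static int chooseTask(List<Object> groupingValues, int numTasks) {
        // floorMod keeps the index non-negative even when hashCode() is negative.
        return Math.floorMod(groupingValues.hashCode(), numTasks);
    }

    public static void main(String[] args) {
        int numTasks = 3;

        // Equal grouping values always map to the same task.
        int first = chooseTask(Arrays.<Object>asList("storm"), numTasks);
        int second = chooseTask(Arrays.<Object>asList("storm"), numTasks);
        System.out.println(first == second); // prints true

        // Grouping on (word, count): the same word with different counts
        // is hashed separately and may be assigned to different tasks.
        System.out.println(chooseTask(Arrays.<Object>asList("storm", 1), numTasks));
        System.out.println(chooseTask(Arrays.<Object>asList("storm", 2), numTasks));
    }
}
```

With a parallelism hint of 1, as in WordCountTopologyCsharp, every tuple lands on the single task either way. If the counter were scaled out, grouping on both word and count could send the same word to different tasks, so grouping on word alone would be the usual choice for a word count.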

WordCountTopologyCsharp as written runs on Windows with .NET. If you use Mono, or run on another platform through Mono, replace

super("cmd", "/k", "CALL", "StormSimple.exe", "xxxxxx");

with

super("mono", "StormSimple.exe", "xxxxxx");
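
If the same jar has to work in both setups, the command could be chosen at construction time instead of edited by hand. The helper below is one hedged way to do that. ShellCommands and forComponent are hypothetical names, and the check assumes the machine that builds the topology runs the same operating system as the workers, because the component constructors execute on the submitting machine.

```java
import java.util.Arrays;
import java.util.Locale;

// Hypothetical helper: returns the launch command for a given OS name.
final class ShellCommands {

    /**
     * @param osName    value of System.getProperty("os.name")
     * @param component argument passed to StormSimple.exe, e.g. "generator"
     */
    static String[] forComponent(String osName, String component) {
        boolean windows = osName.toLowerCase(Locale.ROOT).startsWith("windows");
        if (windows) {
            // Same command the article uses on Windows with .NET.
            return new String[] {"cmd", "/k", "CALL", "StormSimple.exe", component};
        }
        // Same command the article uses with Mono.
        return new String[] {"mono", "StormSimple.exe", component};
    }

    public static void main(String[] args) {
        // prints [cmd, /k, CALL, StormSimple.exe, generator]
        System.out.println(Arrays.toString(forComponent("Windows 10", "generator")));
        // prints [mono, StormSimple.exe, generator]
        System.out.println(Arrays.toString(forComponent("Linux", "generator")));
    }
}
```

Each constructor would then call, for example, super(ShellCommands.forComponent(System.getProperty("os.name"), "generator")); since ShellSpout and ShellBolt accept the command as a String array.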

STEP4: Build and submit the topology:

  • Install the dependencies Storm needs: $ mvn clean install -DskipTests=true
  • Package the Storm topology with Maven: $ mvn package
  • Set up the runtime environment and submit:

$ storm jar storm-starter-*-jar-with-dependencies.jar storm.starter.WordCountTopologyCsharp wordcount

For setting up a Storm cluster, see the first post in this series, "Setting Up an Environment for Developing Storm Topologies with dotNet".

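Storm series

(1): Setting Up an Environment for Developing Storm Topologies with dotNet

(2): Creating Your First Storm Topology with C# (wordcount)

(3): Creating a Maven Project to Package and Submit wordcount to a Storm Cluster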

posted @ Carey Tzou