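Storm Series (3): Creating a Maven Project to Package and Submit wordcount to a Storm Cluster
In the previous post, we used Storm.Net.Adapter to create wordcount, a Storm topology written in C#. This post covers how to write the Java side of the program and how to deploy it to a test Storm environment.
If you find this helpful, you are welcome to Star and Fork the project so that more people see it and can help improve it.
STEP1: Clone storm-starter, the official Storm example project: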
$ git clone git://github.com/apache/storm.git && cd storm/examples/storm-starter
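STEP2: Add multi-language support for C#:
Build the project completed in the previous post, "Create Your First Storm Topology with C#", and copy the generated binaries into the /multilang/resources/ folder.
STEP3: Create the topology in Java:
In /src/jvm/storm/starter/, add WordCountTopologyCsharp.java: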
/**
 * Licensed to the Apache Software Foundation (ASF) under one
 * or more contributor license agreements. See the NOTICE file
 * distributed with this work for additional information
 * regarding copyright ownership. The ASF licenses this file
 * to you under the Apache License, Version 2.0 (the
 * "License"); you may not use this file except in compliance
 * with the License. You may obtain a copy of the License at
 *
 * http://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */
package storm.starter;

import backtype.storm.Config;
import backtype.storm.LocalCluster;
import backtype.storm.StormSubmitter;
import backtype.storm.spout.ShellSpout;
import backtype.storm.task.ShellBolt;
import backtype.storm.topology.IRichBolt;
import backtype.storm.topology.IRichSpout;
import backtype.storm.topology.OutputFieldsDeclarer;
import backtype.storm.topology.TopologyBuilder;
import backtype.storm.tuple.Fields;

import java.util.Map;

/**
 * This topology demonstrates Storm's stream groupings and multilang capabilities.
 */
public class WordCountTopologyCsharp {

    public static class Generator extends ShellSpout implements IRichSpout {
        public Generator() {
            super("cmd", "/k", "CALL", "StormSimple.exe", "generator");
        }

        @Override
        public void declareOutputFields(OutputFieldsDeclarer declarer) {
            declarer.declare(new Fields("word"));
        }

        @Override
        public Map<String, Object> getComponentConfiguration() {
            return null;
        }
    }

    public static class Splitter extends ShellBolt implements IRichBolt {
        public Splitter() {
            super("cmd", "/k", "CALL", "StormSimple.exe", "splitter");
        }

        @Override
        public void declareOutputFields(OutputFieldsDeclarer declarer) {
            declarer.declare(new Fields("word", "count"));
        }

        @Override
        public Map<String, Object> getComponentConfiguration() {
            return null;
        }
    }

    public static class Counter extends ShellBolt implements IRichBolt {
        public Counter() {
            super("cmd", "/k", "CALL", "StormSimple.exe", "counter");
        }

        @Override
        public void declareOutputFields(OutputFieldsDeclarer declarer) {
            declarer.declare(new Fields("word", "count"));
        }

        @Override
        public Map<String, Object> getComponentConfiguration() {
            return null;
        }
    }

    public static void main(String[] args) throws Exception {
        TopologyBuilder builder = new TopologyBuilder();
        builder.setSpout("generator", new Generator(), 1);
        builder.setBolt("splitter", new Splitter(), 1).fieldsGrouping("generator",
                new Fields("word"));
        builder.setBolt("counter", new Counter(), 1).fieldsGrouping("splitter",
                new Fields("word", "count"));

        Config conf = new Config();
        conf.setDebug(true);

        if (args != null && args.length > 0) {
            conf.setNumWorkers(3);
            StormSubmitter.submitTopologyWithProgressBar(args[0], conf,
                    builder.createTopology());
        } else {
            conf.setMaxTaskParallelism(3);
            LocalCluster cluster = new LocalCluster();
            cluster.submitTopology("WordCount", conf, builder.createTopology());
            Thread.sleep(10000);
            cluster.shutdown();
        }
    }
}
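The topology wires its bolts together with fieldsGrouping, which sends tuples that share the same values in the chosen fields to the same task. The sketch below is only a conceptual illustration of that idea and is not Storm's actual implementation: the class FieldsGroupingSketch and the method chooseTask are hypothetical names, and hashing the list of grouping values modulo the task count is an assumption made for the example.

```java
import java.util.Arrays;
import java.util.List;

// Conceptual sketch only: NOT Storm's real partitioning code.
// FieldsGroupingSketch and chooseTask are hypothetical names.
final class FieldsGroupingSketch {

    // Map the values of the grouping fields to a task index in [0, numTasks).
    // Equal value lists always produce the same index.
    static int chooseTask(List<Object> groupingValues, int numTasks) {
        return Math.floorMod(groupingValues.hashCode(), numTasks);
    }

    public static void main(String[] args) {
        int numTasks = 4; // assumed parallelism, for illustration only
        for (String word : new String[] {"storm", "csharp", "storm"}) {
            int task = chooseTask(Arrays.<Object>asList(word), numTasks);
            System.out.println(word + " -> task " + task);
        }
    }
}
```

In the topology above every component has a parallelism hint of 1, so each tuple has only one possible destination; the grouping starts to matter once the hint is raised.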
This example runs on Windows with .NET. If you use Mono, or run on another platform through Mono, replace
super("cmd", "/k", "CALL", "StormSimple.exe", "xxxxxx");
with
super("mono", "StormSimple.exe", "xxxxxx");
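Instead of editing each constructor by hand, the two command lines could be selected in one place. The following is a minimal sketch under stated assumptions: ShellCommandChooser and shellCommand are hypothetical names that do not appear in storm-starter, and detecting Windows through the os.name system property is an assumption of this example.

```java
import java.util.Arrays;
import java.util.Locale;

// Hypothetical helper, not part of storm-starter or Storm.Net.Adapter.
final class ShellCommandChooser {

    // Build the command line for one multilang component.
    // Assumption: an os.name containing "windows" means .NET via cmd;
    // every other platform is assumed to run the executable through Mono.
    static String[] shellCommand(String osName, String component) {
        boolean windows = osName.toLowerCase(Locale.ROOT).contains("windows");
        if (windows) {
            return new String[] {"cmd", "/k", "CALL", "StormSimple.exe", component};
        }
        return new String[] {"mono", "StormSimple.exe", component};
    }

    public static void main(String[] args) {
        String osName = System.getProperty("os.name");
        System.out.println(Arrays.toString(shellCommand(osName, "generator")));
    }
}
```

A constructor could then call super(ShellCommandChooser.shellCommand(System.getProperty("os.name"), "generator")). Keep in mind that this check runs wherever the constructor runs, which is normally the machine that submits the topology, so it only gives the right answer when that machine and the worker nodes use the same platform.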
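STEP4: Build and submit the topology:
- Install the dependencies Storm needs:
$ mvn clean install -DskipTests=true
- Package the Storm topology with Maven:
$ mvn package
- Set up the runtime environment and submit:
$ storm jar storm-starter-*-jar-with-dependencies.jar storm.starter.WordCountTopologyCsharp wordcount
For setting up a Storm cluster, see the first post in this series, "Setting Up an Environment for Developing Storm Topologies with dotNet".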
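The trailing wordcount in the storm jar command is passed to main as args[0], where it becomes the topology name and switches the program to cluster submission; with no argument, main falls back to a LocalCluster run under the name "WordCount". The sketch below mirrors only that branching so it can be run without Storm on the classpath. RunModeSketch and describe are hypothetical names used for illustration.

```java
// Hypothetical helper mirroring the if/else in WordCountTopologyCsharp.main.
// It does not call Storm; it only reports which branch would be taken.
final class RunModeSketch {

    static String describe(String[] args) {
        if (args != null && args.length > 0) {
            // Cluster mode: the first argument is used as the topology name.
            return "cluster:" + args[0];
        }
        // Local mode: the topology name is hard-coded in main.
        return "local:WordCount";
    }

    public static void main(String[] args) {
        System.out.println(describe(args));
    }
}
```

Running the topology class from an IDE with no arguments therefore gives a short local test run, while the storm jar command shown above submits to the cluster.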