Flume常见问题
问题一:
ERROR [PollableSourceRunner-KafkaSource-r2] (org.apache.flume.source.kafka.KafkaSource.doProcess:314) - KafkaSource EXCEPTION, {}
org.apache.kafka.clients.consumer.CommitFailedException: Commit cannot be completed due to group rebalance
这个错误提示比较直白,意思是消费者消费了数据,但在规定时间内没有commit,所以kafka认为这个consumer挂掉了,这时对consumer的group进行再平衡。
增加消费超时时间。
消费超时时间通过heartbeat.interval.ms设置,heartbeat.interval.ms的大小不能超过session.timeout.ms。
session.timeout.ms必须在[group.min.session.timeout.ms, group.max.session.timeout.ms]范围内
问题二:
ERROR [PollableSourceRunner-KafkaSource-r2] (org.apache.flume.source.kafka.KafkaSource.doProcess:314) - KafkaSource EXCEPTION, {}
org.apache.flume.ChannelFullException: The channel has reached it's capacity. This might be the result of a sink on the channel having too low of batch size, a downstream system running slower than normal, or t
hat the channel capacity is just too low. [channel=c3]
问题三:
ERROR [hdfs-k3-call-runner-1] (org.apache.flume.sink.hdfs.AbstractHDFSWriter.hflushOrSync:268) - Error while trying to hflushOrSync!
[SinkRunner-PollingRunner-DefaultSinkProcessor] (org.apache.flume.sink.hdfs.HDFSEventSink.process:443) - HDFS IO error
java.io.IOException: Callable timed out after 10000 ms on file: hdfs://mycluster/test_flume/FlumeData.1597876537632.tmp
问题四:
在file channel堆积数据过多,磁盘爆满,channel中的data数据积压过多
注意相关参数的设置(最好倍数关系):
batchsize <=transactionCapacity<=capacity
问题五:
oom问题解决:
在flume/bin/flume_ng文件中的 JAVA_OPTS="-Xmx20m"
改配置文件flume/conf/flume-env.sh export JAVA_OPTS="-Xms100m -Xmx2000m -Dcom.sun.management.jmxremote" 参数
问题六:
10 十月 2022 16:08:22,668 WARN [hdfs- group - event -call-runner-11] (org.apache.hadoop.hdfs.DFSUtil.getAddressesForNameserviceId:688) - Namenode for babeltimecluster remains unresolved for ID nn1. Check your hdfs-site.xml file to ensure namenodes are configured properly. 10 十月 2022 16:08:22,669 WARN [hdfs- group - event -call-runner-11] (org.apache.hadoop.hdfs.DFSUtil.getAddressesForNameserviceId:688) - Namenode for babeltimecluster remains unresolved for ID nn2. Check your hdfs-site.xml file to ensure namenodes are configured properly. 10 十月 2022 16:08:22,669 ERROR [SinkRunner-PollingRunner-DefaultSinkProcessor] (org.apache.flume.sink.hdfs.HDFSEventSink.process:447) - process failed java.lang.IllegalArgumentException: java.net.UnknownHostException: yz-tpl-hadoop-xxx-xx |
解决方式:把hadoop对应的hdfs-site.xml和core-site.xml 文件copy到 $flume/conf目录下;
对应jar包copy到lib下:
posted on 2021-02-25 22:30 RICH-ATONE 阅读(1581) 评论(5) 编辑 收藏 举报
【推荐】国内首个AI IDE,深度理解中文开发场景,立即下载体验Trae
【推荐】编程新体验,更懂你的AI,立即体验豆包MarsCode编程助手
【推荐】抖音旗下AI助手豆包,你的智能百科全书,全免费不限次数
【推荐】轻量又高性能的 SSH 工具 IShell:AI 加持,快人一步
· AI与.NET技术实操系列:向量存储与相似性搜索在 .NET 中的实现
· 基于Microsoft.Extensions.AI核心库实现RAG应用
· Linux系列:如何用heaptrack跟踪.NET程序的非托管内存泄露
· 开发者必知的日志记录最佳实践
· SQL Server 2025 AI相关能力初探
· winform 绘制太阳,地球,月球 运作规律
· 震惊!C++程序真的从main开始吗?99%的程序员都答错了
· AI与.NET技术实操系列(五):向量存储与相似性搜索在 .NET 中的实现
· 【硬核科普】Trae如何「偷看」你的代码?零基础破解AI编程运行原理
· 超详细:普通电脑也行Windows部署deepseek R1训练数据并当服务器共享给他人