Flink反序列化失败metric

kafka作为source会用KafkaFetcher去拉取数据核心代码：

@Override
    public void runFetchLoop() throws Exception {
        try {
            // kick off the actual Kafka consumer
            consumerThread.start();

            while (running) {
                // this blocks until we get the next records
                // it automatically re-throws exceptions encountered in the consumer thread
                final ConsumerRecords<byte[], byte[]> records = handover.pollNext();

                // get the records for each topic partition
                for (KafkaTopicPartitionState<T, TopicPartition> partition : subscribedPartitionStates()) {

                    List<ConsumerRecord<byte[], byte[]>> partitionRecords =
                        records.records(partition.getKafkaPartitionHandle());

                    partitionConsumerRecordsHandler(partitionRecords, partition);
                }
            }
        }
        finally {
            // this signals the consumer thread that no more work is to be done
            consumerThread.shutdown();
        }

        // on a clean exit, wait for the runner thread
        try {
            consumerThread.join();
        }
        catch (InterruptedException e) {
            // may be the result of a wake-up interruption after an exception.
            // we ignore this here and only restore the interruption state
            Thread.currentThread().interrupt();
        }
    }

可以通过 MetricGroup consumerMetricGroup 添加Counter

private final Counter counter;

//构造方法添加
this.counter = consumerMetricGroup.counter("dirtyDataNum");

//捕获反序列化异常

for (ConsumerRecord<byte[], byte[]> record : partitionRecords) {
   try {
      deserializer.deserialize(record, kafkaCollector);
   } catch (Exception e) {
      this.counter.inc();
      LOG.info("deserialize exception");
   }


   // emit the actual records. this also updates offset state atomically and emits
   // watermarks
   emitRecordsWithTimestamps(
      kafkaCollector.getRecords(),
      partition,
      record.offset(),
      record.timestamp());

   if (kafkaCollector.isEndOfStreamSignalled()) {
      // end of stream signaled
      running = false;
      break;
   }
}

获取 Metrics(每个子任务有一个指标) ：flink_taskmanager_job_task_operator_KafkaConsumer_dirtyDataNum

posted @ 2020-12-23 23:50 楠鸽阅读(392) 评论(0) 编辑收藏举报

刷新页面返回顶部

楠鸽

Flink反序列化失败metric

公告