Building a Campus Recruitment FAQ Bot with ResponseSelector

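  This post shows how to use Rasa's ResponseSelector to build a campus recruitment FAQ bot that answers frequently asked questions about the interview process and about checking interview results. The bot's functionality falls into two categories: business-independent features (here, greeting and saying goodbye) and business-related features (the FAQ answers).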

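I. The data/nlu.yml file
  Unlike ordinary intents, the intents in ResponseSelector training data use the group/intent format (retrieval intents). For example, an ordinary intent is written intent: greet, whereas a retrieval intent is written intent: faq/notes: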

version: "3.1"
nlu:
  - intent: goodbye
    examples: |
      - 拜拜
      - 再见
      - 拜
      - 退出
      - 结束
  - intent: greet
    examples: |
      - 你好
      - 您好
      - hello
      - hi
      - 喂
      - 在么
  - intent: faq/notes
    examples: |
      - 应聘ACME校园招聘职位的注意事项?
  - intent: faq/work_location
    examples: |
      - 校园招聘录取的应届生主要工作地点在哪里?
  - intent: faq/max_job_request
    examples: |
      - 最多申请几个职位?
  - intent: faq/audit
    examples: |
      - 各阶段审核说明
  - intent: faq/write_exam_participate
    examples: |
      - 怎样参加笔试?
  - intent: faq/write_exam_location
    examples: |
      - 笔试考试地点如何安排?
  - intent: faq/write_exam_again
    examples: |
      - 笔试只安排一次吗?我笔试当天没有参加,是否还有再次笔试的机会?
  - intent: faq/write_exam_with-out-offer
    examples: |
      - 如果我没有收到笔试通知,但我很想进入ACME,能否直接进入考场参加考试?
  - intent: faq/interview_arrangement
    examples: |
      - 面试什么时候开始?会提前多少天通知面试安排?
  - intent: faq/interview_times
    examples: |
      - 一般会安排几次面试?
  - intent: faq/interview_from
    examples: |
      - 面试的形式是怎样的?是单独面试还是小组面试?
  - intent: faq/interview_clothing
    examples: |
      - 对面试的服装有什么具体的要求?
  - intent: faq/interview_paperwork
    examples: |
      - 面试时需要携带什么资料?
  - intent: faq/interview_result
    examples: |
      - 如何查询面试结果?
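
The group/intent naming can be illustrated with a small Python sketch. It is not part of Rasa; the helper names split_retrieval_intent and domain_intents are hypothetical, and the intent list is a shortened copy of the one above. It shows why domain.yml in this project only needs to list the group name faq rather than every faq/... sub-intent.

```python
from typing import List, Optional, Tuple


def split_retrieval_intent(name: str) -> Tuple[str, Optional[str]]:
    """Split 'faq/notes' into ('faq', 'notes'); plain intents yield (name, None)."""
    group, sep, key = name.partition("/")
    return (group, key) if sep else (name, None)


def domain_intents(nlu_intents: List[str]) -> List[str]:
    """Collapse retrieval intents to their group, keeping first-seen order."""
    seen = {}
    for name in nlu_intents:
        group, _ = split_retrieval_intent(name)
        seen.setdefault(group, None)
    return list(seen)


# Shortened, illustrative copy of the intents declared in data/nlu.yml.
nlu_intents = ["goodbye", "greet", "faq/notes", "faq/work_location", "faq/interview_result"]

print(split_retrieval_intent("faq/notes"))  # ('faq', 'notes')
print(domain_intents(nlu_intents))          # ['goodbye', 'greet', 'faq']
```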

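II. The data/responses.yml file
  This file defines the response for each retrieval intent. The response name is utter_ followed by the intent name; for example, the response utter_faq/notes corresponds to the intent faq/notes: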

version: "3.1"
responses:
  utter_faq/notes:
    - text: 1、登在校园招聘板块内的职位信息才适用于应届毕业生招聘,请所有的应届毕业生去校园招聘的版块寻找您感兴趣的职位。2、列出的每个职位的要求是该职位的最低要求,为了保证您应聘的成功率,希望您严格按照职位的要求考虑您的选择。3、提交成功后,在招聘结束前,您将不能修改或再次提交简历,因此,请于仔细确认填写信息后提交简历。
  utter_faq/work_location:
    - text: 招聘信息中包含各职位的工作地点内容,请参考各职位内容的详细介绍。
  utter_faq/max_job_request:
    - text: 对于校园招聘,最多申请2个职位。
  utter_faq/audit:
    - text: 1、简历审核:应聘者需要通过ACME网站,填写并提交个人简历,ACME的招聘专员将对收取的简历进行认真的审查和筛选。了解应聘者的情况,并筛选出符合职位要求的简历,同时确认简历记载内容是否属实。2、笔试审核:ACME技术类测试主要针对应聘者的专业技能进行检查和评价。3、面试审核:经过实施评价应聘者基本素质的第一阶段面试和评价专业知识的第二阶段面试,对应聘者是否符合ACME人才理念以及应聘者的工作能力做出客观的综合评价,从而决定是否录用该应聘者。
  utter_faq/write_exam_participate:
    - text: 通过简历审核的应聘者,我们将采用短信、e-mail、ACME公告栏以及电话通知的方式告知您
  utter_faq/write_exam_location:
    - text: 笔试地点将根据您在简历中填写的学校所在城市进行统筹安排
  utter_faq/write_exam_again:
    - text: 校园招聘的大规模的笔试仅安排一次,请收到笔试通知的同学认真对待笔试机会。
  utter_faq/write_exam_with-out-offer:
    - text: 由于我们是按照严格的招聘流程筛选出的笔试名单,所以非常抱歉,对于没有收到笔试通知的同学,就不能参加本次校园招聘的笔试。
  utter_faq/interview_arrangement:
    - text: 不同的职位面试进度安排不同,除特殊安排外,笔试结束一周左右会安排面试。
  utter_faq/interview_times:
    - text: 一般情况下,业务部门和人力资源部会同时或者分别安排一次面试。个别特殊职位需要2次及以上的面试。
  utter_faq/interview_from:
    - text: 面试一般以单独面试的形式进行,但根据各公司的面试安排,也会进行小组面试。
  utter_faq/interview_clothing:
    - text: 面试着装没有统一要求,但建议您尽量穿着较为正式的职业装参加。
  utter_faq/interview_paperwork:
    - text: 面试时,请您携带可以证明您身份的有效证件,有特殊要求的职位请携带好能证明您专业水平的证书原件以及复印件。
  utter_faq/interview_result:
    - text: 我们会通过邮件或电话的形式,通知您面试结果。
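
Because every retrieval intent needs a response with the matching utter_ name, a quick consistency check can catch a forgotten entry before training. The following is a minimal sketch under stated assumptions: the function names response_key and missing_responses are hypothetical, and the two dictionaries are hand-written stand-ins for the parsed YAML files, not the real data.

```python
from typing import Dict, Iterable, List


def response_key(intent: str) -> str:
    """Name of the response that answers a retrieval intent."""
    return "utter_" + intent


def missing_responses(retrieval_intents: Iterable[str], responses: Dict[str, list]) -> List[str]:
    """Return the retrieval intents that have no matching response entry."""
    return [intent for intent in retrieval_intents if response_key(intent) not in responses]


# Illustrative stand-ins for data/nlu.yml and data/responses.yml.
retrieval_intents = ["faq/notes", "faq/max_job_request", "faq/interview_result"]
responses = {
    "utter_faq/notes": [{"text": "..."}],
    "utter_faq/max_job_request": [{"text": "..."}],
}

print(missing_responses(retrieval_intents, responses))  # ['faq/interview_result']
```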

III. The data/stories.yml file
  A story orchestrates a conversation scenario:

version: "3.1"
stories:
  - story: greet
    steps:
      - intent: greet
      - action: utter_greet
  - story: say goodbye
    steps:
      - intent: goodbye
      - action: utter_goodbye

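IV. The data/rules.yml file
  This file defines a rule named "respond to FAQs": whenever the predicted retrieval intent belongs to the faq group, the bot executes utter_faq, and the concrete utter_faq/... response is the one chosen by ResponseSelector: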

version: "3.1"
rules:
  - rule: respond to FAQs
    steps:
      - intent: faq
      - action: utter_faq

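V. The domain.yml file
  This file mainly contains intents, responses, and actions. Note that only the retrieval intent group faq is listed under intents, not the individual faq/... sub-intents: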

version: "3.1"

session_config:
  session_expiration_time: 60
  carry_over_slots_to_new_session: true
intents:
  - goodbye
  - greet
  - faq
responses:
  utter_greet:
    - text: 你好,我是 Silly,我是一个基于 Rasa 的 FAQ 机器人
  utter_goodbye:
    - text: 再见!
  utter_default:
    - text: 系统不明白您说的话
actions:
  - utter_goodbye
  - utter_greet
  - utter_default
  - utter_faq

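VI. The config.yml file
  This file mainly configures pipeline and policies. The pipeline follows the usual sequence of tokenization, featurization, intent classification, and entity extraction; policies defines the dialogue policies. Note in particular that an FAQ bot must add the ResponseSelector component to the NLU pipeline, and must also enable RulePolicy and define a rule (see the data/rules.yml section):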

recipe: default.v1
language: "zh"

pipeline:
- name: JiebaTokenizer
- name: LanguageModelFeaturizer
  model_name: "bert"
#  model_weights: "bert-base-chinese"
  model_weights: "L:/20230713_HuggingFaceModel/20231004_BERT/bert-base-chinese"
- name: "DIETClassifier"
  epochs: 100
  tensorboard_log_directory: ./log
  learning_rate: 0.001
- name: "ResponseSelector"

policies:
- name: MemoizationPolicy
- name: TEDPolicy
- name: RulePolicy
assistant_id: 20231109-225257-frayed-branch
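
The two requirements stated above (ResponseSelector in the pipeline, RulePolicy in the policies) can be checked mechanically. This is only a sketch: the function faq_config_problems is hypothetical, and the dictionary is a hand-written mirror of the config.yml shown here rather than the result of loading the file.

```python
from typing import Dict, List


def component_names(section: List[Dict[str, object]]) -> List[str]:
    """Extract the 'name' of every entry in a pipeline or policies list."""
    return [str(item["name"]) for item in section]


def faq_config_problems(config: Dict[str, list]) -> List[str]:
    """Return human-readable problems; an empty list means both requirements are met."""
    problems = []
    if "ResponseSelector" not in component_names(config.get("pipeline", [])):
        problems.append("pipeline is missing ResponseSelector")
    if "RulePolicy" not in component_names(config.get("policies", [])):
        problems.append("policies is missing RulePolicy")
    return problems


# Hand-written mirror of the config.yml above (paths and tuning options omitted).
config = {
    "pipeline": [
        {"name": "JiebaTokenizer"},
        {"name": "LanguageModelFeaturizer", "model_name": "bert"},
        {"name": "DIETClassifier", "epochs": 100},
        {"name": "ResponseSelector"},
    ],
    "policies": [
        {"name": "MemoizationPolicy"},
        {"name": "TEDPolicy"},
        {"name": "RulePolicy"},
    ],
}

print(faq_config_problems(config))  # []
```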

VII. The endpoints.yml file
  action_endpoint, tracker_store, and event_broker usually keep their default configuration:

# This file contains the different endpoints your bot can use.

# Server where the models are pulled from.
# https://rasa.com/docs/rasa/user-guide/running-the-server/#fetching-models-from-a-server/

#models:
#  url: http://my-server.com/models/default_core@latest
#  wait_time_between_pulls:  10   # [optional](default: 100)

# Server which runs your custom actions.
# https://rasa.com/docs/rasa/core/actions/#custom-actions/

action_endpoint:
  url: "http://localhost:5055/webhook"

# Tracker store which is used to store the conversations.
# By default the conversations are stored in memory.
# https://rasa.com/docs/rasa/api/tracker-stores/

#tracker_store:
#    type: redis
#    url: <host of the redis instance, e.g. localhost>
#    port: <port of your redis instance, usually 6379>
#    db: <number of your database within redis, e.g. 0>
#    password: <password used for authentication>

#tracker_store:
#    type: mongod
#    url: <url to your mongo instance, e.g. mongodb://localhost:27017>
#    db: <name of the db within your mongo instance, e.g. rasa>
#    username: <username used for authentication>
#    password: <password used for authentication>

# Event broker which all conversation events should be streamed to.
# https://rasa.com/docs/rasa/api/event-brokers/

#event_broker:
#  url: localhost
#  username: username
#  password: password
#  queue: queue

VIII. Training the model and running the Rasa server
1. Train the model

rasa train

2. Run the Rasa server

rasa run --cors "*"

3. Start the HTTP server

python -m http.server

Note: the FAQ bot can be tested through a web page, or from the command line with rasa shell --debug.
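
A third way to test the running server is a short script that talks to Rasa's REST channel. The sketch below makes several assumptions not stated in this post: that the rest channel is enabled in credentials.yml, that the server listens on the default port 5005, and that the helper names build_request and ask are free to choose. It uses only the Python standard library.

```python
import json
import urllib.request
from typing import Dict, List

# Assumption: default port and the REST channel enabled in credentials.yml.
REST_WEBHOOK = "http://localhost:5005/webhooks/rest/webhook"


def build_request(sender: str, message: str, url: str = REST_WEBHOOK) -> urllib.request.Request:
    """Build the POST request carrying one user message."""
    payload = json.dumps({"sender": sender, "message": message}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(sender: str, message: str) -> List[Dict[str, object]]:
    """Send one message to the running bot and return its list of replies."""
    with urllib.request.urlopen(build_request(sender, message), timeout=10) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With the server from step 2 running, calling ask("test_user", "如何查询面试结果?") should return a list of reply objects whose text field holds the bot's answer.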

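IX. Debugging Rasa code in PyCharm
1. The DAG in Rasa
  A node in Rasa's DAG may be an NLP component or a Policy component; in essence, both can be abstracted as a Graph Component.
  Rasa caches trained Components on disk. When a Component changes, for example CountVectorizer, only the components that depend on CountVectorizer (DIETClassifier, TEDPolicy, and the Policy Ensemble) are retrained; the other components are left unchanged.
2. Debugging Rasa code in PyCharm
  Debugging the Rasa source in PyCharm is fairly convenient. It mainly involves setting the script path, the parameters, and the working directory in the run configuration (in the run logged below, the script is rasa\__main__.py inside the virtual environment and the parameters are train nlu --debug). You can then step through how the training data is processed, how the DAG is built, how Components are loaded and run, and how the final model file is stored. The fingerprint_key that appears in the log is probably a unique identifier: the debug output shows one being calculated for each node and then used when that node's output is cached.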
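
The caching behaviour described above can be illustrated with a toy model. This is not Rasa's implementation; it is a minimal sketch that assumes a node's cache key is a hash of its name, its config, and the output fingerprints of its inputs. The names Node, digest, run_graph, and build are hypothetical, and the min_ngram option is only a stand-in for any config change.

```python
import hashlib
import json
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Node:
    name: str
    config: Dict[str, object]
    inputs: List[str] = field(default_factory=list)


def digest(obj: object) -> str:
    """Stable hash of a JSON-serialisable object."""
    return hashlib.sha256(json.dumps(obj, sort_keys=True).encode("utf-8")).hexdigest()


def run_graph(nodes: List[Node], cache: Dict[str, str]) -> List[str]:
    """Run nodes (given in topological order); return the names that were actually executed."""
    output_fp: Dict[str, str] = {}
    executed: List[str] = []
    for node in nodes:
        key = digest({
            "name": node.name,
            "config": node.config,
            "inputs": [output_fp[i] for i in node.inputs],
        })
        if key not in cache:
            executed.append(node.name)              # stand-in for real training
            cache[key] = digest({"output_of": key})  # stand-in for the output fingerprint
        output_fp[node.name] = cache[key]
    return executed


def build(min_ngram: int) -> List[Node]:
    return [
        Node("Tokenizer", {}),
        Node("CountVectorizer", {"min_ngram": min_ngram}, ["Tokenizer"]),
        Node("DIETClassifier", {"epochs": 100}, ["CountVectorizer"]),
    ]


cache: Dict[str, str] = {}
first = run_graph(build(1), cache)   # everything runs
second = run_graph(build(1), cache)  # nothing changed, nothing runs
third = run_graph(build(2), cache)   # CountVectorizer changed
print(third)  # ['CountVectorizer', 'DIETClassifier']
```

Changing only the CountVectorizer config invalidates its own key and, through the changed output fingerprint, the key of DIETClassifier, while the Tokenizer result is reused.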
3. The rasa train nlu --debug log
  The console log helps in understanding how Rasa executes and in debugging the source:

L:\20231106_ConversationSystem\20220407_RasaEcosystem\RasaBooks\RasaInAction\rasa_chinese_book_code\Chapter04\venv\Scripts\python.exe "D:/Program Files/JetBrains/PyCharm 2023.1.3/plugins/python/helpers/pydev/pydevd.py" --multiprocess --qt-support=auto --client 127.0.0.1 --port 38019 --file L:\20231106_ConversationSystem\20220407_RasaEcosystem\RasaBooks\RasaInAction\rasa_chinese_book_code\Chapter04\venv\Lib\site-packages\rasa\__main__.py train nlu --debug
Connected to pydev debugger (build 232.9559.58)

2023-11-10 23:24:32 DEBUG    h5py._conv  - Creating converter from 7 to 5
2023-11-10 23:24:32 DEBUG    h5py._conv  - Creating converter from 5 to 7

2023-11-10 23:26:17 DEBUG    rasa.shared.nlu.training_data.loading  - Training data format of 'data\nlu.yml' is 'rasa_yml'.  # nlu.yml file (rasa_yml data format)
2023-11-10 23:26:17 DEBUG    rasa.shared.nlu.training_data.loading  - Training data format of 'data\responses.yml' is 'rasa_yml'.  # responses.yml file (rasa_yml data format)
2023-11-10 23:26:17 DEBUG    rasa.shared.nlu.training_data.loading  - Training data format of 'data\rules.yml' is 'unk'.  # rules.yml file (unk data format)
2023-11-10 23:26:17 DEBUG    rasa.shared.nlu.training_data.loading  - Training data format of 'data\stories.yml' is 'unk'.  # stories.yml file (unk data format)

2023-11-10 23:26:33 DEBUG    rasa.telemetry  - Skipping telemetry reporting: no license hash found.  # telemetry reporting skipped: no license hash found
2023-11-10 23:27:24 DEBUG    rasa.engine.training.graph_trainer  - Starting training.  # training starts

2023-11-10 23:27:24 DEBUG    rasa.engine.graph  - Node 'train_JiebaTokenizer0' loading 'FingerprintComponent.create' and kwargs: '{}'.  # train_JiebaTokenizer0
2023-11-10 23:27:24 DEBUG    rasa.engine.graph  - Node 'run_JiebaTokenizer0' loading 'FingerprintComponent.create' and kwargs: '{}'.  # run_JiebaTokenizer0
2023-11-10 23:27:24 DEBUG    rasa.engine.graph  - Node 'run_LanguageModelFeaturizer1' loading 'FingerprintComponent.create' and kwargs: '{}'.  # run_LanguageModelFeaturizer1
2023-11-10 23:27:24 DEBUG    rasa.engine.graph  - Node 'train_DIETClassifier2' loading 'FingerprintComponent.create' and kwargs: '{}'.  # train_DIETClassifier2
2023-11-10 23:27:24 DEBUG    rasa.engine.graph  - Node 'train_ResponseSelector3' loading 'FingerprintComponent.create' and kwargs: '{}'.  # train_ResponseSelector3
2023-11-10 23:27:24 DEBUG    rasa.engine.training.graph_trainer  - Running the train graph in fingerprint mode.  # the train graph is run in fingerprint mode
2023-11-10 23:27:24 DEBUG    rasa.engine.runner.dask  - Running graph with inputs: {'__importer__': NluDataImporter}, targets: None and ExecutionContext(model_id=None, should_add_diagnostic_data=False, is_finetuning=False, node_name=None).
2023-11-10 23:27:24 DEBUG    rasa.engine.graph  - Node 'schema_validator' loading 'DefaultV1RecipeValidator.create' and kwargs: '{}'.  # schema_validator
2023-11-10 23:27:24 DEBUG    rasa.engine.graph  - Node 'schema_validator' running 'DefaultV1RecipeValidator.validate'.  # schema_validator
2023-11-10 23:27:24 DEBUG    rasa.shared.nlu.training_data.training_data  - Validating training data...  # validating the training data
2023-11-10 23:27:24 DEBUG    rasa.engine.graph  - Node 'finetuning_validator' loading 'FinetuningValidator.create' and kwargs: '{}'.  # finetuning_validator
2023-11-10 23:27:24 DEBUG    rasa.engine.graph  - Node 'finetuning_validator' running 'FinetuningValidator.validate'.  # finetuning_validator
2023-11-10 23:27:24 DEBUG    rasa.engine.storage.local_model_storage  - Resource 'finetuning_validator' was requested for writing.  # finetuning_validator
2023-11-10 23:27:24 DEBUG    rasa.engine.storage.local_model_storage  - Resource 'finetuning_validator' was persisted.  # finetuning_validator
2023-11-10 23:27:24 DEBUG    rasa.engine.graph  - Node 'nlu_training_data_provider' loading 'NLUTrainingDataProvider.create' and kwargs: '{}'.  # nlu_training_data_provider
2023-11-10 23:27:24 DEBUG    rasa.engine.graph  - Node 'nlu_training_data_provider' running 'NLUTrainingDataProvider.provide'.  # nlu_training_data_provider
2023-11-10 23:27:24 DEBUG    rasa.shared.nlu.training_data.loading  - Training data format of 'data\nlu.yml' is 'rasa_yml'.  # nlu.yml file (rasa_yml data format)
2023-11-10 23:27:25 DEBUG    rasa.shared.nlu.training_data.loading  - Training data format of 'data\responses.yml' is 'rasa_yml'.  # responses.yml file (rasa_yml data format)
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Node 'train_JiebaTokenizer0' running 'FingerprintComponent.run'.  # train_JiebaTokenizer0
2023-11-10 23:27:25 DEBUG    rasa.engine.training.fingerprinting  - Calculated fingerprint_key '963f41cf1cdb9cadc8914a14e070fb8e' for class 'JiebaTokenizer'.  # fingerprint key calculated for class 'JiebaTokenizer'
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Node 'run_JiebaTokenizer0' running 'FingerprintComponent.run'.  # run_JiebaTokenizer0
2023-11-10 23:27:25 DEBUG    rasa.engine.training.fingerprinting  - Calculated fingerprint_key 'ae36d2dae4cc78840b153d44fee8f81a' for class 'JiebaTokenizer'.  # fingerprint key calculated for class 'JiebaTokenizer'
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Node 'run_LanguageModelFeaturizer1' running 'FingerprintComponent.run'.  # run_LanguageModelFeaturizer1
2023-11-10 23:27:25 DEBUG    rasa.engine.training.fingerprinting  - Calculated fingerprint_key 'f2bfce545dd2c1c12fb895b075954315' for class 'LanguageModelFeaturizer'.  # fingerprint key calculated for class 'LanguageModelFeaturizer'
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Node 'train_DIETClassifier2' running 'FingerprintComponent.run'.  # train_DIETClassifier2
2023-11-10 23:27:25 DEBUG    rasa.engine.training.fingerprinting  - Calculated fingerprint_key '1d3616cf6980e5f0f38aa9ceb51f1e7a' for class 'DIETClassifier'.  # fingerprint key calculated for class 'DIETClassifier'
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Node 'train_ResponseSelector3' running 'FingerprintComponent.run'.  # train_ResponseSelector3
2023-11-10 23:27:25 DEBUG    rasa.engine.training.fingerprinting  - Calculated fingerprint_key 'b91434757a05a4178cdc7f7882cfd9aa' for class 'ResponseSelector'.  # fingerprint key calculated for class 'ResponseSelector'
2023-11-10 23:27:25 DEBUG    rasa.engine.training.graph_trainer  - Running the pruned train graph with real node execution.  # the pruned train graph is run with real node execution
2023-11-10 23:27:25 DEBUG    rasa.engine.runner.dask  - Running graph with inputs: {'__importer__': NluDataImporter}, targets: None and ExecutionContext(model_id=None, should_add_diagnostic_data=False, is_finetuning=False, node_name=None).
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Hook 'LoggingHook.on_before_node' running for node 'nlu_training_data_provider'.  # nlu_training_data_provider
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Hook 'TrainingHook.on_before_node' running for node 'nlu_training_data_provider'.  # nlu_training_data_provider
2023-11-10 23:27:25 DEBUG    rasa.engine.training.fingerprinting  - Calculated fingerprint_key '1fbfa24243412736ce1002efbeba382f' for class 'NLUTrainingDataProvider'.  # fingerprint key calculated for class 'NLUTrainingDataProvider'
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Node 'nlu_training_data_provider' loading 'PrecomputedValueProvider.create' and kwargs: '{}'.  # nlu_training_data_provider
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Node 'nlu_training_data_provider' running 'PrecomputedValueProvider.get_value'.  # nlu_training_data_provider
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Hook 'LoggingHook.on_after_node' running for node 'nlu_training_data_provider'.  # nlu_training_data_provider
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Hook 'TrainingHook.on_after_node' running for node 'nlu_training_data_provider'.  # nlu_training_data_provider
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Hook 'LoggingHook.on_before_node' running for node 'train_JiebaTokenizer0'.  # train_JiebaTokenizer0
2023-11-10 23:27:25 INFO     rasa.engine.training.hooks  - Starting to train component 'JiebaTokenizer'.  # training of component 'JiebaTokenizer' starts
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Hook 'TrainingHook.on_before_node' running for node 'train_JiebaTokenizer0'.  # train_JiebaTokenizer0
2023-11-10 23:27:25 DEBUG    rasa.engine.training.fingerprinting  - Calculated fingerprint_key '963f41cf1cdb9cadc8914a14e070fb8e' for class 'JiebaTokenizer'.  # fingerprint key calculated for class 'JiebaTokenizer'
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Node 'train_JiebaTokenizer0' loading 'JiebaTokenizer.create' and kwargs: '{}'.  # train_JiebaTokenizer0
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Node 'train_JiebaTokenizer0' running 'JiebaTokenizer.train'.  # train_JiebaTokenizer0
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Hook 'LoggingHook.on_after_node' running for node 'train_JiebaTokenizer0'.  # train_JiebaTokenizer0
2023-11-10 23:27:25 INFO     rasa.engine.training.hooks  - Finished training component 'JiebaTokenizer'.  # training of component 'JiebaTokenizer' finished
2023-11-10 23:27:25 DEBUG    rasa.engine.graph  - Hook 'TrainingHook.on_after_node' running for node 'train_JiebaTokenizer0'.  # train_JiebaTokenizer0
2023-11-10 23:27:25 DEBUG    rasa.engine.training.hooks  - Caching 'Resource' with fingerprint_key: '963f41cf1cdb9cadc8914a14e070fb8e' and output_fingerprint '141a681b80024953b9b7865284b9fece'.
2023-11-10 23:27:25 DEBUG    rasa.engine.storage.local_model_storage  - Resource 'train_JiebaTokenizer0' was requested for reading.  # train_JiebaTokenizer0
2023-11-10 23:27:25 DEBUG    rasa.engine.storage.resource  - Skipped caching resource 'train_JiebaTokenizer0' as no persisted data was found.  # caching of resource 'train_JiebaTokenizer0' skipped because no persisted data was found
2023-11-10 23:27:25 DEBUG    rasa.engine.caching  - Caching output of type 'Resource' succeeded.  # output of type 'Resource' cached successfully
2023-11-10 23:27:26 DEBUG    rasa.engine.graph  - Hook 'LoggingHook.on_before_node' running for node 'run_JiebaTokenizer0'.  # run_JiebaTokenizer0
2023-11-10 23:27:26 DEBUG    rasa.engine.graph  - Hook 'TrainingHook.on_before_node' running for node 'run_JiebaTokenizer0'.  # run_JiebaTokenizer0
2023-11-10 23:27:26 DEBUG    rasa.engine.training.fingerprinting  - Calculated fingerprint_key '496a8741f1dfb458bbfedb535d343623' for class 'JiebaTokenizer'.  # fingerprint key calculated for class 'JiebaTokenizer'
2023-11-10 23:27:26 DEBUG    rasa.engine.graph  - Node 'run_JiebaTokenizer0' loading 'JiebaTokenizer.load' and kwargs: '{'resource': Resource(name='train_JiebaTokenizer0', output_fingerprint='141a681b80024953b9b7865284b9fece')}'.
2023-11-10 23:27:26 DEBUG    rasa.engine.graph  - Node 'run_JiebaTokenizer0' running 'JiebaTokenizer.process_training_data'.  # run_JiebaTokenizer0

# jieba tokenization
Building prefix dict from the default dictionary ...
2023-11-10 23:27:26 DEBUG    jieba  - Building prefix dict from the default dictionary ...
Loading model from cache C:\Users\ADMINI~1\AppData\Local\Temp\jieba.cache
2023-11-10 23:27:26 DEBUG    jieba  - Loading model from cache C:\Users\ADMINI~1\AppData\Local\Temp\jieba.cache
Loading model cost 1.116 seconds.
2023-11-10 23:27:27 DEBUG    jieba  - Loading model cost 1.116 seconds.
Prefix dict has been built successfully.
2023-11-10 23:27:27 DEBUG    jieba  - Prefix dict has been built successfully.

2023-11-10 23:27:27 DEBUG    rasa.engine.graph  - Hook 'LoggingHook.on_after_node' running for node 'run_JiebaTokenizer0'.
2023-11-10 23:27:27 DEBUG    rasa.engine.graph  - Hook 'TrainingHook.on_after_node' running for node 'run_JiebaTokenizer0'.
2023-11-10 23:27:27 DEBUG    rasa.engine.training.hooks  - Caching 'TrainingData' with fingerprint_key: '496a8741f1dfb458bbfedb535d343623' and output_fingerprint '1baa8435dc0351e013e3b8f3635e83d6'.
2023-11-10 23:27:27 DEBUG    rasa.engine.graph  - Hook 'LoggingHook.on_before_node' running for node 'run_LanguageModelFeaturizer1'.
2023-11-10 23:27:27 DEBUG    rasa.engine.graph  - Hook 'TrainingHook.on_before_node' running for node 'run_LanguageModelFeaturizer1'.
2023-11-10 23:27:27 DEBUG    rasa.engine.training.fingerprinting  - Calculated fingerprint_key 'de5a4adf999a20fb8e5716903003508c' for class 'LanguageModelFeaturizer'.
2023-11-10 23:27:27 DEBUG    rasa.engine.graph  - Node 'run_LanguageModelFeaturizer1' loading 'LanguageModelFeaturizer.load' and kwargs: '{}'.
2023-11-10 23:27:28 DEBUG    rasa.nlu.featurizers.dense_featurizer.lm_featurizer  - Loading Tokenizer and Model for bert

2023-11-10 23:27:32 DEBUG    rasa.engine.graph  - Node 'run_LanguageModelFeaturizer1' running 'LanguageModelFeaturizer.process_training_data'.
2023-11-10 23:27:41 DEBUG    rasa.engine.graph  - Hook 'LoggingHook.on_after_node' running for node 'run_LanguageModelFeaturizer1'.
2023-11-10 23:27:41 DEBUG    rasa.engine.graph  - Hook 'TrainingHook.on_after_node' running for node 'run_LanguageModelFeaturizer1'.
2023-11-10 23:27:41 DEBUG    rasa.engine.training.hooks  - Caching 'TrainingData' with fingerprint_key: 'de5a4adf999a20fb8e5716903003508c' and output_fingerprint '1192d8329eb2a6d87f6e965765d10871'.
2023-11-10 23:27:41 DEBUG    rasa.engine.graph  - Hook 'LoggingHook.on_before_node' running for node 'train_DIETClassifier2'.
2023-11-10 23:27:41 INFO     rasa.engine.training.hooks  - Starting to train component 'DIETClassifier'.
2023-11-10 23:27:41 DEBUG    rasa.engine.graph  - Hook 'TrainingHook.on_before_node' running for node 'train_DIETClassifier2'.
2023-11-10 23:27:41 DEBUG    rasa.engine.training.fingerprinting  - Calculated fingerprint_key '7d66b69a551ffbc2a45237a02ffc5aa7' for class 'DIETClassifier'.
2023-11-10 23:27:41 DEBUG    rasa.engine.graph  - Node 'train_DIETClassifier2' loading 'DIETClassifier.create' and kwargs: '{}'.

2023-11-10 23:27:41 DEBUG    rasa.engine.graph  - Node 'train_DIETClassifier2' running 'DIETClassifier.train'.
2023-11-10 23:27:41 DEBUG    rasa.nlu.classifiers.diet_classifier  - No label features found. Computing default label features.
2023-11-10 23:27:41 DEBUG    rasa.nlu.classifiers.diet_classifier  - You specified 'DIET' to train entities, but no entities are present in the training data. Skipping training of entities.
2023-11-10 23:27:42 DEBUG    rasa.nlu.classifiers.diet_classifier  - Following metrics will be logged during training:
2023-11-10 23:27:42 DEBUG    rasa.nlu.classifiers.diet_classifier  -   t_loss (total loss)
2023-11-10 23:27:42 DEBUG    rasa.nlu.classifiers.diet_classifier  -   i_acc (intent acc)
2023-11-10 23:27:42 DEBUG    rasa.nlu.classifiers.diet_classifier  -   i_loss (intent loss)
2023-11-10 23:27:42 DEBUG    rasa.utils.tensorflow.data_generator  - The provided batch size is a list, this data generator will use a linear increasing batch size.

Epochs:   0%|          | 0/100 [00:00<?, ?it/s]
Epochs: 100%|██████████| 100/100 [01:26<00:00,  1.15it/s, t_loss=0.258, i_loss=0.0123, i_acc=1]
2023-11-10 23:29:09 DEBUG    rasa.engine.storage.local_model_storage  - Resource 'train_DIETClassifier2' was requested for writing.
2023-11-10 23:29:09 DEBUG    rasa.engine.storage.local_model_storage  - Resource 'train_DIETClassifier2' was persisted.
2023-11-10 23:29:09 DEBUG    rasa.engine.graph  - Hook 'LoggingHook.on_after_node' running for node 'train_DIETClassifier2'.
2023-11-10 23:29:09 INFO     rasa.engine.training.hooks  - Finished training component 'DIETClassifier'.
2023-11-10 23:29:09 DEBUG    rasa.engine.graph  - Hook 'TrainingHook.on_after_node' running for node 'train_DIETClassifier2'.
2023-11-10 23:29:09 DEBUG    rasa.engine.training.hooks  - Caching 'Resource' with fingerprint_key: '7d66b69a551ffbc2a45237a02ffc5aa7' and output_fingerprint '9a50714386a54eebbd0b5eb4ab2fd23c'.
2023-11-10 23:29:09 DEBUG    rasa.engine.storage.local_model_storage  - Resource 'train_DIETClassifier2' was requested for reading.
2023-11-10 23:29:09 DEBUG    rasa.engine.caching  - Caching output of type 'Resource' succeeded.
2023-11-10 23:29:11 DEBUG    rasa.engine.graph  - Hook 'LoggingHook.on_before_node' running for node 'train_ResponseSelector3'.
2023-11-10 23:29:11 INFO     rasa.engine.training.hooks  - Starting to train component 'ResponseSelector'.
2023-11-10 23:29:11 DEBUG    rasa.engine.graph  - Hook 'TrainingHook.on_before_node' running for node 'train_ResponseSelector3'.
2023-11-10 23:29:11 DEBUG    rasa.engine.training.fingerprinting  - Calculated fingerprint_key '0e102b0ba0b459b1556ae9eb4aaac987' for class 'ResponseSelector'.
2023-11-10 23:29:11 DEBUG    rasa.engine.graph  - Node 'train_ResponseSelector3' loading 'ResponseSelector.create' and kwargs: '{}'.
2023-11-10 23:29:11 DEBUG    rasa.engine.graph  - Node 'train_ResponseSelector3' running 'ResponseSelector.train'.
2023-11-10 23:29:11 INFO     rasa.nlu.selectors.response_selector  - Retrieval intent parameter was left to its default value. This response selector will be trained on training examples combining all retrieval intents.
2023-11-10 23:29:11 DEBUG    rasa.nlu.classifiers.diet_classifier  - No label features found. Computing default label features.
2023-11-10 23:29:11 DEBUG    rasa.nlu.selectors.response_selector  - Following metrics will be logged during training:
2023-11-10 23:29:11 DEBUG    rasa.nlu.selectors.response_selector  -   t_loss (total loss)
2023-11-10 23:29:11 DEBUG    rasa.nlu.selectors.response_selector  -   r_acc (response acc)
2023-11-10 23:29:11 DEBUG    rasa.nlu.selectors.response_selector  -   r_loss (response loss)
2023-11-10 23:29:11 DEBUG    rasa.utils.tensorflow.data_generator  - The provided batch size is a list, this data generator will use a linear increasing batch size.
Epochs: 100%|██████████| 300/300 [00:39<00:00,  7.55it/s, t_loss=2.93, r_loss=1.17, r_acc=1]
2023-11-10 23:29:51 DEBUG    rasa.engine.storage.local_model_storage  - Resource 'train_ResponseSelector3' was requested for writing.
2023-11-10 23:29:51 DEBUG    rasa.engine.storage.local_model_storage  - Resource 'train_ResponseSelector3' was persisted.
2023-11-10 23:29:51 DEBUG    rasa.engine.storage.local_model_storage  - Resource 'train_ResponseSelector3' was requested for writing.
2023-11-10 23:29:51 DEBUG    rasa.engine.storage.local_model_storage  - Resource 'train_ResponseSelector3' was persisted.
2023-11-10 23:29:51 DEBUG    rasa.engine.graph  - Hook 'LoggingHook.on_after_node' running for node 'train_ResponseSelector3'.
2023-11-10 23:29:51 INFO     rasa.engine.training.hooks  - Finished training component 'ResponseSelector'.
2023-11-10 23:29:51 DEBUG    rasa.engine.graph  - Hook 'TrainingHook.on_after_node' running for node 'train_ResponseSelector3'.
2023-11-10 23:29:51 DEBUG    rasa.engine.training.hooks  - Caching 'Resource' with fingerprint_key: '0e102b0ba0b459b1556ae9eb4aaac987' and output_fingerprint '300fbcfe9f004bf2a6870e283e7b4f92'.
2023-11-10 23:29:51 DEBUG    rasa.engine.storage.local_model_storage  - Resource 'train_ResponseSelector3' was requested for reading.
2023-11-10 23:29:51 DEBUG    rasa.engine.caching  - Caching output of type 'Resource' succeeded.
2023-11-10 23:29:51 DEBUG    rasa.engine.storage.local_model_storage  - Start to created model package for path 'models\nlu-20231110-232632-arid-seasoning.tar.gz'.
2023-11-10 23:29:58 DEBUG    rasa.engine.storage.local_model_storage  - Model package created in path 'models\nlu-20231110-232632-arid-seasoning.tar.gz'.
Your Rasa model is trained and saved at 'models\nlu-20231110-232632-arid-seasoning.tar.gz'.
2023-11-10 23:29:58 DEBUG    rasa.telemetry  - Skipping telemetry reporting: no license hash found.

Process finished with exit code 0
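
The INFO lines in the log above bracket each component's training with "Starting to train component" and "Finished training component", so per-component training time can be read off mechanically. The following is a small sketch; the function name training_seconds is hypothetical, and the regular expression assumes the exact line layout shown in this log, which may differ in other Rasa versions.

```python
import re
from datetime import datetime
from typing import Dict, Iterable

# Assumption: timestamp, level, logger name, and message laid out as in the log above.
PATTERN = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\s+INFO\s+rasa\.engine\.training\.hooks\s+-\s+"
    r"(Starting to train|Finished training) component '([^']+)'"
)


def training_seconds(lines: Iterable[str]) -> Dict[str, float]:
    """Map each component to the seconds between its start and finish lines."""
    started: Dict[str, datetime] = {}
    durations: Dict[str, float] = {}
    for line in lines:
        match = PATTERN.match(line)
        if not match:
            continue
        stamp = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
        phase, component = match.group(2), match.group(3)
        if phase == "Starting to train":
            started[component] = stamp
        elif component in started:
            durations[component] = (stamp - started.pop(component)).total_seconds()
    return durations


# Four lines copied from the log above.
log = [
    "2023-11-10 23:27:41 INFO     rasa.engine.training.hooks  - Starting to train component 'DIETClassifier'.",
    "2023-11-10 23:29:09 INFO     rasa.engine.training.hooks  - Finished training component 'DIETClassifier'.",
    "2023-11-10 23:29:11 INFO     rasa.engine.training.hooks  - Starting to train component 'ResponseSelector'.",
    "2023-11-10 23:29:51 INFO     rasa.engine.training.hooks  - Finished training component 'ResponseSelector'.",
]

print(training_seconds(log))  # {'DIETClassifier': 88.0, 'ResponseSelector': 40.0}
```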

References:
[1] 《Rasa实战》 (Rasa in Action)

posted on 2023-11-11 22:55  扫地升