2024年2月15日

摘要: 1. deploy worker, parameter server on kubernetes cluster 1.1 build container image of worker, parameter server $ git clone https://github.com/tensorfl 阅读全文
posted @ 2024-02-15 16:37 zhenxia-jiuyou 阅读(28) 评论(0) 推荐(0) 编辑
 
摘要: [ERROR: tf distribute strategy parameter server: tfx component trainer: model.save(): failed to connect to all addresses] log of pod tfx-component-tra 阅读全文
posted @ 2024-02-15 00:01 zhenxia-jiuyou 阅读(59) 评论(0) 推荐(0) 编辑