摘要:
https://zhuanlan.zhihu.com/p/617807098 阅读全文
2024年2月19日
2024年2月18日
摘要:
1. build kubeflow/serving container image which contains serving_model [1] # run container tensorflow/serving, the image of this container # is the ba 阅读全文
摘要:
1. save the trained model # in module file of tfx component trainer def _apply_preprocessing(raw_features, tft_layer): transformed_features = tft_laye 阅读全文
2024年2月15日
摘要:
1. deploy worker, parameter server on kubernetes cluster 1.1 build container image of worker, parameter server $ git clone https://github.com/tensorfl 阅读全文
摘要:
[ERROR: tf distribute strategy parameter server: tfx component trainer: model.save(): failed to connect to all addresses] log of pod tfx-component-tra 阅读全文
2024年2月14日
摘要:
[ERROR: tf distribute strategy parameter server: tfx component trainer: OutOfRangeError(), Node: 'cond/IteratorGetNext' End of sequence] log of pod tf 阅读全文
摘要:
[ERROR: NOT_FOUND: /tfx/tfx_pv/pipelines/detect_anomolies_on_wafer_tfdv_schema/ImportExampleGen/examples/67/Split-train/data_tfrecord-00000-of-00001.g 阅读全文
摘要:
[ERROR: Failed to start a transaction to create a new pipeline and a new pipeline version: dial tcp: lookup mysql on 10.96.0.10:53: no such host","] > 阅读全文
摘要:
[ERROR: stuck at "INFO:tensorflow:ParameterServerStrategyV2 is now connecting to cluster with cluster_spec: ClusterSpec({'ps': ['dist-strat-example-ps 阅读全文
摘要:
[ERROR: unknown field "spec.template.spec.nodeAffinity"] (base) maye@maye-Inspiron-5547:~/github_repository/tensorflow_ecosystem/distribution_strategy 阅读全文