
A guide to deploying Machine/Deep Learning model(s) in Production

https://blog.usejournal.com/a-guide-to-deploying-machine-deep-learning-model-s-in-production-e497fd4b734a

[Image] Source: Algorithmia
 
[Image] Illustration of the workflow (from client API requests to server prediction responses). You are free to use the image.

Components

 

Architecture Setup

gunicorn --workers 1 --timeout 300 --bind 0.0.0.0:8000 api:app

- workers (INT): The number of worker processes for handling requests.
- timeout (INT): Workers silent for more than this many seconds are killed and restarted.
- bind (ADDRESS): The socket to bind. Default: 127.0.0.1:8000.
- api: The Python module (the file 'api.py') containing the Flask application.
- app: An instance of the Flask class in the main Python file 'api.py'.
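
To make the `api:app` target concrete, here is a minimal sketch of what an 'api.py' could look like. It is not the article's actual code: the `/predict` route, the `features` JSON key, and the `predict` helper (a stand-in that simply sums its inputs) are all assumptions made for illustration. Only the module name `api` and the Flask instance name `app` come from the command above.

```python
# api.py -- illustrative sketch only; route, payload shape, and model are assumed.
from flask import Flask, jsonify, request

# Gunicorn's `api:app` argument resolves to this object: module `api`, attribute `app`.
app = Flask(__name__)


def predict(features):
    """Hypothetical stand-in for a trained model: returns the sum of the inputs."""
    return float(sum(features))


@app.route("/predict", methods=["POST"])
def predict_route():
    # Parse the JSON body; fall back to an empty dict if it is missing or invalid.
    payload = request.get_json(silent=True) or {}
    features = payload.get("features")

    # Reject anything that is not a non-empty list of numbers.
    is_valid = (
        isinstance(features, list)
        and len(features) > 0
        and all(isinstance(x, (int, float)) for x in features)
    )
    if not is_valid:
        return jsonify(error="expected a non-empty numeric 'features' list"), 400

    return jsonify(prediction=predict(features))
```

With a file like this in the working directory, the gunicorn command above would serve it on port 8000, and a client could POST a JSON body such as {"features": [1, 2, 3.5]} to the assumed /predict route.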
 
 

Additional Setup (Add-ons)

 

Alternate platforms

[Image] Source: https://opensource.googleblog.com/2016/02/running-your-models-in-production-with.html
[Image] Source: https://codingpackets.com/virtualization/docker/
[Image] Source: https://eng.uber.com/michelangelo/

Additional Resources

If you liked this article, I’d really appreciate it if you hit the like button to recommend it to others. You can also follow me on Twitter and Medium. Peace! 😎

Posted by medsci