一、MAC部署

Mac安装Python3

方式一：

1. 在终端输入:

 brew install python3

2. 等待自动安装完成，再进行配置

方式二：

1. 官网：https://www.python.org/downloads/

2. 下载对应版本，一路默认安装，安装完成如下图：

Mac安装pip，检验你是否安装了pip

python3 -m pip --version

手动安装pip

curl https://bootstrap.pypa.io/get-pip.py -o get-pip.py

获取pip

python3 get-pip.py

pygame安装（如果需要的话）

pip install pygame

安装：https://github.com/apple/ml-stable-diffusion

git clone https://github.com/apple/ml-stable-diffusion
// 下载的文件夹运行
pip install -e .

如果出现

WARNING: The script tqdm is installed in '/Users/strengthen/Library/Python/3.9/bin' which is not on PATH.

访达打开 ~/.bash_profile 文件，在打开的文件的末尾添加以下内容：

export PATH=$PATH:/path/to/your/folder

注册登陆：hugging face

点击生成token并复制：https://huggingface.co/settings/tokens

通过以下命令使用 pip 安装 huggingface-cli：

pip install huggingface_hub

报错则输入：

# linux or mac
echo "export PATH=\"`python3 -m site --user-base`/bin:\$PATH\"" >> ~/.bashrc
source ~/.bashrc

接下去命令行输入指令，按提示输入

huggingface-cli login

运行指令，在文件夹下运行，顺便创建一个放置ml模型的文件夹，替换下面的-o

python3 -m python_coreml_stable_diffusion.torch2coreml --convert-unet --convert-text-encoder --convert-vae-decoder --convert-safety-checker -o ./output_ml

官方还给了一个api：

--model-version runwayml/stable-diffusion-v1-5 #可以指定其他版本的diffusion模型，默认是 CompVis/stable-diffusion-v1-4
--bundle-resources-for-swift-cli  #将ml文件整理成一个swift包，python生成不需要使用
--chunk-unet #ios和ipados部署需要，后面两个之后有机会我想去尝试一下在真机上的部署
--attention-implementation #在Apple芯片上的npu上实现

如果想部署手机和平板可以参考：

python3 -m python_coreml_stable_diffusion.torch2coreml --convert-unet --convert-text-encoder --convert-vae-decoder --convert-safety-checker -o ./output_ml --chunk-unet --model-version coreml-stable-diffusion-v1-4 --bundle-resources-for-swift-cli

解释：
-m python_coreml_stable_diffusion.torch2coreml：使用 python_coreml_stable_diffusion 包中的 torch2coreml 模块进行模型转换。
--convert-unet：将 U-Net 模型转换为 Core ML 模型。
--convert-text-encoder：将文本编码器模型转换为 Core ML 模型。
--convert-vae-decoder：将 VAE 解码器模型转换为 Core ML 模型。
--convert-safety-checker：将安全检查器模型转换为 Core ML 模型。
-o ./sd2_ml：将生成的 Core ML 模型保存到 ./sd2_ml 目录中。
--chunk-unet：对 U-Net 模型进行分块处理，以便在内存中处理大型模型。
--model-version stabilityai/stable-diffusion-2-1-base：指定要转换的 PyTorch 模型的版本。
--bundle-resources-for-swift-cli：将必要的资源打包成 Swift 代码。

大约20几分钟，会生成下列文件：

接着还是在ml-stable-diffusion的文件夹下运行，对了再创建一个放图片的文件夹：

python -m python_coreml_stable_diffusion.pipeline --prompt "a photo of an astronaut riding a horse on mars" -i ./output_ml -o ./output_image --compute-unit ALL --seed 93

swift用下面这个：

swift run StableDiffusionSample "A photo of a little girl walking on the beach with the Jenny Turtle" --resource-path ./sd2_ml/Resources/ --seed 93 --output-path ./output_image

--model-version #如果前面修改了这个也要修改
--num-inference-steps #默认推理50次，用这个可以自定义次数

参考了官方的基准指南--compute-unit 选择了CPU_AND_NE，swift 2秒左右一个step，快了很多。

就是从上图中我们可以看到似乎无后缀的M芯片由于GPU数量较少吧我猜--compute-unit 都推荐选CPU_AND_NE，pro系列芯片选ALL，以上选CPU_AND_GPU。

--attention-implementation 直接看GPU核心数，小于等于16个使用SPLLIT_EINSUM，就是默认的其实啥也不用加。大于16个使用ORIGINAL。

我猜因为M芯片是16核NPU的，GPU核心数比NPU小那肯定用NPU，GPU核心数大大超过NPU，那还是GPU的效率更高吧。

二、iPhone & iPad部署

打开Xcode，导入库和上面生成的ML文件

 1 import SwiftUI
 2 import StableDiffusion
 3 import CoreML
 4 
 5 struct ContentView: View {
 6     @State var prompt: String = "a photo of an astronaut riding a horse on mars"
 7     @State var step = 10
 8     @State var seed = 100
 9     @State var image: CGImage?
10     @State var progress = 0.0
11     @State var generating = false
12     @State var booting = true
13     
14     @State var pipeline: StableDiffusionPipeline?
15     
16     private let disableSafety = false
17 
18     
19     var body: some View {
20         VStack {
21             if booting {
22                 Text("Initializing...")
23             } else {
24                 if let image {
25                     Image(uiImage: UIImage(cgImage: image))
26                         .resizable()
27                         .scaledToFit()
28                 }
29                 if generating {
30                     ProgressView(value: progress)
31                 }
32                 if !generating {
33                     TextField("Prompt", text: $prompt)
34                     Stepper(value: $step, in: 1...100) {
35                         Text("steps: \(step)")
36                     }
37                     Stepper(value: $seed, in: 0...10000) {
38                         Text("Seed: \(seed)")
39                     }
40                     Button("Generate") {
41                         progress = 0.0
42                         image = nil
43                         generating = true
44                         Task.detached(priority: .high) {
45                             var images: [CGImage?]?
46                             do {
47                                 print("generate")
48                                 images = try pipeline?.generateImages(prompt: prompt, stepCount: step,seed: seed, disableSafety: disableSafety, progressHandler: { progress in
49                                     print("test")
50                                     self.progress = Double(progress.step) / Double(step)
51                                     if let image = progress.currentImages.first {
52                                         self.image = image
53                                     }
54                                     return true
55                                 })
56                             } catch let error {
57                                 print(error.localizedDescription)
58                             }
59                             print("finish")
60                             if let image = images?.first {
61                                 self.image = image
62                             }
63                             generating = false
64                         }
65                     }
66                 }
67             }
68         }
69         .padding()
70         .onAppear{
71             Task.detached(priority: .high) {
72                 do {
73                     print(os_proc_available_memory())
74                     guard let path = Bundle.main.path(forResource: "CoreMLModels", ofType: nil, inDirectory: nil) else {
75                         fatalError("Fatal error: failed to find the CoreML models.")
76                     }
77                     let resourceURL = URL(fileURLWithPath: path)
78                     let config = MLModelConfiguration()
79                     config.computeUnits = .cpuAndNeuralEngine
80                     pipeline = try StableDiffusionPipeline(resourcesAt: resourceURL, configuration: config,reduceMemory: true)
81                     try pipeline?.loadResources()
82                     print("initialized pipeline")
83                 } catch let error {
84                     print("error initializing pipeline")
85                     print(error.localizedDescription)
86                 }
87                 booting = false
88             }
89         }
90     }
91 }
92 
93 struct ContentView_Previews: PreviewProvider {
94     static var previews: some View {
95         ContentView()
96     }
97 }

在iPad和Mac上建议config.computeUnits = .cpuAndNeuralEngine。如果要继续在iPhone上部署的话改为这个config.computeUnits = .cpuAndGPU，然后来到Signing界面，点击Capability，选择Increased Memory Limit。这样就可以在iPhone真机上运行了。这个项目在iPhone真机上需要3GB多一点的内存，我是iPhone 14pro，程序默认可用内存也是3GB多一点刚好杀死。所以通过Increased Memory Limit将可用内存提升至4GB左右才可以运行。另外，即使提升了内存使用神经引擎还是会抱内存错误，只有用GPU才可以，iPad air5没有这种报错两个都可。GPU的运行速度是比神经引擎慢一点的，但是一台手机可以本地运行diffusion还是蛮酷的。

三、Windows部署

下载库：去github网址下载 - > https://github.com/CompVis/stable-diffusion

git clone https://github.com/CompVis/stable-diffusion.git

配置环境和文件

简单的操作，在下载后的文件夹下直接输下面两个：

conda env create -f environment.yaml
conda activate ldm

这样就配置好了。或者就像我一样傻傻的报错一个安装一个，环境是pytorch torchvision。

pip install transformers==4.19.2 diffusers invisible-watermark
pip install -e .
pip install OmegaConf einops taming-transformers pytorch-lighnting clip kornia

问题

第一个问题官网是有说明的，但就是这个下载地址藏得很深不好找。

找了一会才找到https://huggingface.co/CompVis/stable-diffusion-v-1-4-original，把这个sd-v1-4.ckpt文件下载下来，随便哪个都行，大概四个G。

下载模型文件放置到这个文件夹下，就是上面官方说明的那个位置，命名成model.ckpt。

运行下面的代码，不出意外会报错。

python scripts/txt2img.py --prompt "a photograph of an astronaut riding a horse" --plms

问题好像是原作者修改了这个库，将你的quantize.py（报错的信息里包含文件所在的绝对路径）文件替换成这个网址的文件
https://github.com/CompVis/taming-transformers/blob/master/taming/modules/vqvae/quantize.py

再运行一次又报错

现在有一个简单的办法了，感谢数学系之耻的建议，直接降精度就可以释放显存了。如下修改txt2img.py文件第244行

效果图：

或者接下去看比较复杂的方法！
不知道需要多大的内存才可以，网上找到方法是用优化的库。网上还有一种办法说注释掉什么安全检查的我试了没有变化。
https://github.com/basujindal/stable-diffusion
下载完后在新的库上也是需要安装一些环境，在新的文件夹下运行下面的安装代码

pip install -e .

优化的库代码放在optimizedSD文件夹下，也保留了之前的源代码，不要搞错了。

重新安装一下这个优化库的环境，将ckpt放到对应的位置。

python optimizedSD/optimized_txt2img.py --prompt "Cyberpunk style image of a Tesla car reflection in rain" --H 512 --W 512 --seed 27 --n_iter 2 --n_samples 5 --ddim_steps 50

运行后报这个错。查了一下，好像是最近优化的作者也换了一个库https://github.com/basujindal/stable-diffusion/issues/175 用下面的办法就能解决了。

pip install git+https://github.com/crowsonkb/k-diffusion.git

然后打开编辑optimizedSD/ddpm.py文件，将from samplers…改成上面图片的三个from k_diffusion…然后贫穷的显卡的电脑也就可以跑了，不说了要努力搬砖买24g的显卡了。

posted @ 2023-04-13 15:21 为敢技术阅读(1738) 评论(0) 编辑收藏举报

刷新页面返回顶部

山青咏芝

感觉到了的东西,我们不能立刻理解它;只有理解了的东西才能更深刻地感觉它.

Mac ｜ iOS ｜ Windows：安装Stable diffusion教程

下载库：去github网址下载 - > https://github.com/CompVis/stable-diffusion

配置环境和文件

问题

公告