Fork me on GitHub

Langchain——chatchat3.1版本docker部署流程Langchain-Chatchat

Langchain——chatchat3.1版本docker部署流程Langchain-Chatchat

1. 项目地址

#项目地址
https://github.com/chatchat-space/Langchain-Chatchat
#dockerhub地址
https://hub.docker.com/r/chatimage/chatchat/tags

2. docker部署

  • 参考官方文档
#官方文档
https://github.com/chatchat-space/Langchain-Chatchat/blob/master/docs/install/README_docker.md
  • 配置docker-compose环境
cd ~
wget https://github.com/docker/compose/releases/download/v2.27.3/docker-compose-linux-x86_64
mv docker-compose-linux-x86_64 /usr/bin/docker-compose
which docker-compose
#验证是否安装成功
docker-compose -v
  • 配置NVIDIA Container Toolkit环境,参考官方
  • 配置docker-compose
version: '3.9'
services:
  xinference:
    #xprobe/xinference:latest
    image: xinferebce:1.0
    restart: always
    command: xinference-local -H 0.0.0.0
    ports: # 不使用 host network 时可打开.
     - "9997:9997"
    # network_mode: "host"
    # 将本地路径(~/xinference)挂载到容器路径(/root/.xinference)中,
    # 详情见: https://inference.readthedocs.io/zh-cn/latest/getting_started/using_docker_image.html
    volumes:
      - /home/isi/LLM/models/xinference:/root/.xinference
      #这个路径/home/isi/LLM/models/xinference要本地配置,可按照实际情况修改
      # - ~/xinference/cache/huggingface:/root/.cache/huggingface
      # - ~/xinference/cache/modelscope:/root/.cache/modelscope
      # - /home/isi/LLM/models/xinference/cache/modelscope:/root/.cache/modelscope
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]
    runtime: nvidia
    # 模型源更改为 ModelScope, 默认为 HuggingFace
    environment:
      - XINFERENCE_MODEL_SRC=modelscope
  chatchat:
    # 版本参考https://hub.docker.com/r/chatimage/chatchat/tags
    # docker pull chatimage/chatchat:0.3.1.1-2024-0714
    image: chatchat:1.0
    restart: always
    ports: # 不使用 host network 时可打开.
      - "7861:7861"
      - "8501:8501"
    # network_mode: "host"
    # 将本地路径(~/chatchat/data)挂载到容器默认数据路径(/usr/local/lib/python3.11/site-packages/chatchat/data)中
    volumes:
      - /home/isi/chatchat/data:/root/chatchat_data/data
  • 启动容器
docker-compose up -d

xinference

#1. 访问http://IP:9997/
#2. 加载Langguage models——glm4
#3. 加载embedding models——bge-large-zh-v1.5

  • 模型加载成功

chatchat

docker exec -it chat容器id bash

vi /root/chatchat_data/model_settings.yaml
#1.将embedding改成下面的bge-large-zh-V1.5
#2.将下面内容改成服务器自身的ip地址,不能改成127.0.0.1,因为127.0.0.1在ip映射的模式下不适用

知识库部署成功

访问http://IP:8501/ 走以下知识库创建的流程

以及知识库问答的流程

posted @ 2024-08-02 10:09  壶小旭  阅读(669)  评论(0编辑  收藏  举报