# Qwen-7B (vLLM)

## Model Description

Qwen-7B is a cutting-edge large language model developed as part of the Qwen series, offering advanced natural language processing capabilities. With 7 billion parameters, it demonstrates exceptional performance across various downstream tasks. The model comes in two variants: the base pretrained version and the Qwen-Chat version, which is fine-tuned using human alignment techniques. Notably, Qwen-7B exhibits strong tool-use and planning abilities, making it suitable for developing intelligent agent applications. It also includes specialized versions for coding (Code-Qwen) and mathematics (Math-Qwen), showcasing improved performance in these domains compared to other open-source models.

## Supported Environments

| Iluvatar GPU | IXUCA SDK |
|--------------|-----------|
| MR-V100      | 4.2.0     |

## Model Preparation

### Prepare Resources

```bash
cd ${DeepSparkInference}/models/nlp/large_language_model/qwen-7b/vllm
mkdir -p data/qwen
ln -s /path/to/Qwen-7B ./data/qwen
```
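If the Qwen-7B weights are not yet present on the machine, they can be fetched from a model hub before creating the symlink. Below is a minimal sketch using `huggingface_hub`; the `Qwen/Qwen-7B` repository id and the target directory are assumptions, so substitute whatever weight source and path you actually use.

```python
# Hypothetical sketch: download Qwen-7B weights with huggingface_hub.
# The repo id "Qwen/Qwen-7B" and the local directory are assumptions; adjust as needed.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="Qwen/Qwen-7B",        # assumed Hugging Face repository id
    local_dir="/path/to/Qwen-7B",  # same placeholder path as the symlink source above
)
```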

### Install Dependencies

To run the model smoothly, you need to obtain the SDK from the resource center of the Iluvatar CoreX official website.

```bash
# Install libGL
## CentOS
yum install -y mesa-libGL
## Ubuntu
apt install -y libgl1-mesa-glx

# Contact the Iluvatar manager to get adapted install packages of vllm, triton, and ixformer
pip3 install vllm
pip3 install triton
pip3 install ixformer
```
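After installation, a quick sanity check confirms that the adapted packages import cleanly. This is only a sketch; the exact version strings reported depend on the Iluvatar-adapted builds you were given.

```python
# Sanity check: verify the adapted packages can be imported in the current environment.
import vllm
import triton
import ixformer  # adapted Iluvatar package; imported only to confirm availability

print("vllm:", vllm.__version__)
print("triton:", triton.__version__)
```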

## Model Inference

```bash
export CUDA_VISIBLE_DEVICES=0,1
python3 offline_inference.py --model ./data/qwen/Qwen-7B-Chat --max-tokens 256 -tp 2 --trust-remote-code --temperature 0.0
```
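For reference, the command above roughly corresponds to the following vLLM offline-inference pattern. This is a hedged sketch, not the repository's `offline_inference.py`; the prompt and model path are placeholders mirroring the CLI flags.

```python
# Sketch of a vLLM offline-inference call mirroring the CLI flags above.
# Not the actual offline_inference.py; prompt and paths are placeholders.
from vllm import LLM, SamplingParams

llm = LLM(
    model="./data/qwen/Qwen-7B-Chat",  # --model
    tensor_parallel_size=2,            # -tp 2
    trust_remote_code=True,            # --trust-remote-code
)
sampling_params = SamplingParams(temperature=0.0, max_tokens=256)

outputs = llm.generate(["请介绍一下通义千问。"], sampling_params)
for output in outputs:
    print(output.outputs[0].text)
```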