Fuyu-8B (vLLM)

Model Description

Fuyu-8B is a multi-modal text and image transformer trained by Adept AI.

Architecturally, Fuyu is a vanilla decoder-only transformer: there is no image encoder. Instead, image patches are linearly projected directly into the first layer of the transformer, bypassing the embedding lookup. The transformer decoder is simply treated like an image transformer, albeit one with no pooling and with causal attention.
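
The idea of projecting patches straight into the decoder's input can be illustrated with a small NumPy sketch. This is not Adept's implementation: the patch size, hidden size, vocabulary size, and random weights are toy values chosen for illustration, the function names are hypothetical, and details of the real model (for example how rows of patches are delimited) are omitted.

```python
import numpy as np


def patchify(image: np.ndarray, patch: int) -> np.ndarray:
    """Split an (H, W, C) image into flattened, non-overlapping patches.

    Returns an array of shape (num_patches, patch * patch * C) in raster order.
    """
    h, w, c = image.shape
    if h % patch or w % patch:
        raise ValueError("image height and width must be multiples of the patch size")
    rows, cols = h // patch, w // patch
    # (rows, patch, cols, patch, C) -> (rows, cols, patch, patch, C)
    grid = image.reshape(rows, patch, cols, patch, c).transpose(0, 2, 1, 3, 4)
    return grid.reshape(rows * cols, patch * patch * c)


def project_patches(patches: np.ndarray, weight: np.ndarray, bias: np.ndarray) -> np.ndarray:
    """Apply one linear layer to each flattened patch (no separate image encoder)."""
    return patches @ weight + bias


# Toy sizes for illustration only; the real model uses much larger dimensions.
rng = np.random.default_rng(0)
patch_size, hidden, vocab = 4, 16, 32

image = rng.random((8, 12, 3))  # H=8, W=12, RGB
weight = rng.standard_normal((patch_size * patch_size * 3, hidden)) * 0.02
bias = np.zeros(hidden)

# Image path: linear projection, bypassing the embedding lookup.
patch_embeds = project_patches(patchify(image, patch_size), weight, bias)

# Text path: ordinary embedding-table lookup.
embedding_table = rng.standard_normal((vocab, hidden)) * 0.02
token_ids = np.array([3, 7, 1])
text_embeds = embedding_table[token_ids]

# Both kinds of input become one sequence for the decoder-only transformer.
sequence = np.concatenate([patch_embeds, text_embeds], axis=0)
print(sequence.shape)
```

Because both paths end in vectors of the same width, the decoder sees a single sequence and needs no image-specific layers beyond the projection.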

Supported Environments

GPU	IXUCA SDK	Release
MR-V100	4.2.0	25.03

Model Preparation

Prepare Resources

Model: https://huggingface.co/adept/fuyu-8b

cp -r ../../vllm_public_assets/ ./

# Download the model from the URL above and place it at "data/fuyu-8b"
mkdir -p data/
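
Before running inference, it may help to confirm that the download landed in the expected directory. The following is a minimal sketch using only the standard library; the helper name is hypothetical, and the assumption that a usable checkpoint contains at least a `config.json` should be adjusted to match the files you actually downloaded.

```python
from pathlib import Path

# Assumption: a Hugging Face style checkpoint directory contains config.json.
# Extend this tuple with the weight and tokenizer files you expect.
EXPECTED_FILES = ("config.json",)


def missing_model_files(model_dir="data/fuyu-8b", expected=EXPECTED_FILES):
    """Return the expected file names that are absent from model_dir."""
    root = Path(model_dir)
    if not root.is_dir():
        # Nothing was downloaded to this path, so every file is missing.
        return list(expected)
    return [name for name in expected if not (root / name).is_file()]
```

For example, `missing_model_files()` returns an empty list when `data/fuyu-8b/config.json` exists, and `["config.json"]` otherwise.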

Install Dependencies

To run the model, first obtain the SDK from the resource center on the Iluvatar CoreX official website.

# Install libGL
## CentOS
yum install -y mesa-libGL
## Ubuntu
apt install -y libgl1-mesa-glx
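
To check whether the libGL install took effect, a short Python probe can ask the system loader for the library. This is only a sketch: the helper name is hypothetical, and `ctypes.util.find_library` may return `None` on some systems even when the library is present in a non-standard location.

```python
import ctypes.util


def find_libgl():
    """Return the libGL name found by the system loader, or None if not found."""
    return ctypes.util.find_library("GL")


if find_libgl() is None:
    print("libGL not found; install it with the yum or apt command for your system")
else:
    print("libGL found:", find_libgl())
```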

Model Inference

export VLLM_ASSETS_CACHE=../vllm/
python3 offline_inference_vision_language.py --model ./data/fuyu-8b --max-tokens 256 -tp 2 --trust-remote-code --temperature 0.0
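
For readers who want to see roughly what such a script does, here is a minimal sketch of offline image-and-text inference through vLLM's Python API, using the same settings as the command above. It is not the contents of `offline_inference_vision_language.py`: the function names are hypothetical, the image path and question are placeholders you supply, and the prompt format (the question followed by a newline) is an assumption based on vLLM's public Fuyu example. It also assumes `vllm` and `Pillow` are installed.

```python
def build_prompt(question: str) -> str:
    """Format a question for Fuyu (assumed format: question plus a newline)."""
    return f"{question}\n"


def run_fuyu(image_path, question, model_dir="./data/fuyu-8b",
             max_tokens=256, tensor_parallel_size=2, temperature=0.0):
    """Generate an answer about one image; mirrors the command-line flags above."""
    # Imported lazily so this module can be loaded on machines without vLLM.
    from PIL import Image
    from vllm import LLM, SamplingParams

    llm = LLM(
        model=model_dir,
        tensor_parallel_size=tensor_parallel_size,  # -tp 2
        trust_remote_code=True,                     # --trust-remote-code
    )
    params = SamplingParams(temperature=temperature, max_tokens=max_tokens)
    image = Image.open(image_path).convert("RGB")

    outputs = llm.generate(
        {"prompt": build_prompt(question), "multi_modal_data": {"image": image}},
        params,
    )
    return outputs[0].outputs[0].text
```

A call such as `run_fuyu("example.jpg", "What is in this image?")` would load the model across two devices and return the generated text; with temperature 0.0 decoding is greedy, so repeated runs should give the same answer.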
