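CodeFuse's mission is to develop Code Large Language Models (Code LLMs) designed to support the entire software development lifecycle, covering key stages such as design, requirements, coding, testing, deployment, and operations. We are committed to building innovative solutions that make the development process as smooth as possible for software developers.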
Introduction


Hello World! This is CodeFuse!

The mission of CodeFuse is to develop Code Large Language Models (Code LLMs) specifically designed to support the entire software development lifecycle, covering crucial stages such as design, requirements, coding, testing, deployment, operations, and maintenance. We are passionate about creating innovative solutions that empower developers throughout the software development process.

In this release, we are open-sourcing:

  1. The MFT (Multi-Task Fine-Tuning) framework, known as MFTCoder;
  2. Two datasets for enhancing the coding capabilities of LLMs, namely Code Exercise and Evol-Instruction;
  3. A faster and more reliable deployment framework based on FasterTransformer.

The resulting model ensemble, which includes CodeFuse-13B (ModelScope Repo) and CodeFuse-CodeLlama-34B (ModelScope Repo), supports code-related tasks such as code completion, text-to-code conversion, and unit test generation. In particular, CodeFuse-CodeLlama-34B, which uses CodeLlama as its base model and is fine-tuned with the proposed MFT framework, scores 74.4% pass@1 (greedy decoding) on the HumanEval Python benchmark, surpassing the 67% reported for GPT-4. We plan to add more base LLMs to the ensemble in the near future.
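For readers unfamiliar with the metric, here is a minimal sketch of how a pass@1 score under greedy decoding can be computed. With greedy decoding each problem yields exactly one completion, so pass@1 reduces to the fraction of problems whose single completion passes all of its unit tests. The function name `pass_at_1_greedy` and the boolean input format are assumptions made for illustration; this is not the CodeFuse evaluation harness.

```python
def pass_at_1_greedy(results):
    """Return pass@1 for greedy decoding.

    `results` is a hypothetical list of booleans, one per benchmark problem:
    True if the single greedy completion passed every unit test for that
    problem, False otherwise.
    """
    if not results:
        raise ValueError("results must contain at least one problem")
    # With one sample per problem, pass@1 is just the pass rate.
    return sum(results) / len(results)


# Illustrative input only: HumanEval contains 164 problems, and 122 passing
# problems would correspond to a 74.4% score. The count 122 is back-computed
# from the percentage for illustration; it is not reported in the source.
example = [True] * 122 + [False] * 42
print(f"{pass_at_1_greedy(example):.1%}")
```

When several samples are drawn per problem instead of one greedy completion, an unbiased pass@k estimator is normally used; the simple ratio above applies only to the single-sample case.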

We believe that our solution can significantly enhance the performance of pretrained LLMs across multiple related tasks simultaneously. We are committed to further exploring this direction and providing more open-source contributions. We also encourage engineers and researchers in the community to join us in building CodeFuse.

