LLMs之Nemotron-4：《Nemotron-4 340B Technical Report》翻译与解读|电子爱好者

admin管理员组
文章数量:1532285

LLMs之Nemotron-4：《Nemotron-4 340B Technical Report》翻译与解读

导读：
>> 背景痛点：越来越大的语言模型需要大量高质量数据进行对齐训练，但人工标注数据的成本非常高昂。现有公开数据集已经不足以训练最佳对齐的大型语言模型。
>> 解决方案：NVIDIA开发了一个合成数据生成(SDG)流水线，用于生成大量的高质量训练数据，支持监督微调和偏好微调两种对齐方法。发布了Nemotron-4 340B模型家族，包括Nemotron-4-340B-Base、Nemotron-4-340B-Instruct和Nemotron-4-340B-Reward三种模型。
>> 核心思路步骤：
使用大型指令模型生成多样化的合成提示，涵盖不同任务、话题和指令。
基于提示生成响应和对话数据，使用奖励模型进行质量过滤和偏好排序。
利用经过筛选的高质量合成数据进行监督微调和偏好微调，分别得到Instruct模型和Reward模型。
>> 优势：
开源的大规模语言模型，可促进该领域的研究进展和商业应用。
有效利用合成数据大幅降低了对齐训练的成本。
开源的合成数据生成流水线有助于社区构建定制的训练数据。
总的来说，这项工作提出了一种基于合成数据的高效对齐方法，并发布了一系列优秀的开源大模型，为语言模型的发展做出了重要贡献。

《Nemotron-4 340B Technical Report》翻译与解读

Abstract

4 Conclusion

Figure 6: Percentage of unsafe responses over all model responses in AEGIS safety evaluations. Lower is better.在AEGIS安全评估中，不安全响应占所有模型响应的百分比。越低越好。

《Nemotron-4 340B Technical Report》翻译与解读

地址	论文地址：https://arxiv/abs/2406.11704
时间	2024 年6月17日
作者	NVIDIA

Abstract

We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4- 340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation benchmarks, and were sized to fit on a single DGX H100 with 8 GPUs when deployed in FP8 precision. We believe that the community can benefit from these models in various research studies and commercial applications, especially for generating synthetic data to train smaller language models. Notably, over 98% of data used in our model alignment process is synthetically generated, showcasing the effectiveness of these models in generating synthetic data. To further support open research and facilitate model development, we are also open-sourcing the synthetic data generation pipeline used in our model alignment process.

我们发布了Nemotron-4 340B模型系列，包括Nemotron-4-340B- base, Nemotron-4-340B- instruct和Nemotron-4-340B- reward。我们的模型是根据NVIDIA开放模型许可协议开放获取的，这是一项允许分发、修改和使用模型及其输出的宽松模型许可协议。这些模型在广泛的评估基准上与开放存取模型相比具有竞争力，并且在FP8精度部署时，其尺寸适合单个DGX H100与8个GPU。我们相信社区可以在各种研究和商业应用中受益于这些模型，特别是在生成合成数据以训练较小的语言模型方面。值得注意的是，我们的模型校准过程中使用的98%以上的数据是综合生成的，这表明这些模型在生成综合数据方面是有效的。为了进一步支持开放研究和促进模型开发，我们还开放了模型校准过程中使用的合成数据生成管道的源代码。

4 Conclusion

We present a family of Nemotron-4 340B models: Nemotron-4-340B-Base, Nemotron-4-340B-Instruct and Nemotron-4-340B-Reward. They are provided under a permissive open access license, and we detail their ability across a broad range of tasks. We release the training and inference code for these models. We also provide comprehensive details about our synthetic data generation pipeline and illustrate its effectiveness. We believe these models will stimulate the further development of LLMs and AI applications.

我们提出了Nemotron-4 340B系列模型:Nemotron-4-340B- base, Nemotron-4-340B- instruct和Nemotron-4-340B- reward。它们是在宽松的开放访问许可下提供的，我们详细介绍了它们在广泛任务中的能力。我们发布了这些模型的训练和推理代码。我们还提供了有关合成数据生成管道的详细信息，并说明了其有效性。我们相信这些模型将刺激LLMs和人工智能应用的进一步发展。

Figure 6: Percentage of unsafe responses over all model responses in AEGIS safety evaluations. Lower is better.在AEGIS安全评估中，不安全响应占所有模型响应的百分比。越低越好。

本文标签： Nemotron LLMs Report Technical

版权声明：本文标题：LLMs之Nemotron-4：《Nemotron-4 340B Technical Report》翻译与解读内容由热心网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：https://m.elefans.com/dianzi/1725767373a1041285.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

更多相关文章

xp系统

电子爱好者 - 最新技术资讯及电子产品介绍！

LLMs之Nemotron-4：《Nemotron-4 340B Technical Report》翻译与解读

《Nemotron-4 340B Technical Report》翻译与解读

Abstract

4 Conclusion

Figure 6: Percentage of unsafe responses over all model responses in AEGIS safety evaluations. Lower is better.在AEGIS安全评估中，不安全响应占所有模型响应的百分比。越低越好。

更多相关文章

LLMs之Vicuna：《Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality》翻译与解读

【模型精调LoRA】LoRA 低秩适应微调的工作原理和代码实现示例 What is LoRA? Low-Rank Adaptation for finetuning LLMs EXPLAINED

LLMs：《Building LLM applications for production构建用于生产的LLM应用程序》翻译与解读

LLMs：《BLOOM: A 176B-Parameter Open-Access Multilingual Language Model》翻译与解读

LLMs之Guanaco：《QLoRA：Efficient Finetuning of Quantized LLMs》翻译与解读

LLMs之GLM-130BChatGLM-1：《GLM-130B: AN OPEN BILINGUAL PRE-TRAINED MODEL》翻译与解读

LLMs之Nemotron-4：《Nemotron-4 340B Technical Report》翻译与解读

How to Draw Useful Technical Architecture Diagrams

crash report for adobe photoshop cc 2019

LLMs之RAGLong-Context：《检索增强生成还是长上下文LLMs？一项综合研究与混合方法Retrieval Augmented Generation or Long-Context LL

Internal error. Please report to https:jb.ggidecritical-startup-errors

IDEA启动报错Internal error. Please report to http:jb.ggidecritical-startup-errors

idea启动报错：Internal error. Please report to http:jb.ggidecritical-startup-errors

LLMs：《Efficient And Effective Text Encoding For Chinese Llama And Alpaca—6月15日版本》翻译与解读

Assessment Report Regarding Data Compliance

Failed to upload report - An error has occurred. Please contact your administrator

Sonar ERROR:Failed to upload report - An error has occurred. Please contact your administrator

sonar扫描时报Failed to upload report - An error has occurred. Please contact your administrator

LLMs模型速览（GPTs、LaMDA、GLMChatGLM、PaLMFlan-PaLM、BLOOM、LLaMA、Alpaca）

LLMs之GopherChinchilla：《Training Compute-Optimal Large Language Models》的翻译与解读

发表评论

推荐文章

测试路由器的防火墙配置，wan：入站数据，出站数据，转发

thinkpad e480安装win7

计算机常年开机,电脑长时间开机的危害

搜狗拼音输入法找不到mfc140u.dll怎么办？一文教你快速修复搜狗拼音输入法mfc140u.dll缺失的五种解决方案

在线拍照翻译怎么用

热门文章

uc浏览器黑莓java下载安装_（黑莓软件）黑莓最新版UC浏览器下载安装！8.1最新版UC浏览器！...

几款浏览器兼容性测试工具

error C4668: 没有将“_WIN32_WINNT_WIN10_TH2”定义为预处理器宏，用“0”替换“#if#elif”

AMD兼容好的matlab,是不是AMD的CPU不适合MATLAB？

AMD64和i386的区别

电脑开机运行内存占用过高的解决办法

试水Windows10内置Linux子系统

ubuntu20.04搜狗输入法安装后未显示

Macbook Pro M1(macOS 12.0)读取NTFS移动硬盘方法

Java开发常见面试题详解（LockSupport，AQS，Spring循环依赖，Redis）

最新文章

计算机初级证书 英语怎么说,常见职业资格证书英文翻译（含英语、计算机等）...

统考英语和计算机作弊,统考英语-大学英语b网络统考作弊?

高效翻译工具GPT插件的使用教程

【如何训练一个中英翻译模型】LSTM机器翻译seq2seq字符编码（一）

学术英语理工（第二版）Unit2课文翻译

常见英语人名及其音标和中文翻译

利用Python爬取翻译网站的翻译功能

学术英语理工（第二版）Unit4课文翻译

python爬虫入门——13行代码制作英语翻译器教程，小白入门一点通

全新版大学英语综合教程第四册学习笔记（原文及全文翻译）——8A - In the Jungle（在丛林中）

学术英语理工（第二版）Unit3课文翻译

2016-2020英语四级翻译汇总

学术英语理工（第二版）Unit6课文翻译

这学期她选修了英语 计算机 驾驶三门课程,大一英语翻译答案

学术英语社科Unit8原文翻译

小米手机肿么还原时钟

15000流明是多少瓦

一般普通投影机功率多大?

苹果绿联转换器有些投影机不能用

坚果V9投影机具体参数?

有关九年级作文850字精选

80后90后_高一作文

中级卫生专业资格中医全科学主治医师中级模拟题2021年(9)案与解析

(精品)师范大学招考硕士研究生课程八六0试卷

ZXMVC8900(V3

【模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313】模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313 官方免费下载

【生化危机2：重制版（Resident Evil 2 Remake）克莱尔红头发深色服装MOD】生化危机2：重制版（Resident Evil 2 Remake）克莱尔红头发深色服装MOD 官方免费下载

【模拟人生4（The Sims 4）性感露背深V领吊带裙MOD V20190311】模拟人生4（The Sims 4）性感露背深V领吊带裙MOD V20190311 官方免费下载

【模拟人生4（The Sims 4）科幻风宇宙飞船家庭住宅MOD V20190311】模拟人生4（The Sims 4）科幻风宇宙飞船家庭住宅MOD V20190311 官方免费下载

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改 官方免费下载

如何实现高效的treenode搜索算法

treenode与链表有何本质区别

在哪些场景下应优先考虑使用treenode

treenode在树形结构中的角色是什么

计算机初级证书英语怎么说,常见职业资格证书英文翻译（含英语、计算机等）...

这学期她选修了英语计算机驾驶三门课程,大一英语翻译答案

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改官方免费下载