Baidu UIE: Unified Structure Generation for Universal Information Extraction, a detailed paper walkthrough and related resources


Prompt learning series, the information extraction model UIE: https://mp.weixin.qq/s/0lNUlUF_x95mED5B9iBpGg
Walkthrough by the authors: https://www.bilibili/video/BV19g411Z7rZ/?spm_id_from=autoNext
Walkthrough on bilibili: https://www.bilibili/video/BV1LW4y1U7ch?spm_id_from=333.337.search-card.all.click
Official code: https://github/universal-ie/UIE
Code: https://github/heiheiyoyo/uie_pytorch (paddle)
Paddle usage guide: https://github/PaddlePaddle/PaddleNLP/blob/develop/docs/model_zoo/taskflow.md#%E4%BF%A1%E6%81%AF%E6%8A%BD%E5%8F%96
Other NER models: https://github/z814081807/DeepNER

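I. Overview

II. Related questions

Question 1: What exactly do the three semantic units in UIE mean?

Question 2: How do the three semantic units in UIE relate to the prompt?

Question 3: What is the loss function?

UIE is a generative model, so the loss is cross-entropy.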
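To make the cross-entropy answer concrete, here is a minimal sketch of a sequence-level cross-entropy under teacher forcing. It is not taken from the UIE code base: the function name `sequence_cross_entropy`, the toy three-token vocabulary, and the averaging over target tokens are all assumptions made for illustration.

```python
import math


def sequence_cross_entropy(step_probs, target_ids):
    """Average negative log-likelihood of the gold tokens.

    step_probs[i] is the (hypothetical) model distribution over the
    vocabulary at decoding step i; target_ids[i] is the gold token id.
    """
    if len(step_probs) != len(target_ids):
        raise ValueError("one distribution per target token is required")
    nll = 0.0
    for probs, gold in zip(step_probs, target_ids):
        # Penalize the model by how little mass it put on the gold token.
        nll -= math.log(probs[gold])
    return nll / len(target_ids)


# Toy example: vocabulary of 3 tokens, target sequence of 2 tokens.
probs = [[0.5, 0.25, 0.25], [0.1, 0.8, 0.1]]
loss = sequence_cross_entropy(probs, [0, 1])
print(round(loss, 4))
```

In a real seq2seq setup the distributions would come from the decoder's softmax at each step, with the gold SEL tokens as targets.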

Question 4: How is pre-training done?

Dpair: text-to-structure transformation ability; Drecord: decoding ability; Dtext: semantic encoding ability.

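Question 5: Experimental results

4.2 Supervised learning: results are already good without pre-training, and better still with pre-training.

4.3 Few-shot and low-resource results: these demonstrate UIE's strong general information extraction ability.

4.4 Ablation study
The effect of the different pre-training tasks

The gain from mitigating exposure bias (10-shot)

Question 6: How is the structural schema instructor set up?
Shouldn't each template extract a single relation? It looks as if all the types are flattened and put into the prompt together.

Question 7: How is fine-tuning done?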

III. Detailed notes on the paper

Abstract

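Information extraction uses a different schema for each extraction target.
Contribution: a unified text-to-structure generation method that unifies the IE architecture
and can learn from different knowledge sources at the same time.
Implementation:
a prompt mechanism, the structural schema instructor (SSI)
a large-scale pre-trained text-to-structure model that learns general IE ability
Experimental results:
4 IE tasks, 13 datasets
supervised, low-resource, and few-shot settings
state-of-the-art performance on entity, relation, event, and sentiment extraction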

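1 Introduction

1.1 Drawbacks of existing approaches:

varying targets: (entity, relation, event, sentiment, etc.)
heterogeneous structures: (spans, triplets, records, etc.)
demand-specific schemas
Most current models are task-specialized, which makes it hard to learn IE abilities that transfer across tasks and domains.
Building a specialized model for each IE task is very time-consuming.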

1.2 What IE is

IE: text-to-structure transformations
entity: span structure
event: schema-defined record
These transformations can be decomposed into atomic operations.

1.3 How to turn this into a universal model:

spotting: the types of desirable spans to extract, e.g. person or sentiment entities
associating: the relation types in the schema, e.g. work for
entity extraction: spotting mention spans of entity types
event detection: spotting trigger spans with event types
spotting abilities can be shared between these two tasks
structured extraction language (SEL): casts the different extraction tasks into one common generation format.
structural schema instructor (SSI): a schema-based prompt mechanism that controls which entities and relations to extract and what to generate (what to spot, what to associate, what to generate)

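1.4 How to improve general extraction ability

How to learn general extraction ability: pre-train on large and diverse datasets -> the general extraction ability then adapts better to supervised, low-resource, and few-shot tasks.
Results:
supervised: an improvement of 1.42%
few-shot or low-resource settings: a very large improvement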

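1.5 Contributions:

UIE: a unified extraction framework that adapts to different IE tasks and can jointly learn general extraction ability
a unified structure generation network: encodes heterogeneous structures with the structural extraction language and controls what to spot, which to associate, and which to generate through the structural schema instructor
a large-scale text-to-structure pre-trained extraction model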

2 UIE: Unified Structure Generation for Universal Information Extraction

Instructor: structural schema instructor (SSI), a schema-based prompt mechanism
Structured extraction language (SEL): to uniformly encode heterogeneous extraction structures

2.1 atomic operations

spotting (target information pieces): entities and event trigger words. Spotting indicates locating target information pieces from the sentence, e.g., the entity and the trigger word in the event.
associating: the target entities of a relation, or the roles and arguments of an event. Associating indicates connecting different information pieces based on the desirable associations, e.g., the relation between entity pair or the role between event and its argument.

Advantages

Unifies how IE structures are encoded
Expresses extraction results effectively, so it extends naturally to joint extraction
Reduces decoding complexity

Examples
entity extraction: (SpotName: InfoSpan)
relation extraction and event extraction: (SpotName: InfoSpan (AssoName: InfoSpan), …)
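The SEL patterns above can be illustrated with a small serializer. This is a sketch under assumptions, not the official implementation: the record layout (dicts with `spot`, `span`, and an optional `assos` list) and the function name `linearize_sel` are hypothetical, chosen only to show how spot and association units nest.

```python
def linearize_sel(records):
    """Serialize spot/association records into a SEL-style string.

    Each record is a dict with a spot name, the spotted span, and an
    optional list of (association name, associated span) pairs.
    """
    units = []
    for record in records:
        unit = f"{record['spot']}: {record['span']}"
        for asso_name, asso_span in record.get("assos", []):
            # Associations are nested inside the spot they belong to.
            unit += f" ({asso_name}: {asso_span})"
        units.append(f"({unit})")
    # The whole extraction result is wrapped in one outer pair of parentheses.
    return "(" + " ".join(units) + ")"


records = [
    {"spot": "person", "span": "Steve", "assos": [("work for", "Apple")]},
    {"spot": "organization", "span": "Apple"},
]
sel = linearize_sel(records)
print(sel)
```

A record without associations reduces to the entity-extraction pattern (SpotName: InfoSpan), while a record with associations follows the relation and event pattern.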

2.2.1 SSI

structural schema instructor (SSI): a schema-based prompt mechanism
$y = \mathrm{UIE}(s \oplus x)$
x is the input text, $s = [s_1, \dots, s_{|s|}]$ is the structural schema instructor, and $y = [y_1, \dots, y_{|y|}]$ is a SEL sequence that can be easily converted into the extracted information record
example: [spot] person [spot] company [asso] work for [text]
Role:
effectively guides the generation of SEL in UIE
controls which to spot, which to associate, and which to generate
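The prompt layout in the example can be reproduced with a few lines of string handling. This is a minimal sketch: the helper name `build_ssi` and the sample sentence are assumptions, and a real implementation would operate on tokenizer-level special tokens and not plain strings.

```python
def build_ssi(spot_names, asso_names, text):
    """Concatenate the schema prompt and the input text into one sequence."""
    parts = [f"[spot] {name}" for name in spot_names]
    parts += [f"[asso] {name}" for name in asso_names]
    # The [text] marker separates the schema prefix s from the sentence x.
    parts.append(f"[text] {text}")
    return " ".join(parts)


prompt = build_ssi(
    spot_names=["person", "company"],
    asso_names=["work for"],
    text="Steve became CEO of Apple in 1997.",  # illustrative sentence
)
print(prompt)
```

All spot and association names from the schema are placed in one flat prefix, matching the example; restricting the lists passed in is what narrows the extraction to a subset of types.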

2.2.2 Structure Generation with UIE

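(s + x) => linearized SEL, generated in an auto-regressive style.
End of generation: the eos token
$y_i, h_i^d = \mathrm{Decoder}([H; h_1^d, \dots, h_{i-1}^d])$, where H is the encoder's hidden representation of (s + x)
Can be implemented with BART, T5, or similar models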
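Once the decoder has emitted a linearized SEL string, it still has to be turned back into records. The following is a small recursive-descent parser written for illustration only: the function name `parse_sel` and the output layout are assumptions, and the sketch assumes spans never contain parentheses.

```python
def parse_sel(sel):
    """Parse a linearized SEL string into a list of record dicts."""
    pos = 0

    def peek():
        return sel[pos] if pos < len(sel) else ""

    def parse_unit():
        # Parses "(name: span (child) (child) ...)" starting at "(".
        nonlocal pos
        if peek() != "(":
            raise ValueError(f"expected '(' at position {pos}")
        pos += 1
        start = pos
        while peek() not in ("(", ")", ""):
            pos += 1
        name, _, span = sel[start:pos].partition(":")
        children = []
        while peek() == "(":
            children.append(parse_unit())
            while peek() == " ":
                pos += 1
        if peek() != ")":
            raise ValueError(f"expected ')' at position {pos}")
        pos += 1
        return name.strip(), span.strip(), children

    # The outermost parentheses carry no name; their children are the records.
    _, _, units = parse_unit()
    return [
        {"spot": name, "span": span, "assos": [(a, s) for a, s, _ in children]}
        for name, span, children in units
    ]


parsed = parse_sel("((person: Steve (work for: Apple)) (organization: Apple))")
print(parsed)
```

Malformed output, such as a sequence cut off before its closing parenthesis, raises `ValueError` here; a production decoder would more likely try to recover the well-formed prefix.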

3 pre-training and fine-tuning for UIE

how to pre-train a large-scale UIE model which captures common IE abilities for different IE tasks;
how to adapt UIE to different IE tasks in different settings via quick fine-tuning.

Approach: pre-train on a large corpus first to acquire general extraction ability, then fine-tune on the specific downstream task.

3.1 pre-training corpus construction

Dpair = {(token sequence x, structured record y)}: large-scale parallel text-structure pairs, collected by aligning Wikidata with English Wikipedia. Dpair is used to pre-train the text-to-structure transformation ability of UIE.
Drecord is the structure dataset where each instance is structured record y. We collect structured records from ConceptNet (Speer et al., 2017) and Wikidata. Drecord is used to pre-train the structure decoding ability of UIE.
Dtext is the unstructured text dataset, and we use all plain texts in English Wikipedia. Dtext is used to pre-train the semantic encoding ability of UIE.
In short: Dpair trains the text-to-structure transformation ability, Drecord the decoding ability, and Dtext the semantic encoding ability.

3.2 pre-training

Text-to-Structure Pre-training using Dpair
For example, person and work for is the positive schema in the record “((person: Steve (work for: Apple)))”, and we sample vehicle and located in as the negative schema to construct meta-schema.
(Open question from these notes: what is this step doing, and which ability is it meant to give the model?)
Structure Generation Pre-training with Drecord (decoding ability).
Semantic representation pre-training with Dtext: MLM + span corruption (this gives a fairly large gain) => mitigates catastrophic forgetting of token semantics especially on SPOTNAME and ASSONAME tokens.
Final objective: $\mathcal{L} = \mathcal{L}_{Pair} + \mathcal{L}_{Record} + \mathcal{L}_{Text}$
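The meta-schema construction in the example (positive names from the record plus sampled negatives) can be sketched as follows. The function name `build_meta_schema`, its parameters, and the candidate type lists are assumptions; the paper's actual sampling sizes and strategy are not given in these notes.

```python
import random


def build_meta_schema(record_spots, record_assos, all_spots, all_assos,
                      num_neg_spots=1, num_neg_assos=1, seed=None):
    """Combine the positive schema of a record with sampled negative names."""
    rng = random.Random(seed)
    # Negatives are schema names that do NOT occur in the record.
    neg_spot_pool = sorted(set(all_spots) - set(record_spots))
    neg_asso_pool = sorted(set(all_assos) - set(record_assos))
    neg_spots = rng.sample(neg_spot_pool, min(num_neg_spots, len(neg_spot_pool)))
    neg_assos = rng.sample(neg_asso_pool, min(num_neg_assos, len(neg_asso_pool)))
    return {
        "spots": list(record_spots) + neg_spots,
        "assos": list(record_assos) + neg_assos,
    }


meta = build_meta_schema(
    record_spots=["person"],
    record_assos=["work for"],
    all_spots=["person", "vehicle", "organization"],  # hypothetical type inventory
    all_assos=["work for", "located in"],
    seed=0,
)
print(meta)
```

The resulting names would then be rendered as the SSI prefix for the pre-training pair, so the model sees schema entries with no matching content in the target.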

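3.3 On-Demand Fine-tuning

Dtask = {(s, x, y)} -> cross-entropy
Rejection Mechanism (RM) => mitigates the exposure bias problem
RM: injects noise into the training targets; through RM, UIE learns to reject misleading generations by outputting NULL.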
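One way to picture the noise injection is to add NULL-valued units for schema types that do not occur in the gold record. The sketch below is a guess at the general idea and not the authors' procedure: the `[NULL]` token spelling, the function name `add_rejection_noise`, the probability parameter `p`, and the restriction to spot-level noise are all assumptions.

```python
import random


def add_rejection_noise(gold_sel, schema_spots, gold_spots, p=0.5, seed=None):
    """Append "(spot: [NULL])" units for schema spots absent from the gold record."""
    rng = random.Random(seed)
    noise = [
        f"({spot}: [NULL])"
        for spot in schema_spots
        if spot not in gold_spots and rng.random() < p
    ]
    if not noise:
        return gold_sel
    # gold_sel looks like "((person: Steve ...))": strip the outer parentheses,
    # add the noise units, and wrap everything again.
    inner = gold_sel[1:-1].strip()
    units = ([inner] if inner else []) + noise
    return "(" + " ".join(units) + ")"


noisy = add_rejection_noise(
    "((person: Steve (work for: Apple)))",
    schema_spots=["person", "facility"],
    gold_spots={"person"},
    p=1.0,  # always inject, to make the example deterministic
)
print(noisy)
```

Training on such targets shows the model that a type from the prompt may legitimately have nothing to extract, which is the behaviour the rejection mechanism is after.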

4 Experiments

4.1 dataset

13 IE benchmarks (ACE, CoNLL), 4 well-representative IE tasks
entity extraction, relation extraction, event extraction, structured sentiment extraction
UIE only generates text spans, so for evaluation each span is mapped to offsets by finding its first match in the input.
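The first-match rule for recovering offsets can be sketched at the token level. The function name `first_match_offsets`, the whitespace tokenization, and the inclusive end index are assumptions made for this example.

```python
def first_match_offsets(tokens, span_tokens):
    """Return (start, end) token offsets of the first occurrence of the span.

    The end index is inclusive. Returns None when the span does not
    occur in the sentence.
    """
    n, m = len(tokens), len(span_tokens)
    if m == 0:
        return None
    for start in range(n - m + 1):
        if tokens[start:start + m] == span_tokens:
            return start, start + m - 1
    return None


tokens = "Steve became CEO of Apple in 1997 .".split()
offsets = first_match_offsets(tokens, ["Apple"])
print(offsets)
```

When the same surface form appears more than once in a sentence, this rule always picks the earliest occurrence, which is a known limitation of span-only generation.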

4.2 supervised settings

SEL without pre-training: already at or near state-of-the-art on most datasets
UIE (with pre-training): state-of-the-art across the board, improves 1.42% F1 on average

4.3 Low-resource settings

low-resource: 1/5/10-shot, 1/5/10% ratio
few-shot: sample 1/5/10 sentences for each entity/relation/event/sentiment type
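The k-shot sampling described above can be sketched like this. The dataset layout (dicts with `text` and `types`) and the function name `sample_k_shot` are hypothetical, and the paper's exact sampling procedure may differ.

```python
import random
from collections import defaultdict


def sample_k_shot(examples, k, seed=0):
    """Sample up to k sentences for each type label in the dataset."""
    rng = random.Random(seed)
    by_type = defaultdict(list)
    for idx, example in enumerate(examples):
        for label in set(example["types"]):
            by_type[label].append(idx)
    chosen = set()
    for label in sorted(by_type):
        indices = by_type[label]
        # A type with fewer than k sentences contributes all of them.
        chosen.update(rng.sample(indices, min(k, len(indices))))
    return [examples[i] for i in sorted(chosen)]


dataset = [
    {"text": "Steve works for Apple.", "types": ["person"]},
    {"text": "Tim joined in 1998.", "types": ["person"]},
    {"text": "Apple is based in Cupertino.", "types": ["organization"]},
    {"text": "It rained all day.", "types": []},
]
shots = sample_k_shot(dataset, k=1)
print([example["text"] for example in shots])
```

A sentence that carries several types can be drawn for more than one of them, so the final subset may contain fewer than k times the number of types.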
