1. Framework
1) two distributions of augmentations
2)
3)
4)
5) q uses the same architecture as g (FC + BN + ReLU + FC)
6) The exponential moving average parameter τ starts from τ_base = 0.996 and is increased to one during training, following τ = 1 − (1 − τ_base) · (cos(πk/K) + 1) / 2, with k the current training step and K the maximum number of training steps (see the sketch below).
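A minimal sketch of items 5) and 6), assuming PyTorch (the paper's own implementation is not reproduced here). The MLP dimensions (2048 → 4096 → 256) and τ_base = 0.996 come from the paper; the names and everything else are illustrative.

```python
import math
import torch
import torch.nn as nn

class MLP(nn.Module):
    """Architecture shared by the projector g and the predictor q: FC + BN + ReLU + FC.
    For g, in_dim is the encoder output size (2048 for ResNet-50); for q, in_dim = out_dim = 256."""
    def __init__(self, in_dim=2048, hidden_dim=4096, out_dim=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, hidden_dim),
            nn.BatchNorm1d(hidden_dim),
            nn.ReLU(inplace=True),
            nn.Linear(hidden_dim, out_dim),
        )

    def forward(self, x):
        return self.net(x)

def tau_schedule(k, K, tau_base=0.996):
    """Cosine schedule: tau = 1 - (1 - tau_base) * (cos(pi * k / K) + 1) / 2,
    going from tau_base at step k = 0 to 1 at step k = K."""
    return 1.0 - (1.0 - tau_base) * (math.cos(math.pi * k / K) + 1.0) / 2.0

@torch.no_grad()
def ema_update(target_net, online_net, tau):
    """Target update xi <- tau * xi + (1 - tau) * theta; no gradients flow into the target."""
    for p_t, p_o in zip(target_net.parameters(), online_net.parameters()):
        p_t.mul_(tau).add_(p_o, alpha=1.0 - tau)
```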
2. Intuitions on BYOL's behavior
Why does BYOL not converge to a collapsed constant representation, even though such a representation would trivially minimize its loss?
1) BYOL's updates to the target parameters ξ are not in the direction of ∇_ξ L^BYOL_{θ,ξ}; there is therefore no a priori reason why BYOL's parameters would converge to a minimum of L^BYOL_{θ,ξ}.
2) assuming BYOL's predictor to be optimal, the online update follows the gradient of the expected conditional variance of the target projection given the online projection (made precise below)
====>
discarding information from the online projection can only increase this conditional variance, so a collapsed constant projection cannot be optimal; hence our hypothesis that these collapsed constant equilibria are unstable.
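The optimal-predictor step referenced above, written out (notation as in the BYOL paper; the result is restated here, not re-derived):

```latex
% With an optimal predictor, the prediction equals the conditional expectation of the
% target projection, and the online update follows the gradient of a conditional variance.
q^{\star}(z_\theta) = \mathbb{E}\!\left[ z'_\xi \mid z_\theta \right],
\qquad
\nabla_\theta \, \mathbb{E}\!\left[ \bigl\| q^{\star}(z_\theta) - z'_\xi \bigr\|_2^2 \right]
  = \nabla_\theta \, \mathbb{E}\!\left[ \sum_i \operatorname{Var}\!\left( z'_{\xi,i} \mid z_\theta \right) \right].
```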
3. Building intuitions with ablations
batch size
Performance only drops for smaller batch sizes, due to the batch normalization layers in the encoder.
image augmentation
bootstrapping
ablation relating BYOL to contrastive methods
To evaluate the influence of the target network, the predictor, and the coefficient β, we perform an ablation over them.
1) coefficient β
2) target network
Using a target network is beneficial, but it has two distinct effects (stopping the gradient through the prediction targets, and stabilizing the targets with averaging), and we would like to understand which effect the improvement comes from.
Conclusion: making the prediction targets stable and stale is the main cause of the improvement, rather than the change in the objective due to the stop gradient.
3) predictor
In this setup, we remove the exponential moving average (i.e., set τ = 0 over the full training in Eq. 1) and multiply the learning rate of the predictor by a constant λ relative to the learning rate used for the rest of the network; all other hyperparameters are unchanged. As shown in Table 21, sufficiently large values of λ provide a reasonably good level of performance, while the performance sharply decreases as λ shrinks, down to 0.01% top-1 accuracy (no better than random) for λ = 0.
To show that this effect is directly related to a change of behavior in the predictor, and not only to a change of learning rate in any subpart of the network, we perform a similar experiment by using a multiplier λ on the predictor’s learning rate, and a different multiplier µ for the projector.
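A hedged sketch of how such an ablation can be set up with per-module learning rates (assuming PyTorch; `encoder`, `projector`, `predictor`, and the base learning rate are placeholders, not the paper's actual modules or values):

```python
import torch
import torch.nn as nn

# Placeholder modules standing in for BYOL's encoder f, projector g, and predictor q.
encoder = nn.Linear(32, 2048)
projector = nn.Linear(2048, 256)
predictor = nn.Linear(256, 256)

base_lr = 0.2        # illustrative base learning rate
lam, mu = 10.0, 1.0  # predictor / projector learning-rate multipliers being ablated

# One optimizer with per-module parameter groups: the predictor's learning rate is
# multiplied by lambda and the projector's by mu; everything else uses the base rate.
optimizer = torch.optim.SGD(
    [
        {"params": encoder.parameters(),   "lr": base_lr},
        {"params": projector.parameters(), "lr": base_lr * mu},
        {"params": predictor.parameters(), "lr": base_lr * lam},
    ],
    momentum=0.9,
)
```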
Conclusion: one of the contributions of the target network is to maintain a near-optimal predictor at all times.
Optimal linear predictor in closed form
At 300 epochs, when using the closed-form optimal predictor and directly hard-copying the weights of the online network into the target, we obtain a top-1 accuracy of [fill].
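Concretely, an optimal linear predictor in closed form can be obtained by solving a regularized least-squares problem from online projections to target projections, instead of training q by gradient descent. The sketch below computes it from a single batch; the ridge term, the shapes, and the per-batch choice are assumptions for illustration, not details taken from the paper.

```python
import torch

def closed_form_predictor(z, z_target, ridge=1e-6):
    """Return W minimizing ||z @ W.T - z_target||^2 plus a small ridge penalty.

    z:        online projections, shape (batch, d)
    z_target: target projections, shape (batch, d_out)
    """
    d = z.shape[1]
    # Regularized normal equations: (Z^T Z + ridge * I) W^T = Z^T Z'
    gram = z.T @ z + ridge * torch.eye(d, dtype=z.dtype, device=z.device)
    w_t = torch.linalg.solve(gram, z.T @ z_target)
    return w_t.T

# Illustrative usage with random stand-ins for a batch of projections.
z = torch.randn(4096, 256)
z_prime = torch.randn(4096, 256)
W = closed_form_predictor(z, z_prime)
prediction = z @ W.T  # what q would output if q were this closed-form linear map
```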
Network hyperparameters
removing the weight decay in either BYOL or SimCLR leads to network divergence