
(1)Suppose you learn a word embedding for a vocabulary of 10000 words. Then the embedding vectors should be 10000 dimensional, so as to capture the full range of variation and meaning in those words.
[A]True
[B]False

Answer: B
Explanation: Note the difference from a one-hot representation: the embedding dimension (typically a few hundred) is chosen independently of, and is usually far smaller than, the vocabulary size.
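To make the contrast concrete, here is a minimal NumPy sketch (the 300-dimensional embedding size is an assumed example, not something fixed by the question): a one-hot vector must have one component per vocabulary word, while an embedding row is far smaller.

```python
import numpy as np

vocab_size = 10000
embedding_dim = 300   # assumed example; typical embedding sizes are ~50-1000, far below 10000

# One-hot representation: a 10000-dimensional, mostly-zero vector per word.
one_hot_1234 = np.zeros(vocab_size)
one_hot_1234[1234] = 1.0

# Embedding matrix: each word is a dense 300-dimensional row.
E = np.random.randn(vocab_size, embedding_dim)
print(one_hot_1234.shape, E[1234].shape)   # (10000,) (300,)
```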

(2)What is t-SNE?
[A]A linear transformation that allows us to solve analogies on word vectors.
[B]A non-linear dimensionality reduction technique.
[C]A supervised learning algorithm for learning word embeddings.
[D]An open-source sequence modeling library.

Answer: B
Explanation: t-SNE is a non-linear dimensionality reduction algorithm, commonly used to visualize high-dimensional word embeddings in 2-D or 3-D.
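As an illustration, this is roughly how t-SNE is used to project embeddings down to 2-D for plotting (a minimal sketch with scikit-learn; the random matrix is a stand-in for real 300-dimensional embeddings):

```python
import numpy as np
from sklearn.manifold import TSNE

# Random stand-in for 1000 word embeddings of dimension 300.
embeddings = np.random.randn(1000, 300)

# Non-linear reduction to 2 dimensions so the words can be visualized.
tsne = TSNE(n_components=2, perplexity=30, init="pca", random_state=0)
points_2d = tsne.fit_transform(embeddings)
print(points_2d.shape)   # (1000, 2)
```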

(3)Suppose you download a pre-trained word embedding which has been trained on a huge corpus of text. You then use this word embedding to train an RNN for a language task of recognizing if someone is happy from a short snippet of text, using a small training set.

| x (input text) | y (happy?) |
| --- | --- |
| I’m feeling wonderful today! | 1 |
| I’m bummed my cat is ill | 0 |
| Really enjoying this! | 1 |

Then even if the word “ecstatic” does not appear in your small training set, your RNN might reasonably be expected to recognize “I’m ecstatic” as deserving a label y=1.
[A]True
[B]False

Answer: A
Explanation: Positive words end up with similar embedding vectors, so a model trained on top of the embeddings can generalize to positive words (such as "ecstatic") that never appear in the small labeled training set.
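The generalization argument can be sanity-checked with cosine similarity (a minimal sketch; the three-dimensional vectors below are hypothetical stand-ins, not real pre-trained embeddings):

```python
import numpy as np

def cosine_similarity(u, v):
    return np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

# Hypothetical embeddings; in practice these come from the pre-trained word vectors.
word_to_vec = {
    "wonderful": np.array([0.9, 0.8, 0.1]),
    "ecstatic":  np.array([0.8, 0.9, 0.2]),
    "bummed":    np.array([-0.7, -0.8, 0.1]),
}

# "ecstatic" never appears in the tiny training set, but it lies close to "wonderful",
# so a classifier trained on top of the embeddings can still label it as positive.
print(cosine_similarity(word_to_vec["ecstatic"], word_to_vec["wonderful"]))  # high
print(cosine_similarity(word_to_vec["ecstatic"], word_to_vec["bummed"]))     # low (negative)
```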

(4)Which of these equations do you think should hold for a good word embedding? (Check all that apply)
[A] $e_{boy} - e_{girl} \approx e_{brother} - e_{sister}$
[B] $e_{boy} - e_{girl} \approx e_{sister} - e_{brother}$
[C] $e_{boy} - e_{brother} \approx e_{girl} - e_{sister}$
[D] $e_{boy} - e_{brother} \approx e_{sister} - e_{girl}$

Answer: A, C
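Options A and C encode the analogies "boy is to girl as brother is to sister" and "boy is to brother as girl is to sister". A minimal sketch of how such an analogy is checked numerically (the embeddings below are made-up stand-ins):

```python
import numpy as np

def cosine_similarity(u, v):
    return np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

# Made-up embeddings; real ones would come from a trained model.
e = {
    "boy":     np.array([1.0, 0.9, 0.1]),
    "girl":    np.array([1.0, 0.9, 0.9]),
    "brother": np.array([0.2, 0.8, 0.1]),
    "sister":  np.array([0.2, 0.8, 0.9]),
}

# For a good embedding the "gender" direction should be roughly the same in both pairs,
# so the two difference vectors point the same way (option A; option C works analogously).
print(cosine_similarity(e["boy"] - e["girl"], e["brother"] - e["sister"]))   # close to 1
```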

(5)Let $E$ be an embedding matrix, and let $o_{1234}$ be a one-hot vector corresponding to word 1234. Then to get the embedding of word 1234, why don't we call $E^T * o_{1234}$ in Python?
[A] It is computationally wasteful.
[B] The correct formula is $E^T * e_{1234}$.
[C] This doesn't handle unknown words (<UNK>).
[D] None of the above: calling the Python snippet as described above is fine.

Answer: A
Explanation: The one-hot vector is high-dimensional and almost entirely zeros, so multiplying $E$ by $o_{1234}$ is very inefficient; the same result is obtained by simply reading out the corresponding row of $E$.
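In code, the matrix-vector product performs 10000 × 500 multiplications only to select a single row, whereas direct indexing returns the same vector immediately (a minimal NumPy sketch; the shape convention, with words as rows of $E$, is an assumption chosen to match the $E^T * o_{1234}$ formula above):

```python
import numpy as np

vocab_size, embedding_dim = 10000, 500
E = np.random.randn(vocab_size, embedding_dim)   # embedding matrix, one row per word

o_1234 = np.zeros(vocab_size)
o_1234[1234] = 1.0                               # one-hot vector for word 1234

# Mathematically correct but wasteful: almost every multiplication is by zero.
e_1234_slow = E.T @ o_1234

# Equivalent and efficient: just read out row 1234 (what embedding-lookup layers do).
e_1234_fast = E[1234]

print(np.allclose(e_1234_slow, e_1234_fast))     # True
```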

(6)When learning word embeddings, we create an artificial task of estimating $P(target \mid context)$. It is okay if we do poorly on this artificial prediction task; the more important by-product of this task is that we learn a useful set of word embeddings.
[A]True
[B]False

Answer: A
Explanation: The prediction task is deliberately artificial: doing well on it is not the goal. Its purpose is the useful set of word embeddings learned as a by-product.

(7)In the word2vec algorithm, you estimate $P(t|c)$, where $t$ is the target word and $c$ is a context word. How are $t$ and $c$ chosen from the training set? Pick the best answer.
[A] $c$ is the one word that comes immediately before $t$.
[B] $c$ is the sequence of all the words in the sentence before $t$.
[C] $c$ is a sequence of several words immediately before $t$.
[D] $c$ and $t$ are chosen to be nearby words.

Answer: D
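A minimal sketch of what "nearby words" means in skip-gram sampling: pick a context word $c$, then pick the target $t$ uniformly from a small window around it (the window size and sentence are illustrative assumptions):

```python
import random

def sample_context_target(sentence, window=5):
    """Pick a (context, target) pair skip-gram style: t is a word near c."""
    c_idx = random.randrange(len(sentence))
    # Candidate targets: positions within +/- window of the context word, excluding c itself.
    lo, hi = max(0, c_idx - window), min(len(sentence), c_idx + window + 1)
    t_idx = random.choice([i for i in range(lo, hi) if i != c_idx])
    return sentence[c_idx], sentence[t_idx]

words = "I want a glass of orange juice to go along with my cereal".split()
print(sample_context_target(words, window=2))
```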

(8)Suppose you have a 10000 word vocabulary, and are learning 500-dimensional word embeddings. The word2vec model uses the following softmax function:
$$P(t \mid c) = \frac{e^{\theta_t^T e_c}}{\sum_{t'=1}^{10000} e^{\theta_{t'}^T e_c}}$$
Which of these statements are correct? Check all that apply.
[A] $\theta_t$ and $e_c$ are both 500 dimensional vectors.
[B] $\theta_t$ and $e_c$ are both 10000 dimensional vectors.
[C] $\theta_t$ and $e_c$ are both trained with an optimization algorithm such as Adam or gradient descent.
[D] After training, we should expect $\theta_t$ to be very close to $e_c$ when $t$ and $c$ are the same word.

Answer: A, C
Explanation: The embeddings are 500-dimensional, so $\theta_t$ and $e_c$ are both 500-dimensional vectors, and both sets of parameters are learned by an optimization algorithm.
Option D is somewhat debatable; see, for example:
Why does word2vec use 2 representations for each word?
Which matrix in Word2Vec gives the word vectors? (Word2Vec哪个矩阵是词向量?)
Why do CBOW and skip-gram in word2vec learn two sets of word vectors? (word2Vec的CBOW,SKIP-gram为什么有2组词向量?)
My own view is that either the $\theta$ vectors or the $e$ vectors can serve as word vectors; they are simply different representations that capture different features, so their numerical values also differ. "Different representations" is like describing the same circle by its radius of 1 or by its area of $\pi$: the descriptions differ, yet they denote the same circle; or like the same vector written in different bases. "Different features" means that, for the same word, the two vectors pick up different aspects: for the word "juice", one vector might capture that it is a liquid, while the other captures that it is made from fruit. Corrections are welcome.
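To make the shapes concrete, here is a minimal NumPy sketch of the softmax above; $\theta$ is stored as a (10000, 500) matrix with one row per target word and $E$ as a (10000, 500) matrix with one row per context word, so $\theta_t$ and $e_c$ are both 500-dimensional (all values are random stand-ins):

```python
import numpy as np

vocab_size, embedding_dim = 10000, 500
theta = np.random.randn(vocab_size, embedding_dim) * 0.01   # one theta_t per target word
E = np.random.randn(vocab_size, embedding_dim) * 0.01       # one e_c per context word

def p_t_given_c(t, c):
    """Softmax P(t|c) = exp(theta_t . e_c) / sum_{t'} exp(theta_t' . e_c)."""
    e_c = E[c]                       # (500,)
    logits = theta @ e_c             # (10000,): theta_t' . e_c for every candidate t'
    logits -= logits.max()           # subtract the max for numerical stability
    probs = np.exp(logits) / np.exp(logits).sum()
    return probs[t]

print(p_t_given_c(t=42, c=1234))     # one of 10000 probabilities that sum to 1
```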

(9)Suppose you have a 10000 word vocabulary, and are learning 500-dimensional word embeddings. The GloVe model minimizes this objective:
$$\min \sum_{i=1}^{10000} \sum_{j=1}^{10000} f(X_{ij}) \left( \theta_i^T e_j + b_i + b_j' - \log X_{ij} \right)^2$$
Which of these statements are correct? Check all that apply.
[A] $\theta_i$ and $e_j$ should be initialized to 0 at the beginning of training.
[B] $\theta_i$ and $e_j$ should be initialized randomly at the beginning of training.
[C] $X_{ij}$ is the number of times word $i$ appears in the context of word $j$.
[D] The weighting function $f(\cdot)$ must satisfy $f(0) = 0$.

Answer: B, C, D
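A minimal NumPy sketch of evaluating the GloVe objective, mainly to show why $f(0)=0$ matters: pairs with $X_{ij}=0$ would otherwise require $\log 0$, so the weighting function must remove them (the co-occurrence counts, sizes, and the concrete form of $f$ are illustrative assumptions):

```python
import numpy as np

vocab_size, embedding_dim = 50, 8                 # tiny sizes so the sketch runs instantly
rng = np.random.default_rng(0)

X = rng.poisson(1.0, size=(vocab_size, vocab_size)).astype(float)   # fake co-occurrence counts
theta = rng.normal(scale=0.1, size=(vocab_size, embedding_dim))     # random init, not zeros
e = rng.normal(scale=0.1, size=(vocab_size, embedding_dim))
b = rng.normal(size=vocab_size)
b_prime = rng.normal(size=vocab_size)

def f(x, x_max=100.0, alpha=0.75):
    """Weighting function with f(0) = 0, so never-co-occurring pairs contribute nothing."""
    return np.where(x < x_max, (x / x_max) ** alpha, 1.0) * (x > 0)

def glove_objective():
    total = 0.0
    for i in range(vocab_size):
        for j in range(vocab_size):
            if X[i, j] == 0:
                continue              # f(X_ij) = 0 and log(0) is never evaluated
            residual = theta[i] @ e[j] + b[i] + b_prime[j] - np.log(X[i, j])
            total += f(X[i, j]) * residual ** 2
    return total

print(glove_objective())
```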

(10)You have trained word embeddings using a text dataset of $m_1$ words. You are considering using these word embeddings for a language task, for which you have a separate labeled dataset of $m_2$ words. Keeping in mind that using word embeddings is a form of transfer learning, under which of these circumstances would you expect the word embeddings to be helpful?
[A] $m_1 \gg m_2$
[B] $m_1 \ll m_2$

Answer: A
Explanation: Transfer learning helps when there is plenty of data for the task you transfer from (the large unlabeled corpus used to learn the embeddings) and relatively little for the task you transfer to.
