R语言参数自抽样法Bootstrap：估计MSE、经验功效、杰克刀Jackknife、非参数自抽样法可视化|电子爱好者

admin管理员组
文章数量:1660165

最近我们被客户要求撰写关于抽样法的研究报告，包括一些图形和统计输出。

参数引导：估计 MSE

统计学问题：级别(k\)修剪后的平均值的MSE是多少？

我们如何回答它：估计从标准柯西分布（t 分布 w/df = 1）生成的大小为 20 的随机样本的水平 \(k\) 修剪均值的 MSE。目标参数 \(\theta\) 是中心或中位数。柯西分布不存在均值。在表中总结 MSE 的估计值 \(k = 1, 2, ... 9\)。

参数自抽样法：经验功效计算

统计问题：随着零假设与现实之间的差异发生变化，功效如何变化？

我们如何回答：绘制 t 检验的经验功效曲线。

t 检验的原假设是。另一种选择是。

您将从具有的正态分布总体中抽取大小为 20 的样本。您将使用 0.05 的显着性水平。

显示当总体的实际平均值从 350 变为 650（增量为 10）时，功效如何变化。

y 轴是经验功效（通过 bootstrap 估计），x 轴是 \(\mu\) 的不同值（350、360、370 … 650）。



    x <- rnorm(n, mean = muA, sd = sigma) #抽取平均值=450的样本
    ts <- t.test(x, mu = mu0) #对无效的mu=500进行t检验
    ts$p.value

参数自抽样法：经验功效计算

统计问题：样本量如何影响功效？

我们如何回答：创建更多的功效曲线，因为实际均值在 350 到 650 之间变化，但使用大小为 n = 10、n = 20、n = 30、n = 40 和 n = 50 的样本生成它们。同一图上的所有 5 条功效曲线。


pvals <- replicate(m, pvalue())
power <- mean(pvals <= 0.05)


points(sequence,final2[2,],col="red",pch=1)

points(sequence,final2[3,],col="blue",pch=2)

参数自抽样法：经验置信水平

统计问题：在制作 95% CI 时，如果我们的样本很小并且不是来自正态分布，我们是否仍有 95% 的置信度？

我们如何回答它：根据样本为总体的平均值创建一堆置信区间 (95%)。

您的样本大小应为 16，取自具有 2 个自由度的卡方分布。

找出未能捕捉总体真实均值的置信区间的比例。（提醒：自由度为 \(k\) 的卡方分布的平均值为 \(k\)。）


for(i in 1:m){
  samp=rchisq(n,df=2)
  mean=mean(samp)
  sd=sd(samp)
  upper=mean+qt(0.975,df=15)*sd/4

非参数自抽样法置信区间

统计问题：基于一个样本，我们可以为总体相关性创建一个置信区间吗？

我们如何回答：为相关统计量创建一个 bootstrap t 置信区间估计。


boot.ti <-
  function(x, B = 500, R = 100, level = .95, stattic){
    
    x <- as.matrix(x)
 
library(boot)       #for boot and boot.ci

data(law, package = "bootstrap")

dat <- law

ci <- boot.t.ci(dat, statistic = stat, B=2000, R=200)
ci

自抽样法后的Jackknife

统计问题：R 的标准误差的 bootstrap 估计的标准误差是多少？

我们如何回答它： data(law) 像上一个问题一样使用。在 bootstrap 后执行 Jackknife 以获得标准误差估计的标准误差估计。（bootstrap 用于获得总体中 R 的 SE 的估计值。然后使用折刀法获得该 SE 估计值的 SE。）


indices <- matrix(0, nrow = B, ncol = n)

# 进行自举
for(b in 1:B){
    i <- sample(1:n, size = n, replace = TRUE)
    LSAT <- law$LSAT[i]
 
#  jackknife

for(i in 1:n){
    keepers <- function(k){
         !any(k == i)   
    }

自测题

Submit the rendered HTML file. Make sure all requested output (tables, graphs, etc.) appear in your document when you submit.

Parametric Bootstrap: Estimate MSE

Statistical question: What is the MSE of a level \(k\) trimmed mean?

How we can answer it: Estimate the MSE of the level \(k\) trimmed mean for random samples of size 20 generated from a standard Cauchy distribution (t-distribution w/df = 1). The target parameter \(\theta\) is the center or median. The mean does not exist for a Cauchy distribution. Summarize the estimates of MSE in a table for \(k = 1, 2, ... 9\).

Parametric Bootstrap: Empirical Power Calculations

Statistical question: How does power change as the difference between the null hypothes and the reality changes?

How we can answer it: Plot an empirical power curve for a t-test.

The null hypothesis of the t-test is \(\mu = 500\). The alternative is \(\mu \ne 500\).

You will draw samples of size 20, from a normally distributed population with \(\sigma = 100\). You will use a significance level of 0.05.

Show how the power changes as the actual mean of the population changes from 350 to 650 (increments of 10).

On the y-axis will be the empirical power (estimated via bootstrap) and the x-axis will be the different values of \(\mu\) (350, 360, 370 … 650).

Parametric Bootstrap: Empirical Power Calculations

Statistical question: How does sample size affect power?

How we can answer it: Create more power curves as the actual mean varies from 350 to 650, but produce them for using samples of size n = 10, n = 20, n = 30, n = 40, and n = 50. Put all 5 power curves on the same plot.

Parametric Bootstrap: Empirical Confidence Level

Statistical question: When making a 95% CI, are we still 95% confident if our samples are small and do not come from a normal distribution?

How we can answer it: Create a bunch of Confidence Intervals (95%) for the mean of a population based on a sample.

\[\bar{x} \pm t^{*} \times \frac{s}{\sqrt{n}}\]

Your samples should be of size 16, drawn from a chi-squared distribution with 2 degrees of freedom.

Find the proportion of Confidence Intervals that fail to capture the true mean of the population. (Reminder: a chi-squared distribution with \(k\) degrees of freedom has a mean of \(k\).)

Non Parametric Bootstrap Confidence Interval

Statistical question: Based on one sample, can we create a confidence interval for the correlation of the population?

How we can answer it: Create a bootstrap t confidence interval estimate for the correlation statistic.

Jackknife after bootstrap

Statistical question: What is the standard error of the bootstrap estimate of the standard error of R?

How we can answer it: Use data(law) like the previous problem. Perform Jackknife after bootstrap to get a standard error estimate of the standard error estimate. (The bootstrap is used to get an estimate of the SE of R in the population. The jackknife is then used to get an SE of that SE estimate.)

本文标签：杰克参数功效语言经验

版权声明：本文标题：R语言参数自抽样法Bootstrap：估计MSE、经验功效、杰克刀Jackknife、非参数自抽样法可视化内容由热心网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：https://m.elefans.com/dongtai/1729850814a1215383.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

更多相关文章

xp系统

电子爱好者 - 最新技术资讯及电子产品介绍！

R语言参数自抽样法Bootstrap：估计MSE、经验功效、杰克刀Jackknife、非参数自抽样法可视化

相关视频：什么是Bootstrap自抽样及应用R语言线性回归预测置信区间实例

参数自抽样法：经验功效计算

参数自抽样法：经验功效计算

统计问题：样本量如何影响功效？

参数自抽样法：经验置信水平

统计问题：在制作 95% CI 时，如果我们的样本很小并且不是来自正态分布，我们是否仍有 95% 的置信度？

非参数自抽样法置信区间

统计问题：基于一个样本，我们可以为总体相关性创建一个置信区间吗？

自抽样法后的Jackknife

统计问题：R 的标准误差的 bootstrap 估计的标准误差是多少？

自测题

Submit the rendered HTML file. Make sure all requested output (tables, graphs, etc.) appear in your document when you submit.

更多相关文章

Mangos某人经验

基于R语言的seasonal包使用手册_08.identify(x,type=c(“ao“,“tc“,“ls“),...)

十二代 i7-12700 和 i5-12600K 参数对比

i7 1165g7和i5 10200h 参数对比哪个好

i5-10400F和i5-9400F 参数对比 有什么区别

i9 13900hx参数 i9 13900hx功耗 酷睿i913900hx相当于台式机什么cpu

CPU的重要参数

i7 12700T参数 i7 12700T怎么样

Ubuntu中的MySQL重新卸载安装过程（是新手，但是整个好几个小时，说一下自己成功地经验，便于后续查看）

手机、浏览器的分辨率、状态栏参数

linux打开gaussian16软件,win平台下最新版Gaussian16使用经验分享

C语言学习——sprintf函数详细解释及其用法

CPUFREQ 参数解释

ROS中move_base功能包参数配置：Dijsktra+Dwa

ML之XGBoost：XGBoost参数调优的优秀外文翻译—《XGBoost中的参数调优完整指南(带python中的代码)》(二)

人工智能 | 搭建企业内部的大语言模型系统

LViT：语言与视觉Transformer在医学图像分割

机器学习之参数估计

Simulink Design Optimization的参数估计（续）

MeanShift参数含义

发表评论

推荐文章

使用Arduino、DHT11温湿度传感器 和 ESP-01S 实现在乐为物联上传输数据

Android Studio模拟器太卡怎么办？看这里！！！

android 开发者模式 手机变慢,手机太卡？手机中的“开发者模式”你会用吗？试一下，瞬间流畅！...

使用基于Linux的文件系统进行重复数据删除

戴尔外星人m15r2 m15r3 m15r4 m15r5 m15r6 m15r7原厂出厂恢复系统带F12 Support Assist OS Recovery恢复功能

热门文章

在线制作微信跳转浏览器下载app打开指定页面源码

移动端开发-体检预约

最新织梦CMS程序 小黑屋QQ技术导航新增手机版源码分享

KMS工具使用方法

linux上的手机管家,lvse手机管家

去哪里找英语实时翻译软件？这里有你要的答案

TCP在FIN_WAIT1状态到底能持续多久以及TCP假连接问题

Github优秀Android开源项目,值得引用与学习（图文结合~~~）

Git学习——删除文件

ArduPilot开源代码之MatekSys Optical Flow 3901-L0X

最新文章

三星U盘格式化后数据不见了？3个方法帮您找回珍贵文件

格式化后数据恢复全解析

华恒2410常见问题

Windows Mobile平台智能系统存储器ROM和RAM解释

移动硬盘加密

联想e480一键恢复小孔_联想自带一键恢复没用了怎么处理

如何恢复U盘里格式化数据？别慌，有带图详细步骤！

ubuntu2

转载：基于AT91RM9200与LINUX2.6.26内核的嵌入式平台开发全过程

ArchLinux 2009.08 硬盘安装

u盘格式化后数据能恢复吗？这四款工具别错过！

u盘快速格式化后怎么恢复文件：深入解析与全面指南

授之以鱼不如授之以渔！五分钟教会您手工查杀***！

|--------硬件故障专题--------| 主板.CPU.硬盘.内存.显卡.声卡

s3c2410 一些移植常见问题

小米手机肿么还原时钟

15000流明是多少瓦

一般普通投影机功率多大?

苹果绿联转换器有些投影机不能用

坚果V9投影机具体参数?

有关九年级作文850字精选

80后90后_高一作文

中级卫生专业资格中医全科学主治医师中级模拟题2021年(9)案与解析

(精品)师范大学招考硕士研究生课程八六0试卷

ZXMVC8900(V3

【模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313】模拟人生4（The Sims 4）性感露背黑色亮片礼服MOD V20190313 官方免费下载

i5-10400F和i5-9400F 参数对比有什么区别

i9 13900hx参数 i9 13900hx功耗酷睿i913900hx相当于台式机什么cpu

使用Arduino、DHT11温湿度传感器和 ESP-01S 实现在乐为物联上传输数据

android 开发者模式手机变慢,手机太卡？手机中的“开发者模式”你会用吗？试一下，瞬间流畅！...

最新织梦CMS程序小黑屋QQ技术导航新增手机版源码分享

【鬼泣5（Devil May Cry V）v1.0十四项修改】鬼泣5（Devil May Cry V）v1.0十四项修改官方免费下载