Bootstrap

Discussion:

Bootstrap

(too old to reply)

v***@yahoo.fr

2014-03-03 23:15:52 UTC

Hi,
It is me again !

I have 2 questions this time about bootstrap.
Many thanks for your precious help.

1) One way of carrying out the bootstrap is to average equally over all possible bootstrap samples from the original data set (where two bootstrap data sets are different if they have the same data points but in different order). Unlike the usual implementation of the bootstrap, this method has the advantage of not introducing extra noise due to resampling randomly.
To carry out this implementation on a data set with n data points, how many bootstrap data sets would we need to average over?

2) If we have n data points, what is the probability that a given data point does not appear in a bootstrap sample?

Best,

Rich Ulrich

2014-03-05 01:18:59 UTC

Permalink

Post by v***@yahoo.fr
Hi,
It is me again !
I have 2 questions this time about bootstrap.
Many thanks for your precious help.
1) One way of carrying out the bootstrap is to average equally over all possible bootstrap samples from the original data set (where two bootstrap data sets are different if they have the same data points but in different order). Unlike the usual implementation of the bootstrap, this method has the advantage of not introducing extra noise due to resampling randomly.
To carry out this implementation on a data set with n data points, how many bootstrap data sets would we need to average over?

If you are referring to the usual sort of bootstrap,
where N cases are drawn with replacement from the
sample of N, then "all possible samples" is N raised to
the Nth power.

An N of 10 is nearly the max, for modern computers.

Depending on what statistics you are bootstrapping,
you might have to figure what you want to do for
those exceptional samples where the same case is
drawn all 10 times.

Post by v***@yahoo.fr
2) If we have n data points, what is the probability that a given data point does not appear in a bootstrap sample?

The chance that it is not drawn first is (1-1/N).
Ditto, for each next draw; so raise that quantity to N.

--
Rich Ulrich

v***@yahoo.fr

2014-03-06 23:48:02 UTC

Permalink

Post by Rich Ulrich

If you are referring to the usual sort of bootstrap,
where N cases are drawn with replacement from the
sample of N, then "all possible samples" is N raised to
the Nth power.
An N of 10 is nearly the max, for modern computers.
Depending on what statistics you are bootstrapping,
you might have to figure what you want to do for
those exceptional samples where the same case is
drawn all 10 times.

Post by v***@yahoo.fr
2) If we have n data points, what is the probability that a given data point does not appear in a bootstrap sample?

The chance that it is not drawn first is (1-1/N).
Ditto, for each next draw; so raise that quantity to N.
--
Rich Ulrich

Dear Professor Ulrich,

Once more many thanks for your responses.

Best,

w***@gmail.com

2018-11-26 02:52:18 UTC

Permalink

Post by Rich Ulrich

If you are referring to the usual sort of bootstrap,
where N cases are drawn with replacement from the
sample of N, then "all possible samples" is N raised to
the Nth power.
An N of 10 is nearly the max, for modern computers.
Depending on what statistics you are bootstrapping,
you might have to figure what you want to do for
those exceptional samples where the same case is
drawn all 10 times.

Post by v***@yahoo.fr
2) If we have n data points, what is the probability that a given data point does not appear in a bootstrap sample?

The chance that it is not drawn first is (1-1/N).
Ditto, for each next draw; so raise that quantity to N.
--
Rich Ulrich

I dont understand the 2nd part where Professor ulrich said to raise that quantity to N.
I understand that the probability of getting in draw is 1/N and not getting will be 1-1/N
But I dont get what do we mean by raising it to N

David Duffy

2018-11-30 03:29:45 UTC

Permalink

Post by w***@gmail.com

Post by Rich Ulrich

Post by v***@yahoo.fr
2) If we have n data points, what is the probability that a given
data point does not appear in a bootstrap sample?

The chance that it is not drawn first is (1-1/N).
Ditto, for each next draw; so raise that quantity to N.
--
Rich Ulrich

I dont understand the 2nd part where Professor ulrich said to raise
that quantity to N. I understand that the probability of getting in
draw is 1/N and not getting will be 1-1/N But I dont get what do we
mean by raising it to N

How many draws are needed to make one bootstrap sample?