# Changes between Version 1 and Version 2 of NumberOfEvents

Ignore:
Timestamp:
04/06/12 16:33:03 (8 years ago)
Comment:

--

Unmodified
Removed
Modified
• ## NumberOfEvents

 v1 == Number of events per channel/subprocess == Suppose I generate 50k events, and divide them in two subprocesses with a 50% probability. Naively I would get 25000 +/- sqrt(25000) = 25000 +/- 158 events in the first subprocess and the same for the second subprocess. However, the total number of events is fixed and this makes that the uncertainties are correlated. We have to include a sqrt(2) for the uncertainty

25000 +/- 158/Sqrt = 25000 +/- 112 events per subprocess.

You can easily convince yourself that the uncertainty has to be smaller by taking the limit of 1 subprocess: I always have 50k events, '''without any spread'''.

Now, I looked a bit on wikipedia, and found actually a formula how to compute it in more general cases:

If the total number of events is fixed to be 50000 divided into two sets, the uncertainties in the two sets have to be equal. The distribution over the two sets is given by a binomial (see second bullet point of http://en.wikipedia.org/wiki/Poisson_distribution#Related_distributions ) and we can compute the square root of the variance (http://en.wikipedia.org/wiki/Binomial_distribution#Mean_and_variance ) as

sigma_1 = Sqrt[ 50000 * p_1 * ( 1 - p_1 ) ]
sigma_2 = Sqrt[ 50000 * p_2 * ( 1 - p_2 ) ],

where
p_1 = lambda_1/(lambda_1 + lambda_2)
p_2 = lambda_2/(lambda_1 + lambda_2).

where lambda_j is the variance (i.e. the number of events) in subprocess j. As expected, this indeed gives 112 for sigma_1 and sigma_2 for the example above. Suppose I generate 50k events, and divide them in two subprocesses with a 50% probability. Naively I would get 25000 +/- sqrt(25000) = 25000 +/- 158 events in the first subprocess and the same for the second subprocess. However, the total number of events is fixed and this makes that the uncertainties are correlated. We have to include a sqrt(2) for the uncertainty[[br]][[br]]25000 +/- 158/Sqrt = 25000 +/- 112 events per subprocess.[[br]][[br]]You can easily convince yourself that the uncertainty has to be smaller by taking the limit of 1 subprocess: I always have 50k events, '''without any spread'''.[[br]][[br]]Now, I looked a bit on wikipedia, and found actually a formula how to compute it in more general cases:[[br]][[br]]If the total number of events is fixed to be 50000 divided into two sets, the uncertainties in the two sets have to be equal. The distribution over the two sets is given by a binomial (see second bullet point of http://en.wikipedia.org/wiki/Poisson_distribution#Related_distributions ) and we can compute the square root of the variance (http://en.wikipedia.org/wiki/Binomial_distribution#Mean_and_variance ) as[[br]][[br]]sigma_1 = Sqrt[ 50000 * p_1 * ( 1 - p_1 ) ][[br]]sigma_2 = Sqrt[ 50000 * p_2 * ( 1 - p_2 ) ],[[br]][[br]]where[[br]]p_1 = lambda_1/(lambda_1 + lambda_2)[[br]]p_2 = lambda_2/(lambda_1 + lambda_2).[[br]][[br]]where lambda_j is the variance (i.e. the number of events) in subprocess j. As expected, this indeed gives 112 for sigma_1 and sigma_2 for the example above.

This can generalize this to more subprocesses. For example, if we have 5 channels/subprocesses with the following number of events:

lambda_1 = 20310, Sqrt = 143
lambda_2 = 13360, Sqrt = 116
lambda_3 = 8384, Sqrt[ 8384] = 92
lambda_4 = 5009, Sqrt[ 5009] = 71
lambda_5 = 2963, Sqrt[ 2963] = 54.

The p's are now given by

p_j = lambda_j/(lambda_1 + ... + lambda_5)

so the RMS of in the number of events per sample has to be:

sigma_1 = 110
sigma_2 = 99
sigma_3 = 84
sigma_4 = 67
sigma_5 = 53.

Which is indeed smaller than the square root of the number of events.
[[br]]This can generalize this to more subprocesses. For example, if we have 5 channels/subprocesses with the following number of events:[[br]][[br]]lambda_1 = 20310, Sqrt = 143[[br]]lambda_2 = 13360, Sqrt = 116[[br]]lambda_3 = 8384, Sqrt[ 8384] = 92[[br]]lambda_4 = 5009, Sqrt[ 5009] = 71[[br]]lambda_5 = 2963, Sqrt[ 2963] = 54.[[br]][[br]]The p's are now given by[[br]][[br]]p_j = lambda_j/(lambda_1 + ... + lambda_5)[[br]][[br]]so the RMS of in the number of events per sample has to be:[[br]][[br]]sigma_1 = 110[[br]]sigma_2 = 99[[br]]sigma_3 = 84[[br]]sigma_4 = 67[[br]]sigma_5 = 53.[[br]][[br]]Which is indeed smaller than the square root of the number of events. -- Main.RikkertFrederix - 2012-01-20