[OT] Weighted distribution of Numbers
hh
hh at hyperhh.de
Mon Aug 5 08:47:33 EDT 2019
When computing limits for distribution categories given
frequencies the following may be useful:
A number q is a p%-quantile of a data set
If the percentage of data nums <= q is >= p%
and the percentage of data nums >= q is >= (100-p)%
For each percentage p there is an interval
[lowerV,upperV] so that each number from that interval
is p%-quantile of the data set.
To make it unique some use in case lowerV < upperV the
average of lowerV and upperV.
-- d=data in lines, sorted ascending numeric
-- p=percentage (num in range 0-100)
function quantile p,d
put the num of lines of d into N
put N*p/100 into m0
put line ceil(N*p/100) of d into lowerV
put line N+1 - ceil(N*(100-p)/100) of d into upperV
-- return avg(lowerV,upperV) --> unique variant
if lowerV=upperV then return lowerV
else return lowerV,upperV
end quantile
For example quantile(50,d) returns the median of a data set,
quantile(25,d), quantile(50,d), quantile(75,d) the quartiles.
More information about the use-livecode
mailing list