Small samples warning
In order to follow normal distribution approximations, RaterBayes currently requires at least 10 data points (i.e. ratings) and at least 1 data point for each star rating. These requirements apply to both items A and B, and no result will be returned for queries not meeting them. Features will be developed to work around this limitation and provide output for small samples.
A1 : Number of 1-star reviews for item A
A2 : Number of 2-star reviews for item A
A3 : Number of 3-star reviews for item A
A4 : Number of 4-star reviews for item A
A5 : Number of 5-star reviews for item A
B1 : Number of 1-star reviews for item B
B2 : Number of 2-star reviews for item B
B3 : Number of 3-star reviews for item B
B4 : Number of 4-star reviews for item B
B5 : Number of 5-star reviews for item B
The output is 3 JSON fields giving:
- The estimated posterior probability that item A is truly rated better on average than item B,
- The estimated posterior probability that the difference between A and B is sufficiently small that they can’t be distiguished under the model,
- The estimated posterior probability that item B is truly rated better on average than item A.
Statistial method and modelling data
RaterBayes uses the same modelling approach as the Bigger or False Discovery Rate (BFDR), implemented in the priorsplitteR R package. For details of the statistical approach, simulation experiments, and applications of the BFDR method to human genetic data, please read this preprint.
More documentation will be added shortly on how the model was fitted to data from Amazon ratings. It is hoped that more ratings databases will be added in the future, to allow selection of the most appropriate prior distribution for the user’s rating data.