Conditional Relative Frequency

Frequency is a name given to the count of the number of times an event occurs. Relative frequency is the term used to indicate that the number has been transformed into a proportion expressing it in relation to the grand total of the number of events. The proportion may be expressed as a percentage. The conditional relative frequency rather than the grand total is sometimes used when constructing the proportion in order to make more sense of the data.

In the example (Figure 1), some data have been taken from the Crime Survey for England Teaching Dataset. For simplification, the data have been rounded. The rounded data suggest that we have obtained survey responses from 45,700 people. The table below gives the frequencies for the responses to the question "How safe do you feel walking alone after dark"? The table is split into two rows denoting the responses given by males and females.

Frequency is the number of times a specific response was observed. For example, frequency of responses that stated "very safe" regardless of gender was 15,600. Dividing this number by the overall number of respondents gives the relative frequency of a response stating "very safe" in reply to that question. For these data, the relative frequency is

98418267-96881.jpg

In other words approximately one third of respondents stated they felt "very safe" walking alone after dark. We can also see that the relative frequency for the survey being completed by a male was

98418267-96882.jpg

in other words, somewhat less than half the respondents were male.

Conditional relative frequency means that the statistician "conditions" on an event other than the event someone filled in on a form. Here, it is possible to either condition on the gender of the respondent, or the level of safety they stated they felt when walking alone. Essentially, relative frequencies are calculated using an appropriate total other than the grand total.

If using the gender of the respondent as the condition, only the row of data that applies to that gender is considered. If the condition on the event is "survey completed by male" there are 20,900 respondents. Of these, 10,100 stated that they felt "very safe" walking alone after dark, so the conditional relative frequency is

98418267-96883.jpg

For female responsdents, we obtain the conditional relative frequency

98418267-96884.jpg

Alternatively, we could condition on the event "respondent fills in the survey stating they feel very safe" and find the conditional relative frequency of being male or female. Here we work solely with the column that describes this response. So for relative frequency for males of

98418267-96885.jpg

conditional on reporting they felt safe walking alone at night and for females

98418267-96886.jpg

The question of which is the most appropriate conditional relative frequency depends on the question that is being asked of the data.

Bibliography

Blitzstein, Joseph K., and Jessica Hwang. Introduction to Probability. Boca Raton, FL: Chapman, 2015.

ESDS Government. Crime Survey for England and Wales, 2011-2012: Teaching Dataset. Manchester, UK: U Manchester, 2013.

Forbes, Catherine, et al. Statistical Distributions. Hoboken, NJ: Wiley, 2011.