6.1 Metaphor and quantitative evidence

6.1.1 Coding

One of the main challenges concerning metaphor in corpus linguistics is that it is hard, and sometimes impossible, to extract metaphorical uses automatically. The following list highlights the major implications of this.

Manual coding is time-consuming
Manual coding is error prone; requires rigorous operationalization
Frequencies of metaphorical uses are often dwarfed by non-metaphorical uses
There is often no way to distinguish “literal” and “metaphorical”

6.1.2 Homework discussion

Before I get into a summary of the main indicators we’ve used so far, here is a disclaimer first. The data from Deignan (2006) was very limited. The frequencies are extremely low and all the transformations and visualizations you can apply can easily become misleading. Generally, I advice against over-analyzing low counts. The purpose of the homework was to make you experiment on a very simple data set rather than to learn something new about the data.

6.1.3 Frequency and scales

Absolute frequency
- Basic measure
- Should always be reported since everything else is based on it
- Sometimes hard to visualize
- Hard to interpret across different sample or category sizes
Relative frequency
- Absolute frequency divided by all occurrences
- Either between 0 and 1 or 0% and 100%
- Makes it possible to compare between different sized samples or sub-categories
- extremely low relative frequency is sometimes reported as normalized frequency, e.g. 1 per Million vs. 0.000001 vs. 0.0001%

6.1.4 Frequency and scales

Log scale
- Most commonly base 10, i.e. 1 to 10 is the same distance as 10 to 100, 100 to 1,000, etc.
- Uses:
1. Visualize heavily skewed data
2. Make exponential data linear (e.g. word counts)
3. Approximate human perception of quantities
Pie charts
- variant of stacked bar chart
- becomes hard to interpret with many categories

6.1.5 How extreme are the differences?

If we take into account the proportional nature of our perception, we might get a better picture of frequency differences on a log scale, which essentially emphasizes proportional differences rather than absolute ones. If you compare the two graphs below you might not change your conclusion about the data, but the felt difference between the categories might be much smaller than you would think. As stated above, these considerations are to be taken with a grain of salt on such small data sets.

References

Deignan, Alice. 2006. The grammar of linguistic metaphors. In Anatol Stefanowitsch & Stefan Th. Gries (eds.), Corpus-based approaches to metaphor and metonymy, 106–122. Mouton de Gruyter.