UGA Tobacco Documents Project
Rhetorical Cases

R082D Basic Statistics

Text File		Overall	Draft	Final
Bytes			24,424	11,384	13,040
Tokens			3,599	1,659	1,940
Types			676	604	633
Type/Token Ratio (%)	18.78	36.41	32.63
Standardised Type/Token	40.70	40.70	40.70
Mean Word Length		5.31	5.38	5.27
Sentences		120	52	68
Mean Sentence Length	22.73	24.17	21.62
SD Sentence Length	16.16	13.94	17.70
Paragraphs		93	43	50
Mean Paragraph Length	38.70	38.58	38.80
SD Paragraph Length	33.12	35.02	31.75
Headings		0	0	0
Mean Heading Length
SD Heading Length
1-letter words		101	40	61
2-letter words		533	232	301
3-letter words		613	280	333
4-letter words		445	215	230
5-letter words		368	171	197
6-letter words		295	135	160
7-letter words		388	188	200
8-letter words		283	129	154
9-letter words		234	111	123
10-letter words		164	73	91
11-letter words		112	56	56
12-letter words		34	14	20
13-letter words		17	9	8
14(+)-letter words	8	4	4
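
The type/token ratios in the table appear to be the number of types expressed as a percentage of the number of tokens. The sketch below is an illustration under that assumption, not the project's own tooling; the function name type_token_ratio is hypothetical, and the counts are copied from the Types and Tokens rows. The standardised ratio is not reproduced here because it would require the text itself, not just the summary counts.

```python
# (types, tokens) per text file, copied from the table above.
counts = {
    "Overall": (676, 3599),
    "Draft": (604, 1659),
    "Final": (633, 1940),
}


def type_token_ratio(types: int, tokens: int) -> float:
    """Return distinct word forms (types) as a percentage of running words (tokens).

    Assumption: this is how the table's ratio row was derived.
    """
    return 100.0 * types / tokens


for name, (types, tokens) in counts.items():
    print(f"{name}: {type_token_ratio(types, tokens):.2f}")
```

Rounded to two decimals, these reproduce the ratios listed for the overall, draft, and final texts. The overall type count is lower than the sum of the draft and final counts because word forms shared by both versions are counted once.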

NIH-NCI Tobacco-Documents Project at the University of Georgia (Grant # 1 R01 CA87490-01). Please contact Cati Brown for more information about the rhetorical cases.