UGA Arches UGA Tobacco Documents Project
Rhetorical Cases

R026D Basic Statistics

Text File		Overall	draft	final
Bytes			3,489	1,489	2,000
Tokens			504	216	288
Types			164	109	161
Type/Token Ratio		32.54	50.46	55.90
Standardised Type/Token			
Ave. Word Length		5.30	5.22	5.42
Sentences		31	13	18
Sent.length		13.84	13.77	13.89
sd. Sent. Length		11.50	12.19	11.34
Paragraphs		32	16	16
Para. length		15.75	13.50	18.00
sd. Para. length		17.68	16.59	18.97
Headings			0	0	0
Heading length			
sd. Heading length			
1-letter words		22	7	15
2-letter words		82	41	41
3-letter words		67	27	40
4-letter words		72	33	39
5-letter words		48	20	28
6-letter words		50	23	27
7-letter words		49	20	29
8-letter words		27	8	19
9-letter words		31	14	17
10-letter words		23	10	13
11-letter words		12	5	7
12-letter words		6	2	4
13-letter words		13	6	7
14(+)-letter words	1	0	1


NIH-NCI Tobacco-Documents Project at the University of Georgia (Grant # 1 RO1 CA87490-01). Please contact Cati Brown for more information concerning the rhetorical cases.