UGA Arches UGA Tobacco Documents Project
Rhetorical Cases

Basic Statistics for Rhetorical Case R092A

Text File		Overall	Booklet	Internal
					Report

Bytes			30,387	13,710	16,677
Tokens			4,409	2,173	2,236
Types			1,229	756	696
Type/Token Ratio		27.87	34.79	31.13
Standardised Type/Token	41.92	43.85	40.00
Ave. Word Length		5.16	4.74	5.60
Sentences		280	145	135
Sent.length		12.90	13.86	11.87
sd. Sent. Length		9.42	8.77	9.99
Paragraphs		218	85	133
Para. length		19.99	24.95	16.81
sd. Para. length		23.67	24.09	22.93
Headings			0	0	0
Heading length			
sd. Heading length			
1-letter words		142	57	85
2-letter words		645	343	302
3-letter words		752	421	331
4-letter words		680	425	255
5-letter words		447	236	211
6-letter words		382	192	190
7-letter words		429	172	257
8-letter words		308	115	193
9-letter words		235	91	144
10-letter words		156	51	105
11-letter words		123	47	76
12-letter words		47	7	40
13-letter words		51	13	38
14(+)-letter words	8	2	6


NIH-NCI Tobacco-Documents Project at the University of Georgia (Grant # 1 RO1 CA87490-01). Please contact Cati Brown for more information concerning the rhetorical cases.