UGA Tobacco Documents Project
Rhetorical Cases

R010D Basic Statistics

Text File                   OVERALL    Draft    Final

Bytes                        53,180   29,897   23,283
Tokens                        5,532    3,055    2,477
Types                           879      868      834
Type/Token Ratio              15.89    28.41    33.67
Standardised Type/Token       41.65    40.55    42.75
Ave. Word Length               4.89     4.85     5.09
Sentences                       236      120      116
Sent. length                  23.39    25.46    21.25
sd. Sent. length              20.24    22.72    17.15
Paragraphs                        0        0        0
Para. length
sd. Para. length
Headings                          0        0        0
Heading length
sd. Heading length
1-letter words                  250      145      105
2-letter words                1,041      625      416
3-letter words                  885      478      407
4-letter words                  775      441      334
5-letter words                  521      275      246
6-letter words                  512      273      239
7-letter words                  525      277      248
8-letter words                  290      150      140
9-letter words                  250      142      108
10-letter words                 176       90       86
11-letter words                 162       85       77
12-letter words                  75       38       37
13-letter words                  38       19       19
14(+)-letter words                6        3        3
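
The Type/Token Ratio row in the table above appears to be the number of types divided by the number of tokens, expressed as a percentage. The sketch below recomputes it from the Types and Tokens rows under that assumption. The function name `type_token_ratio` and the `counts` dictionary are illustrative only and are not part of the project's tooling. The Standardised Type/Token row cannot be recomputed this way, since it depends on the running text rather than on the totals.

```python
def type_token_ratio(types: int, tokens: int) -> float:
    """Return types / tokens as a percentage, rounded to two decimals.

    Assumption: this is how the Type/Token Ratio row was derived.
    """
    if tokens <= 0:
        raise ValueError("tokens must be a positive count")
    return round(100 * types / tokens, 2)


# (types, tokens) copied from the Types and Tokens rows of the table.
counts = {
    "OVERALL": (879, 5532),
    "Draft": (868, 3055),
    "Final": (834, 2477),
}

for name, (types, tokens) in counts.items():
    print(f"{name}: {type_token_ratio(types, tokens):.2f}")
```

Under this assumption the computed values agree with the table: 15.89, 28.41 and 33.67. The overall ratio is much lower than either single-file ratio because the two versions share most of their vocabulary, so combining them nearly doubles the tokens while adding few new types.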

NIH-NCI Tobacco Documents Project at the University of Georgia (Grant # 1 R01 CA87490-01). Please contact Cati Brown for more information about the rhetorical cases.