UGA Arches UGA Tobacco Documents Project
Rhetorical Cases

R023D Basic Statistics

Text File		Overall	1draft	2draft	3draft
Bytes			69,198	23,422	22,782	22,994
Tokens			10,460	3,478	3,488	3,494
Types			882	857	855	852
Type/Token Ratio		8.43	24.64	24.51	24.38
Standardised Type/Token	38.68	38.70	38.77	38.57
Ave. Word Length		4.95	4.95	4.97	4.96
Sentences		614	200	203	211
Sent.length		15.86	15.99	16.00	15.58
sd. Sent. Length		8.81	8.41	9.06	8.96
Paragraphs		160	55	54	51
Para. length		64.75	63.24	64.59	66.55
sd. Para. length		65.06	65.37	66.25	64.69
Headings			0	0	0	0
Heading length				
sd. Heading length				
1-letter words		476	167	158	151
2-letter words		1,778	586	591	601
3-letter words		1,932	642	644	646
4-letter words		1,550	516	516	518
5-letter words		965	322	322	321
6-letter words		674	223	227	224
7-letter words		944	313	314	317
8-letter words		643	214	214	215
9-letter words		618	205	205	208
10-letter words		436	143	148	145
11-letter words		216	72	71	73
12-letter words		110	37	37	36
13-letter words		67	21	24	22
14(+)-letter words	42	14	14	14

NIH-NCI Tobacco-Documents Project at the University of Georgia (Grant # 1 RO1 CA87490-01). Please contact Cati Brown for more information concerning the rhetorical cases.