UGA Arches UGA Tobacco Documents Project
Rhetorical Cases

R027PRA Basic Statistics

Text File		Overall	External Internal
Bytes			13,064	6,814	6,250
Tokens			2,001	1,096	905
Types			637	388	360
Type/Token Ratio		31.83	35.40	39.78
Standardised Type/Token	37.00	37.00	
Ave. Word Length		5.10	4.92	5.46
Sentences		76	44	32
Sent.length		21.79	20.95	22.94
sd. Sent. Length		10.50	10.53	10.52
Paragraphs		57	26	31
Para. length		34.67	41.19	29.19
sd. Para. length		30.61	26.52	33.09
Headings			0	0	0
Heading length			
sd. Heading length			
1-letter words		47	27	20
2-letter words		307	177	130
3-letter words		400	239	161
4-letter words		297	172	125
5-letter words		190	95	95
6-letter words		149	86	63
7-letter words		184	93	91
8-letter words		128	64	64
9-letter words		100	60	40
10-letter words		91	41	50
11-letter words		49	21	28
12-letter words		24	6	18
13-letter words		18	12	6
14(+)-letter words	9	2	7

NIH-NCI Tobacco-Documents Project at the University of Georgia (Grant # 1 RO1 CA87490-01). Please contact Cati Brown for more information concerning the rhetorical cases.