Temporarily Served by symple.design
uga arches UGA Tobacco Documents Project
Go to Home Page


Input: Create Sub-Corpus

This tool allows you to create a sub-corpus by selecting documents from the entire Tobacco Document Corpus based on categories such as (1) the decade in which the documents were written, (2) the tobacco company which produced the document, (3) whether the target audience was internal or external to the industry, and (4) whether the target audience was a specific person or group, or a generalized audience. Once you have selected a subsample of documents with which to work, this tool also offers you the capability of selecting different components of the texts to display, such as marginalia, emphasis, cross-outs, or headings. Hint: The output is based on the choices you make. It is possible to limit the selection to the point of not getting any documents or text returned. If the output page returns "0" texts, back up and try again with a few more boxes checked. Follow the blue hyperlinks to the Glossary page for additional information.
1. Select Document Types to Include:
  Collection: Audience Type:
    Stratified Random Sample
Ext. Audience Supplemental Sample
Rhetorical Cases Collection
  Named Internal Audience
Named External Audience
Un-Named Internal Audience
Un-Named External Audience
  Decade Group: Industry Source:
    1950
1960
1970
1980
1990
Bliley
19xx
  American Tobacco
Brown and Williamson
Lorillard
Philip Morris
R. J. Reynolds
Council for Tobacco Research
Tobacco Institute
:
2. Select Items to Include:
  Display Metadata:
    Collection
Decade
Industry Source
Audience
Start Bates No.
End Bates No.
Doc. Date
No. Pages
No. Words (Main Text)
Notes
Attorneys' Index Information
  Include Document Components:
    Primary Document Divisions: Secondary Document Divisions:
      Predoc Data
Maindoc Data
Postdoc Data
Appendices
Xdoc Data
  Pre Text
Main Text
Post Text
    Display Text Items
      Display Normative Text
Emphasized Text
Marked Text
Lined-out Text
Inserted text
Titles/Headers
Quotes
Marginalia
Illegible Text
Image Text
  Form Text
Table Text
Image/Form Descriptions
Image/Form Captions
Symbols
Paragraph Breaks
Page Break Data
Footnotes
Footnote Anchors
Xitems
3. Select Output:
  Format: Destination:
    HTML
ASCII (Plain Text) Format
Plain Text with Basic XML
Full XML
  Quick View
View Online*
Download to File*

*Read Instructions Before Acting
4. or

NIH-NCI Tobacco-Documents Project at the University of Georgia (Grant # 1 RO1 CA87490). The scripts run by this server are the invention of Clayton Darwin using Python . All graphical displays are created on the fly using ChartDirector© from ASE .