User Guide

Chapter 3

How are your cases represented? Controls how the Text Wizard determines where

each case ends and the next one begins.

 Each line represents a case. Each line contains only one case. It is fairly common

for each case to be contained on a single line (row), even though this can be a

very long li

ne for data files with a large number of variables. If not all lines

contain the same number of data values, the number of variables for each case is

determined by the line with the greatest number of data values. Cases with fewer

data value

s are assigned missing values for the additional variables.

 A specifi

c number of variables represents a case.

The specified number of variables

for each case tells the Text Wizard where to stop reading one case and start

reading the next. Multiple cases can be contained on the same line, and cases can

start in th

e middle of one line and be continued on the next line. The Text Wizard

determines the end of each case based on the number of values read, regardless

of the number of lines. Each case must contain data values (or missing values

indicate

d by delimiters) for all variables, or the data file will be read incorrectly.

How many cases do you want to import? You can import all cases in the data file, the

first n cases (n is a number you specify), or a random sample of a specified percentage.

Since th

e random sampling routine makes an independent pseudo-random decision

for each case, the percentage of cases selected can only approximate the specified

percentage. The more cases there are in the data file, the closer the percentage of

cases se

lected is to the specified percentage.