DataLoader/MX Reference Manual (G06.24+)
Running DataLoader/MX
DataLoader/MX Reference Manual—525872-002
3-14
Analyzing Your Configuration
more import processes. Configuring and tuning your loading application are
discussed next.
You can create customized versions of DataLoader/MX with combinations of user
routines appropriate to what you want to do. As you customize each version with the
appropriate user exits, you should give these versions names that indicate their
purpose or position in the application hierarchy.
Configuration Recommendations
Many factors influence the design of your loading application, but first consider two
factors in your initial configuration decisions:
•
The data source. Your data can come from a single source (a tape drive) or from
multiple sources (multiple drives or communication lines). In either case, exactly
one DataLoader/MX process is required for each data source.
•
The data destination. Usually, your data is to be loaded into multiple partitions of an
SQL table using import. When this is the case, you must configure exactly one
import process for each partition, as two import processes writing to the same
partition would cause locking errors. To ensure this, the KEYRANGE values
specified for DataLoader/MX must match the table KEYRANGE values exactly.
These next suggestions will help when your requirements differ from those listed
previously, or when you are refining and tuning your initial configuration:
•
Loading with SQL INSERT in DataLoader/MX. You can write your own user exits to
perform table inserts or deletes in the DataLoader/MX process. In this case, you
need not maintain a one-to-one relationship between processes and partitions.
•
Load balancing. Under some conditions, you might be able to split or combine the
work done by the loading processes for better balance. You will need to experiment
with this strategy. For example, you can start with one data source and one
DataLoader/MX process feeding into multiple downstream processes. You might
discover that, if you use two or more DataLoader/MX processes for input, your load
becomes better balanced and throughput improves.
•
IPC costs. When splitting the load by adding processes, be aware of increased IPC
costs. With too many processes, your configuration can reach a point of
diminishing returns, and performance can actually decrease. Another important
consideration is that managing fewer processes is easier.
In addition to these guidelines, examples of the four basic loading configurations are
provided in Section 6, DataLoader/MX Examples. These scenarios are the basis for
nearly every loading situation you will encounter.
Analyzing Your Configuration
After your configuration is defined, you should analyze and measure it for performance
by running DataLoader/MX and analyzing the results of a few simple experiments.