HP MLIB User's Guide Vol. 1 7th Ed.
Chapter 1 Introduction to VECLIB 17
Optimization
SUBROUTINE mlib_routine(...)
ENTRY mlib_routine_(...)
...
And a C version might be:
#undef mlib_routine
#undif mlib_routine_
void mlib_routine (...){
...
}
void mlib_routine_(...){
mlib_routine(...);
}
Optimization
The key computational kernels in VECLIB have been optimized to take full
advantage of both PA-RISC and Itanium tightly integrated architectures.
Optimizations include:
• Instruction scheduling to maximize the number of machine instructions
executed per clock cycle
• Algorithm restructuring to increase the number of computations per
memory access
• Cache management to decrease the number of data cache misses
Refer to “Accessing VECLIB” on page 3 for instructions on linking the desired
library with your program.
Parallel processing
Parallel processing is available on multi-processor HP platforms running the
HP-UX 11i or greater operating system and Linux. These systems can divide a
single computational process into small streams of execution, called threads.
The result is that you can have more than one processor executing on behalf of
the same process. Also, refer to Appendix C, “Parallelized Subprograms,” for a
list of HP VECLIB parallelized subprograms.