Autovectorization

This simple code demonstrates autovectorization for a vector addition routine.

Building

There are two makefiles in this repo, one for a gcc build and another for an icc build. Ensure that you have the intel compiler loaded into your environment:

setpkgs -a intel_cluster_studio_compiler

To build run:

make -f makefile.gcc

or

make -f makefile.icc

to delete the executable you would then type:

make clean -f makefile.gcc

or

make clean -f makefile.icc

Running

The program accepts one command line argument, the length of the vector. For example:

./vec_add_icc 1000

Exercises

Build the binary with and without vectorization enabled using both icc and gcc. Building with level-three optimization will ensure autovectorization is run for both compilers. Time the execution of both versions of the binary for a vector length of 1000000000.
Compare the walltime with versions of the binary that are built without vectorization enabled. Using optimization level zero will ensure that vectorization is disabled.
Move the actual vector addition step (c = a + b) into a second for loop. With level three optimization enabled, how does the autovectorization behavior compare (read reporting carefully)? How does the walltime compare?
Revert back to the original code. Now start the loop at i=1 (ignore the first element for the vector addition) and add this line to the end of the loop body:

c[i] = c[i-1] + 82.3;

What changes? Why?

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
add.c		add.c
makefile.gcc		makefile.gcc
makefile.icc		makefile.icc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Autovectorization

Building

Running

Exercises

About

Uh oh!

Releases

Packages

Languages

sc3260s16/vectorization

Folders and files

Latest commit

History

Repository files navigation

Autovectorization

Building

Running

Exercises

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages