Matrix Operations

There are three operations we need to perform on matrixes:

pseudoinverse (pinv, ^-1)
transposition (tr, ^T)
multiplication (mul, X)

To handle larger sets of data, these will need to be performed on subsets of the data, stored in scratch space, and then folded into the final resulting matrix. The processing of these subsets can be scheduled weighting cache locality and freshness to minimize cache churn across cores.

CAPS Matrix Multiplication

CAPS is a communication-efficient method of parallizing Strassen's algorithm. It's well-suited to the non-uniform memory and processing architectures we're targeting. It "asymptotically minimizes computa- tional and bandwidth costs over all parallel Strassen-based algorithms. It also minimizes latency cost up to a logarithmic factor in the number of processors."

Strassen divides the matrix into a 7-ary tree, and CAPS recurses the problem tree in either of two ways: breadth-first and depth-first. Breadth-first (BFS) will divide the 7 subproblems across processors, so each processor will work on 1/7 of the problem. Depth-first (DFS) will use all processors on each subproblem in turn. BFS uses more memory while reducing communication, while DFS uses less memory but requires more communication overhead. CAPS minimizes communication costs by choosing an ordering of BFS/DFS that maximizes memory usage.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
bench		bench
datagen		datagen
src/TeraReg		src/TeraReg
tests		tests
validate		validate
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Setup.hs		Setup.hs
terareg.cabal		terareg.cabal

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Matrix Operations

CAPS Matrix Multiplication

Unlimited Memory Scheme

Target

References

About

Releases

Packages

Contributors 2

Languages

License

theunixman/terareg

Folders and files

Latest commit

History

Repository files navigation

Matrix Operations

CAPS Matrix Multiplication

Unlimited Memory Scheme

Target

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages