| Author | Commit | Message | Date |
|---|---|---|---|
| Marco De Lucia | 77c6b84a6f | Commit CUDA code in src/cuda | 2023-10-13 10:09:35 +02:00 |
| Max Lübke | 1091a55b81 | adapt tiled version | 2023-10-12 14:57:26 +02:00 |
| Max Lübke | 493be3576c | use correct compiler | 2023-10-06 15:38:39 +02:00 |
| Max Lübke | 8145d533dd | adapt CMake and Readme | 2023-10-06 15:31:40 +02:00 |
| Max Lübke | 9633302226 | refactor code to compile with Intel oneAPI | 2023-10-06 15:16:32 +02:00 |
| Max Lübke | 3fcd37fb1f | fix headers | 2023-10-06 11:08:11 +02:00 |
| Max Lübke | ca136ac2e6 | update cmake instructions | 2023-10-05 11:40:51 +02:00 |
| Max Lübke | 47331f9b00 | rework Data section | 2023-10-05 11:38:22 +02:00 |
| Max Lübke | 3598bb7642 | add prerequisites to readme | 2023-10-05 11:25:12 +02:00 |
| Max Lübke | 69350034eb | Extend Readme | 2023-10-05 11:21:17 +02:00 |
| Max Lübke | 6bac8f5a22 | add more matrices | 2023-10-05 10:52:27 +02:00 |
| Max Lübke | 5bcb1c54e3 | add comments | 2023-10-05 10:51:58 +02:00 |
| Max Lübke | a156e04a22 | Add bigger matrices with dimensions of 2^13*2^13 | 2023-10-05 08:58:46 +02:00 |
| Max Lübke | 45e621f2b3 | reintroduce template functions | 2023-10-05 08:32:43 +02:00 |
| Max Lübke | c9c0daa098 | Add source of prevPower function | 2023-10-05 08:22:08 +02:00 |
| Max Lübke | 505d0cb9da | add data | 2023-10-04 19:49:06 +02:00 |
| Max Lübke | 0e6e99c5dd | add matmul tiled example | 2023-10-04 19:47:58 +02:00 |
| Max Lübke | 86842a6a15 | Move to src folder | 2023-10-04 19:47:20 +02:00 |
| Max Lübke | 3850cd355d | no need to check for validity of thread ids | 2023-10-04 11:52:33 +02:00 |
| Max Lübke | d0673e61ca | add omp/gpu naive and transposed code | 2023-10-04 09:54:15 +02:00 |
| Max Lübke | d8e4d200f0 | add macro to print results to console | 2023-10-04 09:16:30 +02:00 |
| Max Lübke | 9b9167e810 | fix indexing | 2023-10-04 09:16:14 +02:00 |
| Max Lübke | 93257b1efc | add function to return bytes needed for matrix | 2023-10-02 13:19:06 +02:00 |
| Max Lübke | 8bc248c8e5 | rename data to mem | 2023-10-02 13:08:41 +02:00 |
| Max Lübke | 4f92c2a976 | implement CPU only functions | 2023-10-02 12:59:02 +02:00 |
| Max Lübke | 381fa26aac | add hashfunction for checksum calculation | 2023-10-02 12:58:45 +02:00 |
| Max Lübke | 2097c0fe36 | add instrumentation for runtime measurement | 2023-10-02 12:58:12 +02:00 |
| Max Lübke | 0751b7734c | change indices from col,row to row,col | 2023-10-02 11:18:56 +02:00 |
| Max Lübke | 1d56c491d4 | update gitignore | 2023-10-02 11:03:57 +02:00 |
| Max Lübke | 6698c82691 | provide matrix struct with basic operations | 2023-10-02 09:51:49 +02:00 |
| Max Lübke | db765fff8d | Initial commit | 2023-09-29 17:16:41 +02:00 |