Commit Graph

276 Commits

Author SHA1 Message Date
af42b353c4 Use acc::maybeConjugate for cpu code 2023-01-26 01:50:53 +01:00
e4f326e394 Fix the reordering kernel in cuda 2023-01-26 01:50:34 +01:00
93cba3c3ab Implement zeroing of tensors through memcpy and cuMemcpy 2023-01-26 01:49:55 +01:00
4e2d1143e5 Add getSize static method to calculate the size of sources in SliceUnion 2023-01-25 16:25:09 +01:00
Ania Brown
933d556c84 Fix the reordering kernel in Equations 2023-01-25 13:53:22 +01:00
c7e3fa45bd Add old version of energies and only generate code for doubles 2023-01-25 12:50:19 +01:00
Anna Brown
2b8b3bd421 don't copy self sufficient slices when sources on gpu 2023-01-23 09:03:16 -08:00
122329eca7 Fix zeroing 2023-01-23 17:01:29 +01:00
Ania Brown
58c0bf078e Zero Tijk correctly in CPU code 2023-01-23 16:58:08 +01:00
3fe15e5e5c Fix bs and ths error in equations 2023-01-23 16:57:07 +01:00
0d223e6ed9 Fix vector types for energy in cpu 2023-01-23 14:44:54 +01:00
c8bdc4239f Fix an odd character in the warmup 2023-01-23 14:43:17 +01:00
Ania Brown
be96e4bf8c 1.syntax error fix 2.allocate temporary buffers only once per sim 2023-01-23 14:30:11 +01:00
Anna Brown
9003c218a3 don't need to copy to separate mpi_data array on the host when sources are resident on gpu 2023-01-23 14:25:25 +01:00
Ania Brown
4af47a0bb7 Initialize sources on gpus when ATRIP_SOURCES_IN_GPU 2023-01-23 14:21:51 +01:00
Ania Brown
9a5a2487be Add warmup in the SliceUnion 2023-01-23 13:46:20 +01:00
c4ec227185 Clean getEnergyDistinct 2023-01-13 16:59:19 +01:00
1ceb4cf0d6 Fix maybeConjugate cuda scope 2023-01-13 12:08:54 +01:00
34a4e79db0 Initial compiling implementation of the energy kernel 2023-01-13 11:33:42 +01:00
249f1c0b51 Add raven modules for cuda 2023-01-04 15:23:36 +01:00
1d96800d45 Add support for reading tensors from file in atrip bench 2022-12-06 21:20:03 +01:00
9087e3af19 Update workflows 2022-12-06 20:58:32 +01:00
418fd9d389 Add simple cuda bench configuration 2022-12-06 20:57:34 +01:00
895cd02778 Add some documentation about running the benches 2022-12-06 20:38:57 +01:00
8efa3d911e Add --max-iterations to main bench 2022-12-06 20:38:38 +01:00
0fa24404e5 Improve the documentation in the readme for benches building 2022-12-06 14:17:53 +01:00
8f7d05efda Add Building information and building for sources on GPU 2022-12-06 13:26:44 +01:00
ad542fe856 Add the slicing into the GPU 2022-12-05 21:16:30 +01:00
658397ebd7 Update in SliceUnion ATRIP_SOURCES_IN_GPU 2022-12-05 17:55:23 +01:00
26e2f2d109 Add ATRIP_SOURCES_IN_GPU and ATRIP_CUDA_AWARE_MPI defines in configure 2022-12-05 17:49:54 +01:00
871471aae3 Fix naive-tuples experimentation bench 2022-10-18 16:23:43 +02:00
65a64f3f8c Test on all pushes 2022-10-08 16:05:40 +02:00
4f9f09e965 Cleanup flags handling in configure 2022-10-08 16:04:55 +02:00
6dc943e10a Clean up DatabaseCommunicator 2022-10-08 16:04:37 +02:00
ed347ab0d9 Rename NAIVE_SLOW into ATRIP_NAIVE_SLOW 2022-10-08 16:04:03 +02:00
8c5c47e208 Add atrip-def m4 macro 2022-10-08 16:03:08 +02:00
6871372cac Add support for only calculating DGEMM parts ATRIP_ONLY_DGEMM 2022-10-08 16:02:49 +02:00
452c0fe001 Fix test_main name in automake 2022-10-08 15:59:48 +02:00
b636b89a64 Add configure-benches script 2022-10-08 15:59:25 +02:00
e59d298a01 Rename test_main.cxx to main.cxx 2022-10-08 00:17:57 +02:00
Gallo Alejandro
6143d1ae73 Rename test_main to atrip 2022-10-08 00:13:40 +02:00
Gallo Alejandro
3addd86826 Create a barrier before the iterations start 2022-10-07 22:51:43 +02:00
3ddd507c17 Fix a bug in the building the abc tuples fakely 2022-10-07 12:23:10 +02:00
ae6736fc21 Implement a faster version of naive computation 2022-10-07 03:19:24 +02:00
d5cfe31b12 Add naive tuples scracth file 2022-10-07 01:11:22 +02:00
ddb4574380 Fix DatabaseCommunicator 2022-10-06 12:27:00 +02:00
118df09128 Add tentative DatabaseCommunicator 2022-10-06 01:10:06 +02:00
1e391e3749 Update tuples-distribution script 2022-10-06 01:07:53 +02:00
7734efeb97 Add tuples distribution bench 2022-10-03 17:13:21 +02:00
fa1a29c583 Create an implementation file of the Tuples 2022-10-03 17:11:49 +02:00