|
|
af42b353c4
|
Use acc::maybeConjugate for cpu code
|
2023-01-26 01:50:53 +01:00 |
|
|
|
e4f326e394
|
Fix the reordering kernel in cuda
|
2023-01-26 01:50:34 +01:00 |
|
|
|
93cba3c3ab
|
Implement zeroing of tensors through memcpy and cuMemcpy
|
2023-01-26 01:49:55 +01:00 |
|
|
|
4e2d1143e5
|
Add getSize static method to calculate the size of sources in SliceUnion
|
2023-01-25 16:25:09 +01:00 |
|
Ania Brown
|
933d556c84
|
Fix the reordering kernel in Equations
|
2023-01-25 13:53:22 +01:00 |
|
|
|
c7e3fa45bd
|
Add old version of energies and only generate code for doubles
|
2023-01-25 12:50:19 +01:00 |
|
Anna Brown
|
2b8b3bd421
|
don't copy self sufficient slices when sources on gpu
|
2023-01-23 09:03:16 -08:00 |
|
|
|
122329eca7
|
Fix zeroing
|
2023-01-23 17:01:29 +01:00 |
|
Ania Brown
|
58c0bf078e
|
Zero Tijk correctly in CPU code
|
2023-01-23 16:58:08 +01:00 |
|
|
|
3fe15e5e5c
|
Fix bs and ths error in equations
|
2023-01-23 16:57:07 +01:00 |
|
|
|
0d223e6ed9
|
Fix vector types for energy in cpu
|
2023-01-23 14:44:54 +01:00 |
|
|
|
c8bdc4239f
|
Fix an odd character in the warmup
|
2023-01-23 14:43:17 +01:00 |
|
Ania Brown
|
be96e4bf8c
|
1.syntax error fix 2.allocate temporary buffers only once per sim
|
2023-01-23 14:30:11 +01:00 |
|
Anna Brown
|
9003c218a3
|
don't need to copy to separate mpi_data array on the host when sources are resident on gpu
|
2023-01-23 14:25:25 +01:00 |
|
Ania Brown
|
4af47a0bb7
|
Initialize sources on gpus when ATRIP_SOURCES_IN_GPU
|
2023-01-23 14:21:51 +01:00 |
|
Ania Brown
|
9a5a2487be
|
Add warmup in the SliceUnion
|
2023-01-23 13:46:20 +01:00 |
|
|
|
c4ec227185
|
Clean getEnergyDistinct
|
2023-01-13 16:59:19 +01:00 |
|
|
|
1ceb4cf0d6
|
Fix maybeConjugate cuda scope
|
2023-01-13 12:08:54 +01:00 |
|
|
|
34a4e79db0
|
Initial compiling implementation of the energy kernel
|
2023-01-13 11:33:42 +01:00 |
|
|
|
249f1c0b51
|
Add raven modules for cuda
|
2023-01-04 15:23:36 +01:00 |
|
|
|
1d96800d45
|
Add support for reading tensors from file in atrip bench
|
2022-12-06 21:20:03 +01:00 |
|
|
|
9087e3af19
|
Update workflows
|
2022-12-06 20:58:32 +01:00 |
|
|
|
418fd9d389
|
Add simple cuda bench configuration
|
2022-12-06 20:57:34 +01:00 |
|
|
|
895cd02778
|
Add some documentation about running the benches
|
2022-12-06 20:38:57 +01:00 |
|
|
|
8efa3d911e
|
Add --max-iterations to main bench
|
2022-12-06 20:38:38 +01:00 |
|
|
|
0fa24404e5
|
Improve the documentation in the readme for benches building
|
2022-12-06 14:17:53 +01:00 |
|
|
|
8f7d05efda
|
Add Building information and building for sources on GPU
|
2022-12-06 13:26:44 +01:00 |
|
|
|
ad542fe856
|
Add the slicing into the GPU
|
2022-12-05 21:16:30 +01:00 |
|
|
|
658397ebd7
|
Update in SliceUnion ATRIP_SOURCES_IN_GPU
|
2022-12-05 17:55:23 +01:00 |
|
|
|
26e2f2d109
|
Add ATRIP_SOURCES_IN_GPU and ATRIP_CUDA_AWARE_MPI defines in configure
|
2022-12-05 17:49:54 +01:00 |
|
|
|
871471aae3
|
Fix naive-tuples experimentation bench
|
2022-10-18 16:23:43 +02:00 |
|
|
|
65a64f3f8c
|
Test on all pushes
|
2022-10-08 16:05:40 +02:00 |
|
|
|
4f9f09e965
|
Cleanup flags handling in configure
|
2022-10-08 16:04:55 +02:00 |
|
|
|
6dc943e10a
|
Clean up DatabaseCommunicator
|
2022-10-08 16:04:37 +02:00 |
|
|
|
ed347ab0d9
|
Rename NAIVE_SLOW into ATRIP_NAIVE_SLOW
|
2022-10-08 16:04:03 +02:00 |
|
|
|
8c5c47e208
|
Add atrip-def m4 macro
|
2022-10-08 16:03:08 +02:00 |
|
|
|
6871372cac
|
Add support for only calculating DGEMM parts ATRIP_ONLY_DGEMM
|
2022-10-08 16:02:49 +02:00 |
|
|
|
452c0fe001
|
Fix test_main name in automake
|
2022-10-08 15:59:48 +02:00 |
|
|
|
b636b89a64
|
Add configure-benches script
|
2022-10-08 15:59:25 +02:00 |
|
|
|
e59d298a01
|
Rename test_main.cxx to main.cxx
|
2022-10-08 00:17:57 +02:00 |
|
Gallo Alejandro
|
6143d1ae73
|
Rename test_main to atrip
|
2022-10-08 00:13:40 +02:00 |
|
Gallo Alejandro
|
3addd86826
|
Create a barrier before the iterations start
|
2022-10-07 22:51:43 +02:00 |
|
|
|
3ddd507c17
|
Fix a bug in the building the abc tuples fakely
|
2022-10-07 12:23:10 +02:00 |
|
|
|
ae6736fc21
|
Implement a faster version of naive computation
|
2022-10-07 03:19:24 +02:00 |
|
|
|
d5cfe31b12
|
Add naive tuples scracth file
|
2022-10-07 01:11:22 +02:00 |
|
|
|
ddb4574380
|
Fix DatabaseCommunicator
|
2022-10-06 12:27:00 +02:00 |
|
|
|
118df09128
|
Add tentative DatabaseCommunicator
|
2022-10-06 01:10:06 +02:00 |
|
|
|
1e391e3749
|
Update tuples-distribution script
|
2022-10-06 01:07:53 +02:00 |
|
|
|
7734efeb97
|
Add tuples distribution bench
|
2022-10-03 17:13:21 +02:00 |
|
|
|
fa1a29c583
|
Create an implementation file of the Tuples
|
2022-10-03 17:11:49 +02:00 |
|