Commit Graph

206 Commits

Author SHA1 Message Date
Gallo Alejandro
23ad87214f Uncomment everything in Equations, (to test see comments)
For testing, comment out everything
that has REORDER, MAYBE_CONJ and zeroing.
2022-09-12 19:13:45 +02:00
Gallo Alejandro
3d7702d501 Minimal changes in Equations 2022-09-12 19:10:14 +02:00
Gallo Alejandro
c20b9e3bcb Add error checking in Blas.cxx 2022-09-12 19:07:48 +02:00
Gallo Alejandro
da704ad820 Add convenience _FORMAT macro 2022-09-12 18:42:36 +02:00
Gallo Alejandro
f1b2f37fe2 Check CUresult for mpi_data to device 2022-09-12 18:41:37 +02:00
Gallo Alejandro
1cd7bac187 Add macros to check CUerror and cublasStatus_t 2022-09-12 18:36:01 +02:00
Gallo Alejandro
68892d5dd8 Make bootstrap work from anywhere in the project 2022-09-12 18:35:30 +02:00
4277c07cc2 Add memory consumption in bench 2022-09-08 15:44:29 +02:00
0558148937 Fix small syntactic bug 2022-09-08 15:36:10 +02:00
49ff3b377c Add a chrono for mpi memcpy in cuda 2022-09-08 15:27:51 +02:00
00a28c990c Indent more conventionally 2022-09-08 15:27:38 +02:00
2c5a4620ca Lint and tidy up Equations 2022-09-08 13:51:49 +02:00
368c5619cc Fix the autotools atrip_cublas 2022-09-08 13:50:08 +02:00
0b14ac7704 Add bootstrap script 2022-09-06 15:26:28 +02:00
Gallo Alejandro
76a785044d Check ngcards agains ranks per node 2022-08-14 15:36:22 +02:00
Gallo Alejandro
7241bbe9fb Implement reordering on the GPU 2022-08-12 18:32:32 +02:00
Gallo Alejandro
c2e9e930ba Update main Atrip.cxx using several gpus 2022-08-12 18:30:55 +02:00
Gallo Alejandro
b4aef4db9e Fix compilation issues and add KernelSizes 2022-08-12 18:29:21 +02:00
Gallo Alejandro
4651231d3b Update test bench for CUDA 2022-08-12 18:28:20 +02:00
Gallo Alejandro
4101c89907 Improve cuda m4 2022-08-11 13:55:52 +02:00
f06cd7f562 Fix the request free problem 2022-08-08 18:28:51 +02:00
8c04280a65 Fix blas 2022-08-08 18:26:52 +02:00
a30c424781 Fix mkl nix 2022-08-08 18:13:32 +02:00
0b9ec59599 Fix --with-ctf 2022-08-08 18:13:14 +02:00
6b641e8581 Update openblas with debug symbols 2022-08-08 15:32:16 +02:00
Gallo Alejandro
a5b2a74e18 Changes in source files, makes cuda run 2022-08-05 13:42:04 +02:00
e03dd77904 Update nvcc compilation in src 2022-07-29 14:08:11 +02:00
b6ca4c8ff2 Update autoconf tests 2022-07-27 18:05:20 +02:00
ad75a3de13 Add __device__ to some functions 2022-07-26 15:12:09 +02:00
1392e8dc36 Fix some complex pointer errors 2022-07-18 19:27:27 +02:00
e8b733cb1a Fix datafieldtype after doubles 2022-07-18 16:48:57 +02:00
c757c4650c Fix problem with complex numbers 2022-07-14 00:17:24 +02:00
565fb1dcc8 Autotools changes 2022-07-05 12:45:26 +02:00
5765ae0dfb Add more implementation files to config 2022-07-05 12:40:25 +02:00
0f60ffdf0f Initial changes for naive cuda implementation 2022-07-05 12:39:46 +02:00
a4552e2cea Fix the linking of the tests for icc 2022-05-10 12:32:00 +02:00
40dd346f9a Separate building of ctf from the rest 2022-05-10 12:01:33 +02:00
e621bdda30 Try to factor with shell 2022-05-10 11:55:05 +02:00
bea9c7a75e Tangle sources 2022-05-06 13:58:26 +02:00
d76c33f9e8 Correctly set the world for the application 2022-05-06 13:53:51 +02:00
2bf8851962 Add Checkpoint functionality 2022-05-06 13:52:33 +02:00
be39eeb776 Add checkpoint readers and writers and tests 2022-04-26 18:42:29 +02:00
78d67070c0 Fix bench for deleting Vppph 2022-04-05 17:32:21 +02:00
fed19ff52c Implement deleting Vppph 2022-04-05 17:16:11 +02:00
7d37bd9f8b Update configurations with BLAS_LIBS and so on 2022-03-15 23:38:02 +01:00
b54cdc0573 Silence ctf errors 2022-03-15 23:30:28 +01:00
62522d0fa6 Add mor extra m4 packages 2022-03-15 23:30:08 +01:00
1fe8b81bcc Remove check for configure script 2022-03-15 14:57:19 +01:00
fb9fdab703 Update ci for docs with configure 2022-03-15 14:25:31 +01:00
764d27d67c Docs work 2022-03-14 15:39:13 +01:00