75 Commits

Author SHA1 Message Date
Violet
9c5f1ed9fb Handle new attributes in cuDeviceGetAttribute (#383) 2025-06-16 13:20:04 -07:00
Violet
9773d20945 Implement cudart_interface_fn2 (#382) 2025-06-13 14:01:14 -07:00
Violet
1715830d82 Implement cuModuleGetLoadingMode (#381) 2025-06-11 15:54:48 -07:00
Violet
25a9d1c40e Implement runtime_callback_hooks_fn2 (#380) 2025-06-11 15:15:43 -07:00
Violet
62f3e63355 Implement cuGetProcAddress and cuGetProcAddress_v2 (#377) 2025-06-10 16:07:35 -07:00
Andrzej Janik
3361046760 Fix mad.wide, replace external CUDA library in test with our own (#376) 2025-06-09 21:33:18 -07:00
Andrzej Janik
c790ab45ec Redo logging to better log dark API and performance libraries (#372) 2025-06-09 15:29:14 -07:00
Andrzej Janik
3d3e38aadc Fix ROCm 6.4 failures (#364)
Lazy load comgr and dispatch to different code paths based on the name of the comgr .dll/.so
2025-05-02 00:38:22 +02:00
Joëlle van Essen
867e4728d5 LLVM unit tests (#324)
* LLVM unit tests: add assembly files

* LLVM unit tests: first attempt

* LLVM unit tests: fix - parse bitcode in context

* LLVM unit tests: use pretty_assertions for line-by-line diff

* LLVM unit tests: Write IR to file for failed test

* LLVM unit tests: just use the stack

* LLVM unit tests: use MaybeUninit

* LLVM unit tests: add mul24.ll

* LLVM unit tests: Adjustments after review

* LLVM unit tests: Include emit_llvm::Context in emit_llvm::Module

* LLVM unit tests: Fix typo

* LLVM unit tests: Context need not be pub
2025-02-19 21:21:20 +01:00
Andrzej Janik
df5a96d935 Improve build system (#329)
Also fix Dockerfile and Windows build
2025-01-28 01:55:36 +01:00
Andrzej Janik
7ac67a89e9 Enable Geekbench 5 (#304) 2024-12-10 21:48:10 +01:00
Andrzej Janik
7a6df9dcbf Fix host code and update to CUDA 12.4 (#299) 2024-12-02 00:29:57 +01:00
Andrzej Janik
46def3e7e0 Connect new parser to LLVM bitcode backend (#269)
This is very incomplete. Just enough code to emit LLVM bitcode and continue further development
2024-09-13 01:07:31 +02:00
Andrzej Janik
816365e7df Fix shared munging pass and add fix for cuModuleLoadData 2021-09-29 21:49:47 +00:00
Andrzej Janik
9609f86033 Fix minor bugs 2021-09-19 00:39:43 +00:00
Andrzej Janik
ccf3c02ac1 Minor fixes 2021-09-18 01:36:12 +00:00
Andrzej Janik
c37223fe67 Set linking output 2021-09-17 01:25:34 +00:00
Andrzej Janik
62ce1fd3a9 Implement linking 2021-09-16 23:26:02 +00:00
Andrzej Janik
04394dbb04 Oops 2021-09-16 23:13:10 +00:00
Andrzej Janik
314e3dcb49 Add missing V-RAY host functions 2021-09-16 23:11:34 +00:00
Andrzej Janik
986fa49097 Zero out buffer on creation 2021-09-13 23:43:50 +00:00
Andrzej Janik
dbb6f09ffa Continue HIP conversion 2021-09-13 17:59:40 +00:00
Andrzej Janik
2f951fa04c Support more export table functions 2021-09-12 22:10:07 +02:00
Andrzej Janik
5290190727 Add support for a new export table 2021-09-12 15:12:26 +02:00
Andrzej Janik
a71cd44104 HIP conversion part #3 2021-09-05 17:29:33 +02:00
Andrzej Janik
a63f004540 HIP conversion part #2 2021-09-05 14:58:15 +02:00
Andrzej Janik
5ec18f14a1 HIP conversion part #1 2021-08-27 23:04:52 +00:00
Andrzej Janik
4ae7feb93a Start converting host code to HIP 2021-08-27 17:27:48 +00:00
Andrzej Janik
9631a8d242 Don't wait on a blocking event 2021-08-08 17:43:14 +02:00
Andrzej Janik
e2432d0df1 Improve AMD compatibility 2021-08-08 14:26:56 +02:00
Andrzej Janik
043172bd9b Additional options to clang 2021-08-08 02:42:38 +02:00
Andrzej Janik
5969e59aae Explicitly mark input to AMD as bitcode 2021-08-08 02:35:17 +02:00
Andrzej Janik
4b4ba90219 Persist AMD kernels for later debug 2021-08-07 18:10:53 +02:00
Andrzej Janik
44decaf396 Use raw interop for building programs 2021-08-07 16:53:26 +02:00
Andrzej Janik
407664600a Hack to read clang output 2021-08-07 16:16:59 +02:00
Andrzej Janik
3ce6aee65d Try seeking before reading 2021-08-07 16:06:15 +02:00
Andrzej Janik
3070d983ab Take path to llvm-spirv from environment 2021-08-07 15:29:19 +02:00
Andrzej Janik
fefdd528d5 Handle xnack suffix in device name 2021-08-07 15:21:45 +02:00
Andrzej Janik
479014a783 Wire up AMD compilation 2021-08-06 13:19:55 +02:00
Andrzej Janik
5bfc2a56b9 Remove all use of L0 2021-08-06 02:15:57 +02:00
Andrzej Janik
becda31524 Convert OpenCL host code to SVM 2021-08-04 19:34:56 +02:00
Andrzej Janik
638786b0ec Hack enough functionality that AMD GPU code builds 2021-08-03 00:22:47 +02:00
Andrzej Janik
b4de21fbc5 Use calls to OpenCL builtins when translating sregs, do SPIRV->LLVM conversion on every build 2021-08-02 01:04:05 +02:00
Andrzej Janik
8f68287b18 Tune generated code, add a workaround for geekbench 2021-07-25 15:19:43 +02:00
Andrzej Janik
9d4f26bd07 Finish converting to OpenCL 2021-07-22 00:20:33 +02:00
Andrzej Janik
3d2024bf62 Start converting to OpenCL 2021-07-21 01:46:50 +02:00
Andrzej Janik
58fb8a234c Synchronize through barrier 2021-07-06 00:16:46 +02:00
Andrzej Janik
2c6bee4955 Fix overzealous check 2021-07-05 01:14:46 +02:00
Andrzej Janik
5b593ec185 Implement stream-wide event reuse 2021-07-04 19:06:37 +02:00
Andrzej Janik
ce25035051 Use immediate command lists 2021-07-04 16:46:07 +02:00