aiwhskruht
9bd8125c53
Implement more CUDA driver API to enable simple cuda-samples ( #405 )
2025-07-08 10:18:30 -07:00
aiwhskruht
8d5b734c30
Add initialized check to protect zluda from cuda driver calls during shutdown ( #404 )
2025-07-07 11:08:09 -07:00
Andrzej Janik
ef0c4afcf9
Run unit tests on every commit ( #401 )
2025-07-03 16:07:00 -07:00
aiwhskruht
d4ad17d75a
Unified fatbin versions behind a single iterator. ( #398 )
2025-06-27 15:56:46 -07:00
Violet
4da3978f94
Implement cuLibraryLoadData ( #388 )
2025-06-18 16:05:53 -07:00
Violet
8ce70c5095
Add integrity_check implementation to ZLUDA ( #387 )
2025-06-17 15:00:10 -07:00
Violet
4d4053194a
Implement runtime_callback_hooks_fn6 ( #386 )
2025-06-16 17:00:47 -07:00
Violet
9c5f1ed9fb
Handle new attributes in cuDeviceGetAttribute ( #383 )
2025-06-16 13:20:04 -07:00
Violet
9773d20945
Implement cudart_interface_fn2 ( #382 )
2025-06-13 14:01:14 -07:00
Violet
1715830d82
Implement cuModuleGetLoadingMode ( #381 )
2025-06-11 15:54:48 -07:00
Violet
25a9d1c40e
Implement runtime_callback_hooks_fn2 ( #380 )
2025-06-11 15:15:43 -07:00
Violet
62f3e63355
Implement cuGetProcAddress and cuGetProcAddress_v2 ( #377 )
2025-06-10 16:07:35 -07:00
Andrzej Janik
3361046760
Fix mad.wide, replace external CUDA library in test with our own ( #376 )
2025-06-09 21:33:18 -07:00
Andrzej Janik
c790ab45ec
Redo logging to better log dark API and performance libraries ( #372 )
2025-06-09 15:29:14 -07:00
Andrzej Janik
3d3e38aadc
Fix ROCm 6.4 failures ( #364 )
Lazy load comgr and dispatch to different code paths based on the name of the comgr .dll/.so
2025-05-02 00:38:22 +02:00
Joëlle van Essen
867e4728d5
LLVM unit tests ( #324 )
* LLVM unit tests: add assembly files
* LLVM unit tests: first attempt
* LLVM unit tests: fix - parse bitcode in context
* LLVM unit tests: use pretty_assertions for line-by-line diff
* LLVM unit tests: Write IR to file for failed test
* LLVM unit tests: just use the stack
* LLVM unit tests: use MaybeUninit
* LLVM unit tests: add mul24.ll
* LLVM unit tests: Adjustments after review
* LLVM unit tests: Include emit_llvm::Context in emit_llvm::Module
* LLVM unit tests: Fix typo
* LLVM unit tests: Context need not be pub
2025-02-19 21:21:20 +01:00
Andrzej Janik
df5a96d935
Improve build system ( #329 )
Also fix Dockerfile and Windows build
2025-01-28 01:55:36 +01:00
Andrzej Janik
7ac67a89e9
Enable Geekbench 5 ( #304 )
2024-12-10 21:48:10 +01:00
Andrzej Janik
7a6df9dcbf
Fix host code and update to CUDA 12.4 ( #299 )
2024-12-02 00:29:57 +01:00
Andrzej Janik
46def3e7e0
Connect new parser to LLVM bitcode backend ( #269 )
This is very incomplete. Just enough code to emit LLVM bitcode and continue further development
2024-09-13 01:07:31 +02:00
Andrzej Janik
816365e7df
Fix shared munging pass and add fix cuModuleLoadData
2021-09-29 21:49:47 +00:00
Andrzej Janik
9609f86033
Fix minor bugs
2021-09-19 00:39:43 +00:00
Andrzej Janik
ccf3c02ac1
Minor fixes
2021-09-18 01:36:12 +00:00
Andrzej Janik
c37223fe67
Set linking output
2021-09-17 01:25:34 +00:00
Andrzej Janik
62ce1fd3a9
Implement linking
2021-09-16 23:26:02 +00:00
Andrzej Janik
04394dbb04
Oops
2021-09-16 23:13:10 +00:00
Andrzej Janik
314e3dcb49
Add missing V-RAY host functions
2021-09-16 23:11:34 +00:00
Andrzej Janik
986fa49097
Zero out buffer on creation
2021-09-13 23:43:50 +00:00
Andrzej Janik
dbb6f09ffa
Continue HIP conversion
2021-09-13 17:59:40 +00:00
Andrzej Janik
2f951fa04c
Support more export table functions
2021-09-12 22:10:07 +02:00
Andrzej Janik
5290190727
Add support for a new export table
2021-09-12 15:12:26 +02:00
Andrzej Janik
a71cd44104
HIP conversion part #3
2021-09-05 17:29:33 +02:00
Andrzej Janik
a63f004540
HIP conversion part #2
2021-09-05 14:58:15 +02:00
Andrzej Janik
5ec18f14a1
HIP conversion part #1
2021-08-27 23:04:52 +00:00
Andrzej Janik
4ae7feb93a
Start converting host code to HIP
2021-08-27 17:27:48 +00:00
Andrzej Janik
9631a8d242
Don't wait on a blocking event
2021-08-08 17:43:14 +02:00
Andrzej Janik
e2432d0df1
Improve AMD compatibility
2021-08-08 14:26:56 +02:00
Andrzej Janik
043172bd9b
Additional options to clang
2021-08-08 02:42:38 +02:00
Andrzej Janik
5969e59aae
Explicitly mark input to AMD as bitcode
2021-08-08 02:35:17 +02:00
Andrzej Janik
4b4ba90219
Persist AMD kernels for later debug
2021-08-07 18:10:53 +02:00
Andrzej Janik
44decaf396
Use raw interop for building programs
2021-08-07 16:53:26 +02:00
Andrzej Janik
407664600a
Hack to read clang output
2021-08-07 16:16:59 +02:00
Andrzej Janik
3ce6aee65d
Try seeking before reading
2021-08-07 16:06:15 +02:00
Andrzej Janik
3070d983ab
Take path to llvm-spirv from environment
2021-08-07 15:29:19 +02:00
Andrzej Janik
fefdd528d5
Handle xnack suffix in device name
2021-08-07 15:21:45 +02:00
Andrzej Janik
479014a783
Wire up AMD compilation
2021-08-06 13:19:55 +02:00
Andrzej Janik
5bfc2a56b9
Remove all use of L0
2021-08-06 02:15:57 +02:00
Andrzej Janik
becda31524
Convert OpenCL host code to SVM
2021-08-04 19:34:56 +02:00
Andrzej Janik
638786b0ec
Hack enough functionality that AMD GPU code builds
2021-08-03 00:22:47 +02:00
Andrzej Janik
b4de21fbc5
Use calls to OpenCL builtins when translating sregs, do SPIRV->LLVM conversion on every build
2021-08-02 01:04:05 +02:00