83 Commits

Author SHA1 Message Date
Andrzej Janik
059b8ca0f6 Make sure it is possible to log 32bit PhysX (#374) 2025-07-08 10:19:49 -07:00
aiwhskruht
9bd8125c53 Implement more CUDA driver API to enable simple cuda-samples (#405) 2025-07-08 10:18:30 -07:00
aiwhskruht
8d5b734c30 Add initialized check to protect zluda from cuda driver calls during shutdown (#404) 2025-07-07 11:08:09 -07:00
Andrzej Janik
ef0c4afcf9 Run unit tests on every commit (#401) 2025-07-03 16:07:00 -07:00
aiwhskruht
d4ad17d75a Unified fatbin versions behind a single iterator. (#398) 2025-06-27 15:56:46 -07:00
Violet
4da3978f94 Implement cuLibraryLoadData (#388) 2025-06-18 16:05:53 -07:00
Violet
8ce70c5095 Add integrity_check implementation to ZLUDA (#387) 2025-06-17 15:00:10 -07:00
Violet
4d4053194a Implement runtime_callback_hooks_fn6 (#386) 2025-06-16 17:00:47 -07:00
Violet
9c5f1ed9fb Handle new attributes in cuDeviceGetAttribute (#383) 2025-06-16 13:20:04 -07:00
Violet
9773d20945 Implement cudart_interface_fn2 (#382) 2025-06-13 14:01:14 -07:00
Violet
1715830d82 Implement cuModuleGetLoadingMode (#381) 2025-06-11 15:54:48 -07:00
Violet
25a9d1c40e Implement runtime_callback_hooks_fn2 (#380) 2025-06-11 15:15:43 -07:00
Violet
62f3e63355 Implement cuGetProcAddress and cuGetProcAddress_v2 (#377) 2025-06-10 16:07:35 -07:00
Andrzej Janik
3361046760 Fix mad.wide, replace external CUDA library in test with our own (#376) 2025-06-09 21:33:18 -07:00
Andrzej Janik
c790ab45ec Redo logging to better log dark API and performance libraries (#372) 2025-06-09 15:29:14 -07:00
Andrzej Janik
3d3e38aadc Fix ROCm 6.4 failures (#364)
Lazy load comgr and dispatch to different code paths based on the name of the comgr .dll/.so
2025-05-02 00:38:22 +02:00
Joëlle van Essen
867e4728d5 LLVM unit tests (#324)
* LLVM unit tests: add assembly files
* LLVM unit tests: first attempt
* LLVM unit tests: fix - parse bitcode in context
* LLVM unit tests: use pretty_assertions for line-by-line diff
* LLVM unit tests: Write IR to file for failed test
* LLVM unit tests: just use the stack
* LLVM unit tests: use MaybeUninit
* LLVM unit tests: add mul24.ll
* LLVM unit tests: Adjustments after review
* LLVM unit tests: Include emit_llvm::Context in emit_llvm::Module
* LLVM unit tests: Fix typo
* LLVM unit tests: Context need not be pub
2025-02-19 21:21:20 +01:00
Andrzej Janik
df5a96d935 Improve build system (#329)
Also fix Dockerfile and Windows build
2025-01-28 01:55:36 +01:00
Andrzej Janik
7ac67a89e9 Enable Geekbench 5 (#304) 2024-12-10 21:48:10 +01:00
Andrzej Janik
7a6df9dcbf Fix host code and update to CUDA 12.4 (#299) 2024-12-02 00:29:57 +01:00
Andrzej Janik
46def3e7e0 Connect new parser to LLVM bitcode backend (#269)
This is very incomplete. Just enough code to emit LLVM bitcode and continue further development
2024-09-13 01:07:31 +02:00
Andrzej Janik
816365e7df Fix shared munging pass and add fix cuModuleLoadData 2021-09-29 21:49:47 +00:00
Andrzej Janik
9609f86033 Fix minor bugs 2021-09-19 00:39:43 +00:00
Andrzej Janik
ccf3c02ac1 Minor fixes 2021-09-18 01:36:12 +00:00
Andrzej Janik
c37223fe67 Set linking output 2021-09-17 01:25:34 +00:00
Andrzej Janik
62ce1fd3a9 Implement linking 2021-09-16 23:26:02 +00:00
Andrzej Janik
04394dbb04 Oops 2021-09-16 23:13:10 +00:00
Andrzej Janik
314e3dcb49 Add missing V-RAY host functions 2021-09-16 23:11:34 +00:00
Andrzej Janik
986fa49097 Zero out buffer on creation 2021-09-13 23:43:50 +00:00
Andrzej Janik
dbb6f09ffa Continue HIP conversion 2021-09-13 17:59:40 +00:00
Andrzej Janik
2f951fa04c Support more export table functions 2021-09-12 22:10:07 +02:00
Andrzej Janik
5290190727 Add support for a new export table 2021-09-12 15:12:26 +02:00
Andrzej Janik
a71cd44104 HIP conversion part #3 2021-09-05 17:29:33 +02:00
Andrzej Janik
a63f004540 HIP conversion part #2 2021-09-05 14:58:15 +02:00
Andrzej Janik
5ec18f14a1 HIP conversion part #1 2021-08-27 23:04:52 +00:00
Andrzej Janik
4ae7feb93a Start converting host code to HIP 2021-08-27 17:27:48 +00:00
Andrzej Janik
9631a8d242 Don't wait on a blocking event 2021-08-08 17:43:14 +02:00
Andrzej Janik
e2432d0df1 Improve AMD compatibility 2021-08-08 14:26:56 +02:00
Andrzej Janik
043172bd9b Additional options to clang 2021-08-08 02:42:38 +02:00
Andrzej Janik
5969e59aae Explicitly mark input to AMD as bitcode 2021-08-08 02:35:17 +02:00
Andrzej Janik
4b4ba90219 Persist AMD kernels for later debug 2021-08-07 18:10:53 +02:00
Andrzej Janik
44decaf396 Use raw interop for building programs 2021-08-07 16:53:26 +02:00
Andrzej Janik
407664600a Hack to read clang output 2021-08-07 16:16:59 +02:00
Andrzej Janik
3ce6aee65d Try seeking before reading 2021-08-07 16:06:15 +02:00
Andrzej Janik
3070d983ab Take path to llvm-spirv from environment 2021-08-07 15:29:19 +02:00
Andrzej Janik
fefdd528d5 Handle xnack suffix in device name 2021-08-07 15:21:45 +02:00
Andrzej Janik
479014a783 Wire up AMD compilation 2021-08-06 13:19:55 +02:00
Andrzej Janik
5bfc2a56b9 Remove all use of L0 2021-08-06 02:15:57 +02:00
Andrzej Janik
becda31524 Convert OpenCL host code to SVM 2021-08-04 19:34:56 +02:00
Andrzej Janik
638786b0ec Hack enough functionality that AMD GPU code builds 2021-08-03 00:22:47 +02:00