Joëlle van Essen
867e4728d5
LLVM unit tests ( #324 )
...
* LLVM unit tests: add assembly files
* LLVM unit tests: first attempt
* LLVM unit tests: fix - parse bitcode in context
* LLVM unit tests: use pretty_assertions for line-by-line diff
* LLVM unit tests: Write IR to file for failed test
* LLVM unit tests: just use the stack
* LLVM unit tests: use MaybeUninit
* LLVM unit tests: add mul24.ll
* LLVM unit tests: Adjustments after review
* LLVM unit tests: Include emit_llvm::Context in emit_llvm::Module
* LLVM unit tests: Fix typo
* LLVM unit tests: Context need not be pub
2025-02-19 21:21:20 +01:00
Andrzej Janik
df5a96d935
Improve build system ( #329 )
...
Also fix Dockerfile and Windows build
2025-01-28 01:55:36 +01:00
Andrzej Janik
7ac67a89e9
Enable Geekbench 5 ( #304 )
2024-12-10 21:48:10 +01:00
Andrzej Janik
7a6df9dcbf
Fix host code and update to CUDA 12.4 ( #299 )
2024-12-02 00:29:57 +01:00
Andrzej Janik
46def3e7e0
Connect new parser to LLVM bitcode backend ( #269 )
...
This is very incomplete. Just enough code to emit LLVM bitcode and continue further development
2024-09-13 01:07:31 +02:00
Andrzej Janik
816365e7df
Fix shared munging pass and add fix cuModuleLoadData
2021-09-29 21:49:47 +00:00
Andrzej Janik
9609f86033
Fix minor bugs
2021-09-19 00:39:43 +00:00
Andrzej Janik
ccf3c02ac1
Minor fixes
2021-09-18 01:36:12 +00:00
Andrzej Janik
c37223fe67
Set linking output
2021-09-17 01:25:34 +00:00
Andrzej Janik
62ce1fd3a9
Implement linking
2021-09-16 23:26:02 +00:00
Andrzej Janik
04394dbb04
Oops
2021-09-16 23:13:10 +00:00
Andrzej Janik
314e3dcb49
Add missing V-RAY host functions
2021-09-16 23:11:34 +00:00
Andrzej Janik
986fa49097
Zero out buffer on creation
2021-09-13 23:43:50 +00:00
Andrzej Janik
dbb6f09ffa
Continue HIP conversion
2021-09-13 17:59:40 +00:00
Andrzej Janik
2f951fa04c
Support more export table functions
2021-09-12 22:10:07 +02:00
Andrzej Janik
5290190727
Add support for a new export table
2021-09-12 15:12:26 +02:00
Andrzej Janik
a71cd44104
HIP conversion part #3
2021-09-05 17:29:33 +02:00
Andrzej Janik
a63f004540
HIP conversion part #2
2021-09-05 14:58:15 +02:00
Andrzej Janik
5ec18f14a1
HIP conversion part #1
2021-08-27 23:04:52 +00:00
Andrzej Janik
4ae7feb93a
Start converting host code to HIP
2021-08-27 17:27:48 +00:00
Andrzej Janik
9631a8d242
Don't wait on a blocking event
2021-08-08 17:43:14 +02:00
Andrzej Janik
e2432d0df1
Improve AMD compatibility
2021-08-08 14:26:56 +02:00
Andrzej Janik
043172bd9b
Additional options to clang
2021-08-08 02:42:38 +02:00
Andrzej Janik
5969e59aae
Explicitly mark input to AMD as bitcode
2021-08-08 02:35:17 +02:00
Andrzej Janik
4b4ba90219
Persist AMD kernels for later debug
2021-08-07 18:10:53 +02:00
Andrzej Janik
44decaf396
Use raw interop for building programs
2021-08-07 16:53:26 +02:00
Andrzej Janik
407664600a
Hack to read clang output
2021-08-07 16:16:59 +02:00
Andrzej Janik
3ce6aee65d
Try seeking before reading
2021-08-07 16:06:15 +02:00
Andrzej Janik
3070d983ab
Take path to llvm-spirv from environment
2021-08-07 15:29:19 +02:00
Andrzej Janik
fefdd528d5
Handle xnack suffix in device name
2021-08-07 15:21:45 +02:00
Andrzej Janik
479014a783
Wire up AMD compilation
2021-08-06 13:19:55 +02:00
Andrzej Janik
5bfc2a56b9
Remove all use of L0
2021-08-06 02:15:57 +02:00
Andrzej Janik
becda31524
Convert OpenCL host code to SVM
2021-08-04 19:34:56 +02:00
Andrzej Janik
638786b0ec
Hack enough functionality that AMD GPU code builds
2021-08-03 00:22:47 +02:00
Andrzej Janik
b4de21fbc5
Use calls to OpenCL builtins when translating sregs, do SPIRV->LLVM conversion on every build
2021-08-02 01:04:05 +02:00
Andrzej Janik
8f68287b18
Tune generated code, add a workaround for geekbench
2021-07-25 15:19:43 +02:00
Andrzej Janik
9d4f26bd07
Finish converting to OpenCL
2021-07-22 00:20:33 +02:00
Andrzej Janik
3d2024bf62
Start converting to OpenCL
2021-07-21 01:46:50 +02:00
Andrzej Janik
58fb8a234c
Synchronize through barrier
2021-07-06 00:16:46 +02:00
Andrzej Janik
2c6bee4955
Fix overzealus check
2021-07-05 01:14:46 +02:00
Andrzej Janik
5b593ec185
Implement stream-wide event reuse
2021-07-04 19:06:37 +02:00
Andrzej Janik
ce25035051
Use immediate command lists
2021-07-04 16:46:07 +02:00
Andrzej Janik
ecc33f7b10
Make everything async
2021-07-04 15:40:08 +02:00
Andrzej Janik
d76ffd691c
Remember to actually submit workload
2021-07-04 13:24:35 +02:00
Andrzej Janik
b460e359ae
First attempt at async host side
2021-07-04 12:54:27 +02:00
Andrzej Janik
b2a455e12e
Bunch of tiny fixes and improvements
2021-06-28 01:40:35 +02:00
Andrzej Janik
196242b410
Revert "Fix offset calculation in kernel launch"
...
This reverts commit d7d38256e0580c3cfb649a641d9ed62c6ff0fc20.
2021-06-27 21:55:08 +02:00
Andrzej Janik
55fbe1abb5
Fix bugs related to replay on Linux
2021-06-27 16:19:04 +02:00
Andrzej Janik
d7d38256e0
Fix offset calculation in kernel launch
2021-06-27 13:08:46 +02:00
Andrzej Janik
b2765370e5
Fix more bugs
2021-06-27 11:46:59 +02:00