Violet
99c36092be
Use FromCuda
in zluda_blas
( #455 )
2025-07-31 09:52:10 -07:00
Andrzej Janik
49aabffdcc
Rename zluda_dump to zluda_trace ( #456 )
...
* Rename zluda_dump to zluda_trace
* Minor naming fixes
2025-07-31 08:07:03 -07:00
Violet
66db19a061
Move FromCuda
and ZludaObject
into a common crate ( #452 )
...
* Refactor FromCuda error type to be generic
* Create zluda_common crate
* Move FromCuda trait into zluda_common
* Write some doc comments
* Fix typo
* Edit comment
* Fix formatting
2025-07-30 15:53:22 -07:00
Violet
481c3550fa
Convert CUDA performance lib statuses to Rust result types ( #444 )
...
These changes replicate how the main library is handled. cuDNN still needs to have zluda_bindgen run and zluda_dump_common updated
2025-07-29 14:28:14 -07:00
Violet
d81404eb70
Add support for rocblas
to zluda_bindgen
( #440 )
...
One step of several for adding cublas support
2025-07-28 15:07:22 -07:00
Violet
8c23ef1ded
Rename cuda_base cuda_macros ( #435 )
2025-07-25 11:09:50 -07:00
aiwhskruht
9bd8125c53
Implement more CUDA driver API to enable simple cuda-samples ( #405 )
2025-07-08 10:18:30 -07:00
aiwhskruht
8d5b734c30
Add initialized check to protect zluda from cuda driver calls during shutdown ( #404 )
2025-07-07 11:08:09 -07:00
Andrzej Janik
22608d7420
Bump dependencies ( #392 )
...
zip 2.6.1 was yanked and microlp 2.10 has a major bug
2025-06-23 18:04:08 -07:00
Violet
4da3978f94
Implement cuLibraryLoadData
( #388 )
2025-06-18 16:05:53 -07:00
Andrzej Janik
f179868b8e
Add automated builds ( #358 )
2025-06-16 09:53:18 -07:00
Violet
25a9d1c40e
Implement runtime_callback_hooks_fn2 ( #380 )
2025-06-11 15:15:43 -07:00
Andrzej Janik
3361046760
Fix mad.wide, replace external CUDA library in test with our own ( #376 )
2025-06-09 21:33:18 -07:00
Andrzej Janik
c790ab45ec
Redo logging to better log dark API and performance libraries ( #372 )
2025-06-09 15:29:14 -07:00
Andrzej Janik
3d3e38aadc
Fix ROCm 6.4 failures ( #364 )
...
Lazy load comgr and dispatch to different code paths based on the name of the comgr .dll/.so
2025-05-02 00:38:22 +02:00
Andrzej Janik
cc83b9f1f6
Create infrastructure for performance libraries ( #363 )
2025-05-01 22:37:18 +02:00
Andrzej Janik
d704e92c97
Support instruction modes (denormal and rounding) on AMD GPUs ( #342 )
2025-03-17 21:37:26 +01:00
Joëlle van Essen
867e4728d5
LLVM unit tests ( #324 )
...
* LLVM unit tests: add assembly files
* LLVM unit tests: first attempt
* LLVM unit tests: fix - parse bitcode in context
* LLVM unit tests: use pretty_assertions for line-by-line diff
* LLVM unit tests: Write IR to file for failed test
* LLVM unit tests: just use the stack
* LLVM unit tests: use MaybeUninit
* LLVM unit tests: add mul24.ll
* LLVM unit tests: Adjustments after review
* LLVM unit tests: Include emit_llvm::Context in emit_llvm::Module
* LLVM unit tests: Fix typo
* LLVM unit tests: Context need not be pub
2025-02-19 21:21:20 +01:00
Andrzej Janik
df5a96d935
Improve build system ( #329 )
...
Also fix Dockerfile and Windows build
2025-01-28 01:55:36 +01:00