18 Commits

Author SHA1 Message Date
Andrzej Janik
49aabffdcc Rename zluda_dump to zluda_trace (#456)
* Rename zluda_dump to zluda_trace

* Minor naming fixes
2025-07-31 08:07:03 -07:00
Violet
66db19a061 Move FromCuda and ZludaObject into a common crate (#452)
* Refactor FromCuda error type to be generic

* Create zluda_common crate

* Move FromCuda trait into zluda_common

* Write some doc comments

* Fix typo

* Edit comment

* Fix formatting
2025-07-30 15:53:22 -07:00
Violet
481c3550fa Convert CUDA performance lib statuses to Rust result types (#444)
These changes replicate how the main library is handled. cuDNN still needs to have zluda_bindgen run and zluda_dump_common updated.
2025-07-29 14:28:14 -07:00
Violet
d81404eb70 Add support for rocblas to zluda_bindgen (#440)
One step of several for adding cublas support
2025-07-28 15:07:22 -07:00
Violet
8c23ef1ded Rename cuda_base to cuda_macros (#435) 2025-07-25 11:09:50 -07:00
aiwhskruht
9bd8125c53 Implement more CUDA driver API to enable simple cuda-samples (#405) 2025-07-08 10:18:30 -07:00
aiwhskruht
8d5b734c30 Add initialized check to protect zluda from cuda driver calls during shutdown (#404) 2025-07-07 11:08:09 -07:00
Andrzej Janik
22608d7420 Bump dependencies (#392)
zip 2.6.1 was yanked and microlp 2.10 has a major bug
2025-06-23 18:04:08 -07:00
Violet
4da3978f94 Implement cuLibraryLoadData (#388) 2025-06-18 16:05:53 -07:00
Andrzej Janik
f179868b8e Add automated builds (#358) 2025-06-16 09:53:18 -07:00
Violet
25a9d1c40e Implement runtime_callback_hooks_fn2 (#380) 2025-06-11 15:15:43 -07:00
Andrzej Janik
3361046760 Fix mad.wide, replace external CUDA library in test with our own (#376) 2025-06-09 21:33:18 -07:00
Andrzej Janik
c790ab45ec Redo logging to better log dark API and performance libraries (#372) 2025-06-09 15:29:14 -07:00
Andrzej Janik
3d3e38aadc Fix ROCm 6.4 failures (#364)
Lazy load comgr and dispatch to different code paths based on the name of the comgr .dll/.so
2025-05-02 00:38:22 +02:00
Andrzej Janik
cc83b9f1f6 Create infrastructure for performance libraries (#363) 2025-05-01 22:37:18 +02:00
Andrzej Janik
d704e92c97 Support instruction modes (denormal and rounding) on AMD GPUs (#342) 2025-03-17 21:37:26 +01:00
Joëlle van Essen
867e4728d5 LLVM unit tests (#324)
* LLVM unit tests: add assembly files

* LLVM unit tests: first attempt

* LLVM unit tests: fix - parse bitcode in context

* LLVM unit tests: use pretty_assertions for line-by-line diff

* LLVM unit tests: Write IR to file for failed test

* LLVM unit tests: just use the stack

* LLVM unit tests: use MaybeUninit

* LLVM unit tests: add mul24.ll

* LLVM unit tests: Adjustments after review

* LLVM unit tests: Include emit_llvm::Context in emit_llvm::Module

* LLVM unit tests: Fix typo

* LLVM unit tests: Context need not be pub
2025-02-19 21:21:20 +01:00
Andrzej Janik
df5a96d935 Improve build system (#329)
Also fix Dockerfile and Windows build
2025-01-28 01:55:36 +01:00