Andrzej Janik
e248a2c9a9
Dump modules passed for linking
2021-09-13 18:11:47 +02:00
Andrzej Janik
2f951fa04c
Support more export table functions
2021-09-12 22:10:07 +02:00
Andrzej Janik
5290190727
Add support for a new export table
2021-09-12 15:12:26 +02:00
Andrzej Janik
ab67cd46fc
Fix dumping on x64 Linux (and possibly Windows)
2021-09-10 19:21:25 +02:00
Andrzej Janik
8f68287b18
Tune generated code, add a workaround for geekbench
2021-07-25 15:19:43 +02:00
Andrzej Janik
1897d33916
Fix typo
2021-07-05 01:04:19 +02:00
Andrzej Janik
7d4fbedfcf
Allow to set range of dump kernels
2021-06-30 23:56:15 +02:00
Andrzej Janik
b2a455e12e
Bunch of tiny fixes and improvements
2021-06-28 01:40:35 +02:00
Andrzej Janik
196242b410
Revert "Fix offset calculation in kernel launch"
...
This reverts commit d7d38256e0580c3cfb649a641d9ed62c6ff0fc20.
2021-06-27 21:55:08 +02:00
Andrzej Janik
55fbe1abb5
Fix bugs related to replay on Linux
2021-06-27 16:19:04 +02:00
Andrzej Janik
e89b0c5d9c
Check for presence of ".version" instead of ".address_size" (which is optional)
2021-06-27 14:58:36 +02:00
Andrzej Janik
d7d38256e0
Fix offset calculation in kernel launch
2021-06-27 13:08:46 +02:00
Andrzej Janik
8ef6c3d8b6
Merge branch 'one_type_type2'
2021-06-25 01:20:38 +02:00
Andrzej Janik
f0771e1fb6
Slightly improve stateful optimization
2021-06-11 00:00:56 +02:00
Andrzej Janik
89e72e4e95
Handle even more export table functions
2021-05-17 01:25:38 +02:00
Andrzej Janik
dca4c5bd21
Report calls to unsupported exportad table functions
2021-05-16 01:08:59 +02:00
Andrzej Janik
425edfcdd4
Simplify typing
2021-05-07 18:22:09 +02:00
Andrzej Janik
a005c92c61
Index from 0
2021-04-12 01:07:45 +02:00
Andrzej Janik
fedf88180a
Dump all modules, even if not enqueued
2021-04-12 00:42:35 +02:00
Andrzej Janik
96f95d59ce
Make zluda_dump more robust
2021-04-12 00:18:27 +02:00
Andrzej Janik
a39dda67d1
Make dumper compatible with older versions of CUDA
2021-04-10 23:01:01 +02:00
Andrzej Janik
8393dbd6e9
More fixes for 32bit
2021-04-09 22:00:23 +02:00
Andrzej Janik
9dcfb45aa2
Make dumper 32-bit compatible
2021-04-09 21:34:41 +02:00
Andrzej Janik
eec55d9d02
Inform about ELF binaries in dumper
2021-02-28 12:49:25 +01:00
Andrzej Janik
ba83bb28f7
Inject our own NVML
2021-02-28 01:50:04 +01:00
Andrzej Janik
bfae2e0d21
Allow overriding device compute version in dumper
2021-02-27 20:55:19 +01:00
Andrzej Janik
36514bd6eb
Improve ZLUDA injection ( #37 )
...
Improve injector&redirector so it's no longer required to manually mess with files if the application links nvcuda.dll. Additionally inject into child processes
2021-02-20 21:40:19 +01:00
Andrzej Janik
972f612562
Fix signed integer conversion ( #36 )
...
This fixes the last remaining bug preventing end-to-end GeekBench run, so also update Geekbench results in README
2021-01-26 21:05:09 +01:00
Andrzej Janik
3e2e73ac33
Add script for replaying dumped kernel ( #34 )
...
zluda_dump can already create traces of GPU execution, this script can replay those traces.
Additionally, changed added just enough code in core ZLUDA to support simple PyCUDAexecution
2021-01-23 16:57:07 +01:00
Andrzej Janik
ff8135e8a3
Add a library for dumping kernels arguments before and after launch ( #18 )
2021-01-16 22:28:48 +01:00