Georgi Gerganov
|
6309a60bac
|
ggml : vectorized quantize_row_q4_0 (ARM)
|
2 years ago |
Georgi Gerganov
|
ea97a5f469
|
ggml : vectorized mad q4_0 (ARM)
|
2 years ago |
Georgi Gerganov
|
8ce6d1e492
|
gq : add method 6 (ARM)
|
2 years ago |
Georgi Gerganov
|
1ca898f94b
|
gq : method 5 (ARM)
|
2 years ago |
Georgi Gerganov
|
5a96c91bea
|
gq : method 4 (AVX2 attempt) + method 5 (no min)
|
2 years ago |
Georgi Gerganov
|
cde7c22ab1
|
gq : method 4 (ARM)
|
2 years ago |
Georgi Gerganov
|
054d97e0e1
|
gq : method 4 (AVX2)
|
2 years ago |
Georgi Gerganov
|
37dcfad83b
|
gq : progress on method 2
|
2 years ago |
Georgi Gerganov
|
bf709e45de
|
gq : add amax based method 3
|
2 years ago |
Georgi Gerganov
|
0a7debb7bf
|
gq : attempt at n-bit quantization
|
2 years ago |
Georgi Gerganov
|
efa2cc36a2
|
tests : fix cblas_sgemm call
|
2 years ago |
Georgi Gerganov
|
3b3ad42906
|
tests : add SVD experiments
|
2 years ago |
Georgi Gerganov
|
78af1420bf
|
tests : change test2 eps
|
2 years ago |
Georgi Gerganov
|
73a7916d30
|
tests : some more quantization experiments
|
2 years ago |
Georgi Gerganov
|
e0abac1be7
|
sync : forgot to sync ggml.h
|
2 years ago |
Georgi Gerganov
|
deb0c486c7
|
tests : wip quantized matrix multiplication method 2
|
2 years ago |
Georgi Gerganov
|
d677c7f61d
|
tests : minor fixes for x86
|
2 years ago |
Georgi Gerganov
|
446ccf3ab1
|
tests : experiments with n-bit quantized matrix multiplication
|
2 years ago |
Georgi Gerganov
|
7b70c5a561
|
Minor fixes
|
2 years ago |
Georgi Gerganov
|
ea0ef2a41e
|
Performance tests - trying to optimize mul_mat
|
2 years ago |
Georgi Gerganov
|
fb558f78d9
|
Initial release
|
2 years ago |