Georgi Gerganov
|
6309a60bac
|
ggml : vectorized quantize_row_q4_0 (ARM)
|
2 years ago |
Georgi Gerganov
|
ea97a5f469
|
ggml : vectorized mad q4_0 (ARM)
|
2 years ago |
Georgi Gerganov
|
8ce6d1e492
|
gq : add method 6 (ARM)
|
2 years ago |
Georgi Gerganov
|
1ca898f94b
|
gq : method 5 (ARM)
|
2 years ago |
Georgi Gerganov
|
5a96c91bea
|
gq : method 4 (AVX2 attempt) + method 5 (no min)
|
2 years ago |
Georgi Gerganov
|
cde7c22ab1
|
gq : method 4 (ARM)
|
2 years ago |
Georgi Gerganov
|
054d97e0e1
|
gq : method 4 (AVX2)
|
2 years ago |
Georgi Gerganov
|
37dcfad83b
|
gq : progress on method 2
|
2 years ago |
Georgi Gerganov
|
bf709e45de
|
gq : add amax based method 3
|
2 years ago |
Georgi Gerganov
|
0a7debb7bf
|
gq : attempt at n-bit quantization
|
2 years ago |
Georgi Gerganov
|
73a7916d30
|
tests : some more quantization experiments
|
2 years ago |
Georgi Gerganov
|
e0abac1be7
|
sync : forgot to sync ggml.h
|
2 years ago |
Georgi Gerganov
|
deb0c486c7
|
tests : wip quantized matrix multiplication method 2
|
2 years ago |
Georgi Gerganov
|
d677c7f61d
|
tests : minor fixes for x86
|
2 years ago |
Georgi Gerganov
|
446ccf3ab1
|
tests : experiments with n-bit quantized matrix multiplication
|
2 years ago |