21 Commits (llama)

Author SHA1 Message Date
Georgi Gerganov 6309a60bac
ggml : vectorized quantize_row_q4_0 (ARM)
2 years ago
Georgi Gerganov ea97a5f469
ggml : vectorized mad q4_0 (ARM)
2 years ago
Georgi Gerganov 8ce6d1e492
gq : add method 6 (ARM)
2 years ago
Georgi Gerganov 1ca898f94b
gq : method 5 (ARM)
2 years ago
Georgi Gerganov 5a96c91bea
gq : method 4 (AVX2 attempt) + method 5 (no min)
2 years ago
Georgi Gerganov cde7c22ab1
gq : method 4 (ARM)
2 years ago
Georgi Gerganov 054d97e0e1
gq : method 4 (AVX2)
2 years ago
Georgi Gerganov 37dcfad83b
gq : progress on method 2
2 years ago
Georgi Gerganov bf709e45de
gq : add amax based method 3
2 years ago
Georgi Gerganov 0a7debb7bf
gq : attempt at n-bit quantization
2 years ago
Georgi Gerganov efa2cc36a2
tests : fix cblas_sgemm call
2 years ago
Georgi Gerganov 3b3ad42906
tests : add SVD experiments
2 years ago
Georgi Gerganov 78af1420bf
tests : change test2 eps
2 years ago
Georgi Gerganov 73a7916d30
tests : some more quantization experiments
2 years ago
Georgi Gerganov e0abac1be7
sync : forgot to sync ggml.h
2 years ago
Georgi Gerganov deb0c486c7
tests : wip quantized matrix multiplication method 2
2 years ago
Georgi Gerganov d677c7f61d
tests : minor fixes for x86
2 years ago
Georgi Gerganov 446ccf3ab1
tests : experiments with n-bit quantized matrix multiplication
2 years ago
Georgi Gerganov 7b70c5a561
Minor fixes
2 years ago
Georgi Gerganov ea0ef2a41e
Performance tests - trying to optimize mul_mat
2 years ago
Georgi Gerganov fb558f78d9
Initial release
2 years ago