Logo
Explore Help
Register Sign In
Migrations
/
ggml
1
0
Fork
You've already forked ggml
0
Code Issues Pull Requests Packages Projects Releases Wiki Activity
85 Commits
6 Branches
0 Tags
582 KiB
llama
gq
master
4bit
t5
experiments/blocking
Branches Tags
${ item.name }
Create tag ${ searchTerm }
Create branch ${ searchTerm }
from '036268500e'
${ noResults }
Commit Graph

14 Commits (036268500e4ba8ed39fa8aef99899c265dc75b45)

Author SHA1 Message Date
Georgi Gerganov 10356cdcdd
gpt : seems not worth to use FP16 for KV cache
2 years ago
Georgi Gerganov eaa4006047
gpt : fix memory usage computation
2 years ago
Georgi Gerganov fde29bd005
ggml : add ggml_compute_forward_rope_f16()
2 years ago
Georgi Gerganov 5bd952ac3f
gpt-2 : minor
2 years ago
Georgi Gerganov 86b1e356b0
gpt : avoid ggml_transpose on model tensors (new models!)
2 years ago
Georgi Gerganov b48b09c37f
gpt-2 : add gpt-2-quantize tool for quantizing f32 GPT-2 models
2 years ago
Georgi Gerganov a366dd31cc
ggml : q4_1 quantization support (seems to work for bigger models)
2 years ago
Georgi Gerganov 751aa84f1a
gpt-2 : loading Q4_0 quantized model
2 years ago
Georgi Gerganov fb64edddb7
gpt : fix sampling to use the temperature (close #16)
3 years ago
Georgi Gerganov a0f2f68cdb
gpt-2 : fix broken prompt due to recent experiments
No idea why I commited that!?
3 years ago
Georgi Gerganov 1dcbe86a0c
gpt-2 : experimenting with attention mask
3 years ago
Georgi Gerganov 99f1afb613
gpt-2 : fix off-by-one error in batching logic
3 years ago
Georgi Gerganov 787efb4d2e
Adding Whisper inference example
3 years ago
Georgi Gerganov fb558f78d9
Initial release
3 years ago
Powered by Gitea Version: 1.18.1 Page: 777ms Template: 11ms
English
Bahasa Indonesia Deutsch English Español Français Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API