Logo
Explore Help
Register Sign In
Migrations
/
ggml
1
0
Fork
You've already forked ggml
0
Code Issues Pull Requests Packages Projects Releases Wiki Activity
84 Commits
6 Branches
0 Tags
582 KiB
llama
gq
master
4bit
t5
experiments/blocking
Branches Tags
${ item.name }
Create tag ${ searchTerm }
Create branch ${ searchTerm }
from 'gq'
${ noResults }
Commit Graph

8 Commits (gq)

Author SHA1 Message Date
Georgi Gerganov 10356cdcdd
gpt : seems not worth to use FP16 for KV cache
2 years ago
Georgi Gerganov eaa4006047
gpt : fix memory usage computation
2 years ago
Georgi Gerganov fde29bd005
ggml : add ggml_compute_forward_rope_f16()
2 years ago
Georgi Gerganov 86b1e356b0
gpt : avoid ggml_transpose on model tensors (new models!)
2 years ago
Georgi Gerganov 11295af7a6
gpt-j : support for 4-bit quantized model inference
2 years ago
Georgi Gerganov fb64edddb7
gpt : fix sampling to use the temperature (close #16)
3 years ago
Georgi Gerganov 787efb4d2e
Adding Whisper inference example
3 years ago
Georgi Gerganov fb558f78d9
Initial release
3 years ago
Powered by Gitea Version: 1.18.1 Page: 659ms Template: 7ms
English
Bahasa Indonesia Deutsch English Español Français Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API