This website requires JavaScript.
Explore
Help
Register
Sign In
sleepy
/
llama.cpp
Watch
1
Star
0
Fork
0
You've already forked llama.cpp
Code
Issues
16
Pull Requests
Actions
29
Packages
Projects
Releases
Wiki
Activity
Files
fcf6538ba6702c55eaec70da9a75c81d04900a72
llama.cpp
/
gguf-py
/
gguf
T
History
Georgi Gerganov
fabf30b4c4
llama : remove Persimmon (
#7408
)
...
* llama : remove Persimmon * requirements : remove
2024-05-21 02:35:28 +10:00
..
__init__.py
convert-hf : support direct Q8_0 conversion (
#7234
)
2024-05-13 14:10:51 -04:00
constants.py
llama : remove Persimmon (
#7408
)
2024-05-21 02:35:28 +10:00
gguf_reader.py
convert-hf : save memory with lazy evaluation (
#7075
)
2024-05-08 18:16:38 -04:00
gguf_writer.py
convert-hf : support direct Q8_0 conversion (
#7234
)
2024-05-13 14:10:51 -04:00
gguf.py
gguf-py: Refactor and allow reading/modifying existing GGUF files (
#3981
)
2023-11-11 08:04:50 +03:00
lazy.py
convert-hf : support direct Q8_0 conversion (
#7234
)
2024-05-13 14:10:51 -04:00
py.typed
convert : various script cleanups/fixes + merges and special token handling (
#2842
)
2023-08-30 11:25:50 +03:00
quants.py
convert-hf : support direct Q8_0 conversion (
#7234
)
2024-05-13 14:10:51 -04:00
tensor_mapping.py
llama : add Jina Embeddings architecture (
#6826
)
2024-05-11 10:46:09 +03:00
vocab.py
convert-hf : save memory with lazy evaluation (
#7075
)
2024-05-08 18:16:38 -04:00