Commit Graph

  • 8eea5ae0e5 Delete SHA256SUMS for now (#416) anzz1 2023-03-23 12:26:19 +02:00
  • 93208cfb92 Adjust repetition penalty .. Georgi Gerganov 2023-03-23 10:46:58 +02:00
  • 03ace14cfd Add link to recent podcast about whisper.cpp and llama.cpp Georgi Gerganov 2023-03-23 09:48:51 +02:00
  • e4412b45e3 CI: CMake: Separate build and test steps (#376) master-e4412b4 anzz1 2023-03-23 04:20:34 +02:00
  • f7dc43bc0d Fix instruct mode broken by PR #354 (#409) master-f7dc43b tjohnman 2023-03-23 01:30:23 +01:00
  • ee8a788786 Update issue template so people will use it (#404) Gary Mulder 2023-03-22 19:06:18 +00:00
  • 69c92298a9 Deduplicate q4 quantization functions (#383) master-69c9229 Stephan Walter 2023-03-22 17:29:06 +00:00
  • 97940520e8 fix: add POSIX functionality for Linux compilation (#51) master-9794052 master-305ba6f Valentyn Bezshapkin 2023-03-22 18:20:25 +01:00
  • 305ba6f0e6 Don't force immediate interactive without -i (#354) tjohnman 2023-03-22 18:16:35 +01:00
  • 4122dffff9 cmake: make llama an actual library (#392) master-4122dff Erik Scholz 2023-03-22 17:37:10 +01:00
  • 56e659a0b2 fix perplexity after c-api refactor (#390) master-56e659a Erik Scholz 2023-03-22 17:09:38 +01:00
  • 40ea807a97 Add details on perplexity to README.md (#395) Gary Linscott 2023-03-22 08:53:54 -07:00
  • d5850c53ca Add missing header for memcpy (#386) master-d5850c5 Yusuf Kağan Hanoğlu 2023-03-22 11:55:45 +03:00
  • ae44e23ee3 When seed <= 0 - use the clock to generate one master-ae44e23 master-928480e Georgi Gerganov 2023-03-22 07:47:15 +02:00
  • 928480ef5b Init llama_context_params properly from CLI (#370) Georgi Gerganov 2023-03-22 07:45:00 +02:00
  • 56817b1f88 Remove temporary notice and update hot topics master-f5a77a6 Georgi Gerganov 2023-03-22 07:34:02 +02:00
  • f5a77a629b Introduce C-style API (#370) Georgi Gerganov 2023-03-22 07:32:36 +02:00
  • da0e9fe90c Add SHA256SUMS file and instructions to README how to obtain and verify the downloads Gary Mulder 2023-03-20 20:14:06 +00:00
  • e6c9e0986c Fix bin dir for win ci master-e6c9e09 anzz1 2023-03-21 23:49:24 +02:00
  • 01a297b099 specify build type for ctest on windows (#371) master-01a297b Erik Scholz 2023-03-21 22:34:25 +01:00
  • 3366853e41 Add notice about pending change Georgi Gerganov 2023-03-21 22:57:35 +02:00
  • 3f9c6135e4 fix typo in chatLLaMa (#368) Mathieu Nayrolles 2023-03-21 16:52:27 -04:00
  • 0f61352708 Update issue templates Georgi Gerganov 2023-03-21 19:47:27 +02:00
  • 353ec251a4 We could use std::unordered_map over std::map (#305) Fabio R. Sluzala 2023-03-21 14:21:50 -03:00
  • 89d5d90f3b Fix color codes emitting mid-UTF8 code. (#312) Matvey Soloviev 2023-03-21 18:11:01 +01:00
  • 16ffc013c6 Importer for GPTQ quantized LLaMA models (#301) comex 2023-03-21 09:42:25 -07:00
  • 486ae645fd Compute perplexity over prompt (#270) Gary Linscott 2023-03-21 09:27:42 -07:00
  • 3ab3e6582f Add chatLLaMa script (#198) Jean-Christophe Hoelt 2023-03-21 18:23:15 +02:00
  • f157088cb7 makefile: Fix CPU feature detection on Haiku (#218) Alex von Gluck IV 2023-03-21 11:21:06 -05:00
  • c86ba036e6 Enable ANSI colors on Windows 10+ (#311) anzz1 2023-03-21 18:14:46 +02:00
  • 1daf4dd712 Minor style changes Georgi Gerganov 2023-03-21 18:10:32 +02:00
  • dc6a845b85 Add chat.sh script Georgi Gerganov 2023-03-21 18:09:37 +02:00
  • 6a612959e1 Check for reverse prompt by characters instead of tokens (#292) (#330) tjohnman 2023-03-21 17:05:06 +01:00
  • d5f56a5e5a Check for reverse prompt by characters instead of tokens (#292) (#330) tjohnman 2023-03-21 17:04:43 +01:00
  • 3bfa3b43b7 Fix convert script, warnings alpaca instructions, default params Georgi Gerganov 2023-03-21 17:59:16 +02:00
  • 715d292ee0 Add OpenBSD support (#314) Kevin Lo 2023-03-21 09:50:09 -06:00
  • c98ae02668 fix typo in comment (#318) Mack Straight 2023-03-21 08:49:43 -07:00
  • c3b2306b18 Makefile: slightly cleanup for Mac Intel; echo instead of run ./main -h (#335) Qingyou Meng 2023-03-21 23:44:11 +08:00
  • 975d2cebf9 cmdline option for custom amount of model parts (--n_parts N) (#348) anzz1 2023-03-21 17:42:43 +02:00
  • e0ffc861fa Update IPFS links to quantized alpaca with new tokenizer format (#352) Kevin Kwok 2023-03-21 08:34:49 -07:00
  • 8f644a0a85 Change default repeat_penalty to 1.0 Georgi Gerganov 2023-03-21 17:32:14 +02:00
  • eb34620aec Add tokenizer test + revert to C++11 (#355) Georgi Gerganov 2023-03-21 17:29:41 +02:00
  • 2e664f1ff4 Add initial AVX512 support for dot product on Linux (#320) master-2e664f1 Casey Primozic 2023-03-21 07:35:42 -07:00
  • 8cf9f34edd Adding missing features of CMakeLists.txt & Refactoring (#131) master-8cf9f34 nusu-github 2023-03-21 09:37:16 +09:00
  • bd4b46d6ba Nix flake: set meta.mainProgram to llama Ben Siraphob 2023-03-20 16:44:30 -05:00
  • 6b6d5b5024 Fixed tokenizer.model not found error when model dir is symlink (#325) Qingyou Meng 2023-03-21 03:33:10 +08:00
  • a791a68b61 move file magic/version to header, print expected version (#319) master-a791a68 Mack Straight 2023-03-20 12:26:01 -07:00
  • 0f1b21cb90 Docker - Fix publish docker image in GitHub Registry (#235) master-0f1b21c Bernat Vadell 2023-03-20 18:05:20 +01:00
  • 074bea2eb1 sentencepiece bpe compatible tokenizer (#252) master-074bea2 Mack Straight 2023-03-20 03:17:23 -07:00
  • 5cb63e2493 Add tqdm to Python requirements (#293) Stephan Walter 2023-03-20 08:24:11 +00:00
  • da5303c1ea bugfix: default should not be interactive (#304) master-da5303c cocktailpeanut 2023-03-19 17:44:20 -04:00
  • 4545539d71 Rename script Georgi Gerganov 2023-03-19 21:58:51 +02:00
  • edeba28366 Add temporary helper script for Alpaca chat Georgi Gerganov 2023-03-19 21:57:28 +02:00
  • 5c19c70ba6 fix coloring of last n_batch of prompt, and refactor line input (#221) master-5c19c70 Rickey Bowers Jr 2023-03-19 13:44:30 -06:00
  • 24568371ae Support for multiple reverse prompts. (#299) master-2456837 tjohnman 2023-03-19 20:33:06 +01:00
  • 7392f1cd2c Improved quantize script (#222) master-ad5fd5b Suaj Carrot 2023-03-19 12:38:44 -06:00
  • ad5fd5b60c Make prompt randomization optional. (#300) tjohnman 2023-03-19 19:36:19 +01:00
  • 368d0c8a9e Respect the maximum number of tokens in interactive. (#298) master-368d0c8 tjohnman 2023-03-19 19:31:17 +01:00
  • 50fae10d03 Add --ignore-eos parameter (#181) master-50fae10 slaren 2023-03-19 19:22:48 +01:00
  • 084e2f0ec0 interactive mode: print '\n' in sigint_handler, this flush stdout thus ensure color reset. (#283) master-084e2f0 Qingyou Meng 2023-03-20 02:10:00 +08:00
  • 0b366e7357 Command line switch to use F16 for memory_k and memory_v (refactor of #154) (#294) master-0b366e7 Erik Scholz 2023-03-19 18:57:00 +01:00
  • 160bfb217d Update hot topics to mention Alpaca support Georgi Gerganov 2023-03-19 19:51:55 +02:00
  • c494ed5b94 Fix off-by-one bug (#115) master-c494ed5 Georgi Gerganov 2023-03-19 19:46:32 +02:00
  • c1c7026b47 Fix python stuff (#109) Georgi Gerganov 2023-03-19 19:33:18 +02:00
  • 467b149761 Refactoring convert-pth-to-ggml.py: more concise and readable (#109) qunash 2023-03-19 20:17:39 +03:00
  • 70f01cb863 Drop trailing new line from file prompts (#80) master-70f01cb Georgi Gerganov 2023-03-19 19:04:44 +02:00
  • a4e63b73df Add instruction for using Alpaca (#240) Georgi Gerganov 2023-03-19 18:49:50 +02:00
  • 9e1707218a Add "--instruct" argument for usage with Alpaca (#240) master-9e17072 Georgi Gerganov 2023-03-19 18:37:02 +02:00
  • 22213a17b5 Change RMSNorm eps to 1e-6 (#173) master-22213a1 Georgi Gerganov 2023-03-19 17:30:00 +02:00
  • d7def1a752 Warn user if a context size greater than 2048 tokens is specified (#274) master-d7def1a Ronsor 2023-03-18 17:10:47 -07:00
  • 6f61c18ec9 Fix typo in readme Pavol Rusnak 2023-03-18 22:39:46 +01:00
  • 1e5a6d088d Add note about Python 3.11 to readme Pavol Rusnak 2023-03-18 22:20:04 +01:00
  • 554b541521 Add memory/disk requirements to readme Pavol Rusnak 2023-03-18 21:58:46 +01:00
  • d3f202d57b Remove unused code since n_vocab is model.hparams.n_vocab (#262) master-d3f202d Alex Nguyen 2023-03-18 20:51:49 +07:00
  • e03e359730 fixed warning with std::ignore about unused function result (#151) Justin Suess 2023-03-18 07:44:09 -04:00
  • a81d0c2a17 Fix n^2 loop in tokenization (#254) Gary Linscott 2023-03-18 04:17:19 -07:00
  • b2de7f18df CI Improvements (#230) anzz1 2023-03-18 09:27:12 +02:00
  • a292747893 Nix flake (#40) Niklas Korz 2023-03-17 23:03:48 +01:00
  • c9f670a177 Implement non-greedy tokenizer that tries to maximize token lengths (#242) thement 2023-03-17 21:05:58 +01:00
  • 4f54609110 Default to 4 threads (#243) Georgi Gerganov 2023-03-17 21:46:46 +02:00
  • e81b9c81c1 Update Contributing section Georgi Gerganov 2023-03-17 20:30:04 +02:00
  • 367946c668 Don't tell users to use a bad number of threads (#243) Stephan Walter 2023-03-17 17:47:35 +00:00
  • 6b0df5ccf3 add ptread link to fix cmake build under linux (#114) mmyjona 2023-03-18 00:38:24 +08:00
  • 2af23d3043 🚀 Dockerize llamacpp (#132) Bernat Vadell 2023-03-17 10:47:06 +01:00
  • 904d2a8d6a Q4_1 quantization (#193) Matvey Soloviev 2023-03-17 05:48:39 +01:00
  • 721311070e Update README.md Georgi Gerganov 2023-03-16 15:00:09 +02:00
  • ac15de7895 Expand "Contributing" section Georgi Gerganov 2023-03-16 08:55:13 +02:00
  • 273abc47ff Update hot topics - RMSnorm Georgi Gerganov 2023-03-16 07:12:12 +02:00
  • 9b4a15b17d Fix RMS norm in GGML (#191) Nebula 2023-03-15 19:29:25 -04:00
  • 6eac39ba95 Add RMS norm and use it (#187) hoangmit 2023-03-15 18:41:38 -04:00
  • 27944c4206 fixed typo (#178) moritzbrantner 2023-03-15 21:35:25 +01:00
  • 2d15d6c9a9 add SIGINT support for _WIN32 environments (#120) Rickey Bowers Jr 2023-03-15 13:56:24 -06:00
  • 2d64715ad4 added ctx_size parameter (#148) Justin Suess 2023-03-15 15:42:40 -04:00
  • 16b2c61a22 fixed color reset on exit (#149) Justin Suess 2023-03-15 15:39:38 -04:00
  • 977295c700 Fix potential licensing issue (#126) Musab Gultekin 2023-03-15 22:39:06 +03:00
  • 956dfda8ad Use tokenizer.vocab_size() instead of hardcoding 32000 in convert-pth-to-ggml.py (#142) Ronsor 2023-03-15 12:37:50 -07:00
  • 113e685d18 inline -> static inline for "bytesFromNibbles" (#161) hoangmit 2023-03-15 15:05:14 -04:00
  • 47857e564c Don't use vdotq_s32 if it's not available (#139) Ronsor 2023-03-14 12:34:37 -07:00
  • 60f819a2b1 Add section to README on how to run the project on Android (#130) Radoslav Gerganov 2023-03-14 15:30:08 +02:00
  • 97ab2b2578 Add Misc section + update hot topics + minor fixes Georgi Gerganov 2023-03-14 09:43:52 +02:00