807b0c49ff
* llama : add inference support and model types for T5 and FLAN-T5 model families
* llama : add new API functions to support encoder-decoder models: llama_encode(), llama_model_has_encoder(), llama_model_decoder_start_token()
* common, llama-cli, llama-batched : add support for encoder-decoder models
* convert-hf : handle shared token embeddings tensors in T5Model
* convert-hf : add support for SentencePiece BPE tokenizer in T5Model (for Pile-T5 models)
* convert-hf : add MT5ForConditionalGeneration and UMT5ForConditionalGeneration to architectures supported by T5Model
* convert : add t5 tokenizer tests, use "slow" HF tokenizer for t5
---------
Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
113 lines
1.9 KiB
Plaintext
ied 4 ½ months
|
|
__ggml_vocab_test__
|
|
Führer
|
|
__ggml_vocab_test__
|
|
|
|
__ggml_vocab_test__
|
|
|
|
__ggml_vocab_test__
|
|
|
|
__ggml_vocab_test__
|
|
|
|
__ggml_vocab_test__
|
|
|
|
__ggml_vocab_test__
|
|
|
|
|
|
__ggml_vocab_test__
|
|
|
|
|
|
|
|
__ggml_vocab_test__
|
|
|
|
|
|
|
|
|
|
__ggml_vocab_test__
|
|
|
|
|
|
__ggml_vocab_test__
|
|
Hello world
|
|
__ggml_vocab_test__
|
|
Hello world
|
|
__ggml_vocab_test__
|
|
Hello World
|
|
__ggml_vocab_test__
|
|
Hello World
|
|
__ggml_vocab_test__
|
|
Hello World!
|
|
__ggml_vocab_test__
|
|
Hello, world!
|
|
__ggml_vocab_test__
|
|
Hello, world!
|
|
__ggml_vocab_test__
|
|
this is 🦙.cpp
|
|
__ggml_vocab_test__
|
|
w048 7tuijk dsdfhu
|
|
__ggml_vocab_test__
|
|
нещо на Български
|
|
__ggml_vocab_test__
|
|
កាន់តែពិសេសអាចខលចេញ
|
|
__ggml_vocab_test__
|
|
🚀 (normal) 😶🌫️ (multiple emojis concatenated) ✅ (only emoji that has its own token)
|
|
__ggml_vocab_test__
|
|
Hello
|
|
__ggml_vocab_test__
|
|
Hello
|
|
__ggml_vocab_test__
|
|
Hello
|
|
__ggml_vocab_test__
|
|
Hello
|
|
__ggml_vocab_test__
|
|
Hello
|
|
__ggml_vocab_test__
|
|
Hello
|
|
Hello
|
|
__ggml_vocab_test__
|
|
(
|
|
__ggml_vocab_test__
|
|
|
|
=
|
|
__ggml_vocab_test__
|
|
' era
|
|
__ggml_vocab_test__
|
|
Hello, y'all! How are you 😁 ?我想在apple工作1314151天~
|
|
__ggml_vocab_test__
|
|
!!!!!!
|
|
__ggml_vocab_test__
|
|
3
|
|
__ggml_vocab_test__
|
|
33
|
|
__ggml_vocab_test__
|
|
333
|
|
__ggml_vocab_test__
|
|
3333
|
|
__ggml_vocab_test__
|
|
33333
|
|
__ggml_vocab_test__
|
|
333333
|
|
__ggml_vocab_test__
|
|
3333333
|
|
__ggml_vocab_test__
|
|
33333333
|
|
__ggml_vocab_test__
|
|
333333333
|
|
__ggml_vocab_test__
|
|
Cửa Việt
|
|
__ggml_vocab_test__
|
|
discards
|
|
__ggml_vocab_test__
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
🚀 (normal) 😶🌫️ (multiple emojis concatenated) ✅ 🦙🦙 3 33 333 3333 33333 333333 3333333 33333333 3.3 3..3 3...3 កាន់តែពិសេសអាច😁 ?我想在apple工作1314151天~ ------======= нещо на Български ''''''```````""""......!!!!!!?????? I've been 'told he's there, 'RE you sure? 'M not sure I'll make it, 'D you like some tea? We'Ve a'lL
|
|
__ggml_vocab_test__
|