128142fe7d
* tests: allow loading test-backend-ops tests from json * add error threshold based on op * add error when file cannot be read * add graph operator json extraction tool * add nb parameter for non-contiguous input tensors * fix view check * only use view if non-contiguous/permuted, use C++ random instead of rand() * replace internal API calls with public llama_graph_reserve call * reduce test description length * fix nb[0] not getting set for view * add name to tests * fix inplace error * use text file instead of json * move llama_graph_reserve function to new llama-ext header, move export-graph-ops to tests/ * fix missing declaration * use pragma once * fix indent * fix Windows build
13 lines
337 B
C
13 lines
337 B
C
#pragma once
|
|
|
|
#include "llama-context.h"
|
|
#include "ggml.h"
|
|
#include "stdint.h"
|
|
|
|
// Reserve a new compute graph. It is valid until the next call to llama_graph_reserve.
|
|
LLAMA_API struct ggml_cgraph * llama_graph_reserve(
|
|
struct llama_context * ctx,
|
|
uint32_t n_tokens,
|
|
uint32_t n_seqs,
|
|
uint32_t n_outputs);
|