213c4a0b81
* support flash-attention for fp32/fp16/Q4/Q5/Q8
* rm warning
* update for JIT