* support flash-attention for fp32/fp16/Q4/Q5/Q8
* rm warning
* update for JIT