A individual contribution was pointed out in which a user established a fused GEMM for int4, which can be helpful for teaching with fixed sequence lengths, furnishing the fastest Resolution.LORA overfitting concerns: A different user queried irrespective of whether appreciably decreased training decline compared to validation reduction signals over