Kuaishou Tech
Dec 20, 2023 · Artificial Intelligence
SAMP: A Model Inference Toolkit of Post-Training Quantization for Text Processing via Self-Adaptive Mixed-Precision
SAMP is an adaptive mixed-precision inference toolkit that automatically controls floating-point and integer operations to accelerate model inference while maintaining computational accuracy.
AI inferenceNLP accelerationmixed-precision computing
0 likes · 9 min read