Microsoft’s new “MInference” technology promises to speed up large language models substantially by cutting the time they spend pre-processing long text inputs (the pre-fill stage, before the first output token is generated) by up to 90%. An interactive demo on Hugging Face lets developers test the technology and explore its capabilities.