Microsoft’s new ‘flash’ reasoning AI model ships with a hybrid architecture — making its responses 10x faster with a “2 to 3 times average reduction in latency”
Microsoft recently unveiled Phi-mini-flash-reasoning as its latest entry in the Phi family of small AI models, developed using a new hybrid architecture (SambaY), making its responses 10x faster.