There are many models that excel in a variety of topics these days. Small language models (SLMs) such as Phi are designed for complex reasoning. Phi-4, a 14B-parameter SLM, outperforms larger models on math problems. As this visual shows, Phi-4 can outperform Gemini, GPT-4o, and Claude 3.5.
You can try it on Azure AI Foundry, and it will also be available on Hugging Face. Here is a video of the model thinking through and solving a problem that involves complex reasoning.