On-device AI models optimized for smartphone deployment: mobile LLMs, edge inference, and efficient architectures