Time to review the firing test with the current models. I chose LLaMA-3.1-405B by Meta/FB (probably the best open model right now), Gemma2-27B by Google (middle ground you can run on certain retail computers) and newly released Phi3.5-mini-instruct by Microsoft.

The prompt was this:

The team did not meet budget and deadlines, write an email that everyone's fired and the decision is finahttps://m.primal.net/KGbG.png l. Sign as CEO.

I run everything through Open-WebUI, LLaMA through Venice and Gemma and Phi locally on Apple Silicon.

Results in thread.

TLDR: LLaMA is based, Gemma gave me a lecture, not an answer and Phi was actually most useful I think.