Hi Richard, the Inference is from the cloud with Together.ai and all my testing over the last week or so with it has cost me about 6 US cents so far. The model is a big one, specifically
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
I have a local "AI Tower" machine in my spare room with the following spec
Processor: AMD Ryzen 5 5600
Memory: Corsair 32GB (2x16GB) DDR4 3200MHz
Graphics: Palit RTX 3060 12GB
I have configured #Ollama and downloaded several models. It will load and run qwen:32b
I can give you teacher access and a playground course to the site at
www.flossed.uk if you are interested. And also run some of the full prompts against my local machine to check the accuracy (it will certainly be slow of course)...
Best wishes from (old) York.
You can also contact me directly at marcusavgreen at gmail.com