Comments for Phoenix Game Development

Comment on Spiritus Astrum [P0-B10]: p1488: Towards Early Access by Z Image

Z Image — Sat, 03 Jan 2026 11:39:30 +0000

The navmesh detail really caught my attention! It sounds challenging to have NPCs follow commands reliably, especially considering terrain issues.

Comment on Spiritus Astrum [P0-B10]: p1488: Towards Early Access by AI ASMR

AI ASMR — Sat, 27 Dec 2025 10:35:13 +0000

I love reading about the complex navmesh generation and NPC commands: a real look into game development! It’s fascinating how these systems evolve over time.

Comment on AI: Estimated token generation rates for selected model sizes (and algorithm to calculate same) by PhoenixGames

PhoenixGames — Thu, 30 Oct 2025 16:00:41 +0000

In reply to Name.

Thank you for your reply.

The numbers and algorithms posted above were always rough rules of thumb. Having tested them in real-world conditions, I can say they are certainly not entirely accurate, but they are not entirely inaccurate either.

I was running 120B Q8 models with two 3090s (the rest of the model being on the CPU) and I was getting anywhere from 1 to 2 tk/s, depending on the context, etc.

The graph above shows 3.43 tk/s for 140 GB/s RAM bandwidth, which is noticeably higher than I was getting in reality, but again, it’s a rough estimate.

The formula I used (memory bandwidth / model size) was based on internet searches, Reddit, etc. I admit, not the most reliable sources, but I couldn’t find anything better at the time. Can you tell me where you got yours?

Another problem that I have noticed is that you are dividing by the GPU bandwidth, so as the GPU bandwidth increases, the token generation rate decreases. Surely it should be the opposite?

For example, you said:

(2 * 120 * 1000) / (936) = 256.410

So, for a GPU with a bandwidth of 936 GB/s, we would get 256 tk/s.

OK, let’s now assume that we roughly double the bandwidth, to 1800 GB/s. We would expect to get about twice the tokens, right? But with your algorithm:

(2*120*1000)/(1800), we get: 133.33

So instead of doubling, the tk/s is halving? That can’t be right, surely, unless I’m missing something?
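
To make the comparison concrete, here is a minimal Python sketch of both expressions evaluated at the two bandwidths discussed above. The function names are made up for illustration, and the 120 GB model size is simply the figure from the example; nothing here is a measured result.

```python
def bandwidth_over_size(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Rule of thumb from the article: memory bandwidth / model size."""
    return bandwidth_gb_s / model_size_gb


def reply_expression(params_billion: float, bandwidth_gb_s: float) -> float:
    """The expression quoted from the reply: (2 * n * 1000) / b_w."""
    return (2 * params_billion * 1000) / bandwidth_gb_s


if __name__ == "__main__":
    # 936 GB/s and 1800 GB/s are the two bandwidths used in the comment.
    for bandwidth in (936, 1800):
        rule_of_thumb = bandwidth_over_size(bandwidth, 120)
        reply_value = reply_expression(120, bandwidth)
        print(f"{bandwidth} GB/s: {rule_of_thumb:.2f} vs {reply_value:.2f}")
```

The first value grows in proportion to the bandwidth, while the second shrinks as the bandwidth grows, which is the behaviour questioned above.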

Comment on AI: Estimated token generation rates for selected model sizes (and algorithm to calculate same) by Name

Name — Thu, 30 Oct 2025 15:41:40 +0000

Hello, the information in this article is absolutely wrong. If your inference were really memory-bound and you could fit the model into a single GPU, you could get away with:

(2 * n * 1000) / (b_w)

where

n – number of billion parameters
b_w – GPU bandwidth in GB/s

Taking your example:

> So, if the model is running entirely on the GPU (With a bandwidth of 936 GB/s) and the model size is 120GB, then the token generation speed would be:

we would have

(2 * 120 * 1000) / (936) = 256.410

which, again, would be close if the model didn’t need to be split across several GPUs and the card were 100% memory-bound. This depends on the model as much as on the card.
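
One possible reading of the expression above, based only on its units: if the factor 2 stands for bytes per parameter (an assumption, e.g. FP16 weights), then 2 * n is the model size in GB, dividing by GB/s gives seconds, and the factor 1000 turns that into milliseconds per token rather than tokens per second. The Python sketch below follows that interpretation; the function names and the bytes_per_param default are hypothetical.

```python
def ms_per_token(params_billion: float, bandwidth_gb_s: float,
                 bytes_per_param: float = 2.0) -> float:
    """Time to read every weight once, in milliseconds (assumed reading)."""
    model_size_gb = params_billion * bytes_per_param  # assumption: 2 bytes/param
    seconds = model_size_gb / bandwidth_gb_s
    return seconds * 1000


def tokens_per_second(params_billion: float, bandwidth_gb_s: float,
                      bytes_per_param: float = 2.0) -> float:
    """Reciprocal of the per-token time, expressed in tokens per second."""
    return 1000 / ms_per_token(params_billion, bandwidth_gb_s, bytes_per_param)


if __name__ == "__main__":
    # Same inputs as the example: n = 120, b_w = 936 GB/s.
    print(f"{ms_per_token(120, 936):.2f} ms per token")
    print(f"{tokens_per_second(120, 936):.2f} tokens per second")
```

Under this reading, a larger bandwidth lowers the time per token and raises the tokens per second, as one would expect.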

Comment on Dependency Hell: The Linux Experience by PhoenixGames

PhoenixGames — Wed, 15 Oct 2025 13:26:18 +0000

In reply to Disponat.

What areas are you stuck on? I might be able to offer some advice.

Comment on Taming the Beast: Running DeepSeek V3-0324 Locally by PhoenixGames

PhoenixGames — Wed, 15 Oct 2025 13:25:28 +0000

In reply to Miss Byron.

You are very welcome, thank you for your comment!

Comment on Taming the Beast: Running DeepSeek V3-0324 Locally by Miss Byron

Miss Byron — Wed, 15 Oct 2025 13:12:04 +0000

OMG Thank you SO MUCH for such a detailed description! I’ve been banging my head against the wall for 3 months, trying to figure out how to run it on my fairly good setup! Your insights helped me A LOT. Wishing you all the best!!

Comment on Dependency Hell: The Linux Experience by Disponat

Disponat — Mon, 13 Oct 2025 01:50:18 +0000

As someone currently stuck in dependency hell (also related to ComfyUI), this was cathartic to read.

Comment on SillyTavern Extension: Email Checker by photo to coloring

photo to coloring — Wed, 08 Oct 2025 15:49:50 +0000

Just stumbled on this post about the SillyTavern Email Checker extension—game-changer for anyone using SillyTavern regularly! I’ve had issues with wonky email setups before, so this tool sounds like it’ll save me tons of troubleshooting time. Thanks for sharing the scoop!

Comment on Dependency Hell: The Linux Experience by PhoenixGames

PhoenixGames — Fri, 03 Oct 2025 04:11:13 +0000

In reply to Aphid.

Thanks for your comment!

Yeah, I get that Linux isn’t as bad now as it used to be (or so I read, anyway), and I’m sure if you’re using Linux for regular computing tasks, it’s probably fine.

But if you’re on the cutting edge (such as installing new repos from GitHub, or using rapidly changing AI toolkits, etc.), you are going to break stuff a lot.

You made a great point about the Win32 API, and I design my code like that too (at least, I try to!), so that new functionality doesn’t break old functionality.

Yes, it can be messy to have mylibraryA, mylibraryB, etc., but you know for a fact that you aren’t breaking anything or creating hell for the devs relying on that functionality.

Linux doesn’t seem to have the same attitude.

I think that Linux assumes that the people using the OS are experienced devs with the knowledge and time needed to fix broken code and find workarounds. That’s what I mean in my post: Linux is really a dev’s OS, which is great! But Windows is for when you want to get stuff done, and not waste time fixing things.

I’m not saying Linux can’t be used for professional work. Of course it can, and it is, but in general, the attitude regarding reliability and functionality is different.

Exactly, users really shouldn’t have to rely on virtualisation to get code to work. I use virtualisation for 20-year-old games, not software that’s just a few months old!