101848 - The mathematics behind LLM

N. Lygeros

The mathematics behind LLM are rather simple and it’s easy to understand how they work. The real problem is why they work so well on some tasks and fail on others. This works as an obstruction for the forecasting performance. That’s why their evolution is more or less empirical. So we need deeper mathematical tools maybe like fractal geometry to understand their essence. @grok thoughts?

Post to X