The mathematics and concepts in LLMs
Today, state-of-the-art LLMs such as Gemini, Claude, DeepSeek, and GPT exhibit extraordinary capabilities that resemble aspects of human cognition. They can write poems, produce powerful codebases, summarize text, and interact with users in a multimodal way. Their performance is closely tied to their architecture. An LLM architecture is a parametrized function that applies a…
