Emsi - EMSI

Post 2 gave us the math: 2 bytes per parameter in BF16. Apply that to a…

AI / ML ARTICLE

LLMs in Production #2: How Much VRAM Do I Need?

2026-03-13

Mariusz Woloszyn

1 Comment

Before you download the 30 gigabytes, before you request the cluster, before you spin up the…

AI / ML ARTICLE

LLMs in Production #1: Precision Explained

2026-03-13

Mariusz Woloszyn

You download the model. Thirty gigabytes of something arrives on your drive. You run the loading…

ARTICLE

The Hidden Human Costs Behind Today’s AI

2025-11-22

Mariusz Woloszyn

The most striking insights about artificial intelligence rarely come from glossy tech demos or corporate press…

AI / ML ARTICLE

The Switchboard Paradox: Are We Solving Yesterday’s Problems with Tomorrow’s Tools?

2025-10-28

Mariusz Woloszyn

When intelligence becomes a substitute for innovation Imagine it’s 1956. Bell Labs has just achieved the…

AI / ML ARTICLE

We Panic About AI Hallucinations While Ignoring 94% Human Error Rates

2025-08-26

Mariusz Woloszyn

Picture this: It’s 2001, and Enron is riding high as one of America’ds most innovative companies.…

AI / ML ARTICLE

When Code Training Goes Wrong: The Surprising Case of Emergent AI Misalignment

2025-08-21

Mariusz Woloszyn

Imagine you fine-tune an LLM on your company’s internal codebase, hoping the model will better understand…

AI / ML ARTICLE

GPT-5 is Here, and It’s Not What You Expected

2025-08-07

Mariusz Woloszyn

OpenAI just dropped their GPT-5 System Card, and while everyone was expecting another monolithic model upgrade,…

AI / ML ARTICLE

The AI Agent That Actually Knows How to Build ML Models

2025-08-05

Mariusz Woloszyn

How Google’s MLE-STAR is changing the game by doing what most ML engineers do: Google first,…

AI / ML ARTICLE

Qwen-Image: Finally, an AI That Can Actually Write

2025-08-04

Mariusz Woloszyn

How Qwen’s new 20B parameter model solved the text rendering problem that’s been plaguing image generation…

AI / ML ARTICLE

Putting Math Behind the Madness: A Theoretical Framework for LLM Hallucinations

2025-08-02

Mariusz Woloszyn

How researchers are organizing rigorous mathematical foundations for one of AI’s most persistent problems The Problem…

AI / ML ARTICLE

The Hidden Homework Problem: How ArxivRoll Exposed AI’s Inflated Test Scores

2025-08-01

Mariusz Woloszyn

A new framework reveals that some leading AI models may be getting significant artificial score boosts…

AI / ML ARTICLE

Teaching AI Models to Debug Themselves: The Reflect, Retry, Reward Method

2025-07-28

Mariusz Woloszyn

When Small Models Beat Giants Here’s a result that should make anyone rethinking the “bigger is…

AI / ML ARTICLE

Financial Dynamics in Agentic AI: Cursor’s Rise Versus GitHub Copilot

2025-06-12

Mariusz Woloszyn

AI startups have been reshaping investment landscapes, and a closer look at the financial dynamics of…

AI / ML ARTICLE

Mistral AI Releases Codestral Embed: A Specialized Code Embedding Model

2025-05-28

Mariusz Woloszyn

Mistral AI has released Codestral Embed, their first embedding model designed specifically for code representation and…

ARTICLE Math

Holy Bayes! When a Math Guy Becomes Pope

2025-05-09

Mariusz Woloszyn

Prelude: From Priors to Pontiff When the white smoke finally curled above St Peter’s, statisticians everywhere refreshed…

AI / ML ARTICLE

In Pursuit of Efficiency: Rethinking AI with DeepSeek-V3-0324

2025-03-25

Mariusz Woloszyn

When technical prowess meets practical efficiency, the outcome challenges both conventional wisdom and entrenched market hierarchies.…

AI / ML ARTICLE Tools

Awesome MCP Clients, A New Way To Interact With LLMs

2025-03-18

Mariusz Woloszyn

The Model Context Protocol (MCP) is rapidly establishing itself as a foundational framework in the AI…

AI / ML ARTICLE

The New OpenAI Responses API: A Technical Deep Dive

2025-03-11

Mariusz Woloszyn

The recent introduction of OpenAI’s Responses API marks an evolution in how developers interact with large…

AI / ML ARTICLE

Anthropic’s Claude Code: Terminal-Based AI Coding Assistant That Might Change Your Dev Workflow

2025-02-24

Mariusz Woloszyn

Anthropic has recently launched Claude Code, a terminal-based AI coding assistant that integrates directly into developers’…

Category: Emsi

LLMs in Production #3: Reading the Model Spec

LLMs in Production #2: How Much VRAM Do I Need?

LLMs in Production #1: Precision Explained

The Hidden Human Costs Behind Today’s AI

The Switchboard Paradox: Are We Solving Yesterday’s Problems with Tomorrow’s Tools?

We Panic About AI Hallucinations While Ignoring 94% Human Error Rates

When Code Training Goes Wrong: The Surprising Case of Emergent AI Misalignment

GPT-5 is Here, and It’s Not What You Expected

The AI Agent That Actually Knows How to Build ML Models

Qwen-Image: Finally, an AI That Can Actually Write

Putting Math Behind the Madness: A Theoretical Framework for LLM Hallucinations

The Hidden Homework Problem: How ArxivRoll Exposed AI’s Inflated Test Scores

Teaching AI Models to Debug Themselves: The Reflect, Retry, Reward Method

Financial Dynamics in Agentic AI: Cursor’s Rise Versus GitHub Copilot

Mistral AI Releases Codestral Embed: A Specialized Code Embedding Model

Holy Bayes! When a Math Guy Becomes Pope

In Pursuit of Efficiency: Rethinking AI with DeepSeek-V3-0324

Awesome MCP Clients, A New Way To Interact With LLMs

The New OpenAI Responses API: A Technical Deep Dive

Anthropic’s Claude Code: Terminal-Based AI Coding Assistant That Might Change Your Dev Workflow

US Government Halts Anthropic’s AI Models Citing Security Fears, Sparks Industry Controversy

The Build Log That Spoke to AI Agents

Half a Billion Dollar AI Blunder: The Hidden Costs of Unchecked Tech Spending

ECC v2.0: Elevating Agentic Work with Versatile Operator Systems and Open-Source Innovation

The Vulnerability Bottleneck Has Moved

China’s First Real Gaming GPU Is Here — And That Matters More Than FPS

Shai-Hulud and the Danger of Trusted Packages

When the Future Remembers First

YellowKey Turns BitLocker Into an Open Door