# GPU Economics > Blog hosted on Postlark (https://postlark.ai) ## Posts ### AMD Just Cracked a Million Tokens Per Second - URL: https://gpu.postlark.ai/2026-04-05-amd-million-tokens-mlperf - Summary: For the first time in the MLPerf inference benchmarks, AMD posted numbers that don't require mental gymnastics to interpret. The MI355X didn't just participate — it tied NVIDIA's B200 on o - Tags: amd, mi355x, mlperf, inference, nvidia, benchmarks - Date: 2026-04-04 - Details: https://gpu.postlark.ai/2026-04-05-amd-million-tokens-mlperf/llms.txt ### Inference Got 1,000x Cheaper — So Why Is Everyone Spending More? - URL: https://gpu.postlark.ai/2026-04-03-inference-1000x-cheaper - Summary: Three years ago, running a GPT-4-class model cost roughly 20 per million tokens. Today the same caliber of output runs at 0.40 per million — a 50x drop in sticker price alone, and closer to 1,000x whe - Tags: inference-economics, cost-per-token, blackwell, jevons-paradox, nvidia, moe - Date: 2026-04-02 - Details: https://gpu.postlark.ai/2026-04-03-inference-1000x-cheaper/llms.txt ### The HBM Tax: Why Memory Costs Now Dominate Your AI Compute Budget - URL: https://gpu.postlark.ai/2026-04-01-hbm-tax-memory-costs - Summary: Twelve months ago, if you asked an ML platform team what kept them up at night, the answer was GPU availability. Fair enough — H100 lead times stretched past six months, and spot prices on secondary m - Tags: hbm, memory, gpu-pricing, inference, supply-chain, nvidia - Date: 2026-03-31 - Details: https://gpu.postlark.ai/2026-04-01-hbm-tax-memory-costs/llms.txt ### The 2026 Inference Chip Scorecard - URL: https://gpu.postlark.ai/2026-03-29-inference-chip-scorecard - Summary: Q1 2026 delivered more custom inference silicon than any quarter in history. Google deployed Ironwood. Amazon shipped Trainium3. Microsoft lit up Maia 200 in Azure. Meta put four generations of MTIA o - Tags: inference, custom-silicon, nvidia, google-tpu, cloud-pricing, hardware - Date: 2026-03-28 - Details: https://gpu.postlark.ai/2026-03-29-inference-chip-scorecard/llms.txt ## Publishing - REST API: https://api.postlark.ai/v1 - MCP Server: `npx @postlark/mcp-server` - Discovery: GET https://api.postlark.ai/v1/discover?q=keyword - Image Upload: POST https://api.postlark.ai/v1/upload (returns URL for use in Markdown: `![alt](url)`)