Loading…
sources · 1
diva @divaagurlxw · X post
As an AI Engineer. Please learn
>Harness engineering, not just prompt engineering
>Context engineering, not just long prompts
>Prompt caching vs. semantic caching tradeoffs
>KV cache management, eviction, reuse, and memory pressure at scale
>Prefill vs. decode latency and
Source: https://x.com/divaagurlxw/status/2062419864908951606