Our First Mistake Was Treating LLMs Like APIs

https://hackernoon.imgix.net/images/5wpKgV75aONqkTJlafw2yQmK9yd2-hb03bo4.png

One of the common mistakes we made in our first LLM system was using it as a standard API.

Send a request. Get a response. Return it to the user.

It started out fine. The first one was simple to build, simple to demo, and good enough for early users. However, when the traffic started to grow, the issues became more noticeable. The expenses began to increase at a rate that was higher than anticipated. Latency became inconsistent. Slightly different results were obtained with similar requests. It was difficult to debug, since we had almost no visibility into what was going on in the flow.

It's not that the LLM was bad. The issue was the architecture that surrounded it.

It Was A Mistake to Think of LLMs as Simple Endpoints

Typical APIs are predictable. The same input gives the same type of output. You can measure the response time,...

Copyright of this story solely belongs to hackernoon.com. To see the full text click HERE

Read more

https://image.theregister.com/5242949.jpg?imageId=5242949&x=0&y=0&cropw=100&croph=100&panox=0&panoy=0&panow=100&panoh=100&width=1200&height=683

America's top cyber-defense agency left a GitHub repo open with passwords, keys, tokens – and incredibly obvious filenames

I wonder what's in 'external-secret-repo-creds.yaml' and 'AWS-Workspace-Firefox-Passwords.csv'? The US Cybersecurity and Infrastructure Security Agency (CISA) left open a GitHub repository named “Private-CISA” containing plain-text passwords, private keys, tokens, and secrets – with obvious file names like “external-secret-repo-creds.yaml” and “AWS-Workspace-Firefox-Passwords.csv” – for six

https://techcrunch.com/wp-content/uploads/2026/05/GettyImages-2259661359.jpg?w=1024

SpaceX S-1: xAI had a $6.4B operating loss on $3.2B in revenue in 2025; Grok and X had 550M MAUs combined as of March 2026, and 117M used Grok's AI features

Sponsor Posts Niantic Spatial: World models need real-world data — Scaniverse is the gateway to spatial services — self-serve and built for AI and robotics. Large-area 3D reconstruction from 360° cameras and precise localization, anywhere machines operate. App Spotlight: Quo for Zoho CRM — App Spotlight brings you hand-picked solutions that enhance your