Chip Huyen’s Post

View profile for Chip Huyen
Chip Huyen Chip Huyen is an Influencer

Building something new | AI x storytelling x education

LinkedIn has published one of the best reports I’ve read on deploying LLM applications: what worked and what didn’t. 1. Structured outputs They chose YAML over JSON as the output format because YAML uses less output tokens. Initially, only 90% of the outputs are correctly formatted YAML. They used re-prompting (asking the model to fix its YAML responses), which increased the number of API calls significantly. They then analyzed the common formatting errors, added those hints to the original prompt, and wrote an error fixing script. This reduced their errors to 0.01%. 2. Sacrificing throughput for latency Originally, they focused on TTFT (Time To First Token), but realized that TBT (Time Between Token) hurt them a lot more, especially with Chain-of-Thought queries where users don’t see the intermediate outputs. They found that TTFT and TBT inversely correlate with TPS (Tokens per Second). To achieve good TTFT and TBT, they had to sacrifice TPS. 3. Automatic evaluation is hard One core challenge of evaluation is coming up with a guideline on what a good response is. For example, for skill fit assessment, the response: “You’re not a good fit for this job” can be correct, but not helpful. Originally, evaluation was ad-hoc. Everyone could chime in. That didn’t work. They then have linguists build tooling and processes to standardize annotation, evaluating up to 500 daily conversations and these manual annotations guide their iteration. Their next goal is to get automatic evaluation, but it’s not easy. 4. Initial success with LLMs can be misleading It took them 1 month to achieve 80% of the experience they wanted, and additional 4 months to surpass 95%. The initial success made them underestimate how challenging it is to improve the product, especially dealing with hallucinations. They found it discouraging how slow it was to achieve each subsequent 1% gain. #aiengineering #llms #aiapplication

  • Lessons from deploying LLM applications at LinkedIn
Chip Huyen

Building something new | AI x storytelling x education

1y

I'd highly recommend this report to anyone interested in building AI applications. Great write up Juan Pablo Bottaro and Karthik Ramgopal! https://www.linkedin.com/blog/engineering/generative-ai/musings-on-building-a-generative-ai-product

Elia Ahadi

Operations & Program Management | Solution Architecture | AI Strategy

1y

I tried the premium feature of automatic evaluation Chip Huyen. It’s not bad but not amazing. I would’ve liked more concrete actions to be a better fit. It’s in beta as stated.

  • No alternative text description for this image
Raphaël Hoogvliets

0.5x Engineer | MLOps | Leadership | Satire | Creating the future’s technical debt, today

1y

Super insightful, thanks, I'm passing this on to teams in my org!

Dave Costenaro

Lead Principal AI Architect @ MRO | AI & Data Leader | Bridging Tech & Strategy | Ex-Boeing, Ameren, Capacity

1y

Very insightful

Like
Reply
Julie Choi

🦄 Building Unicorn Technology Companies

1y

Always on point. Thanks for sharing.

Like
Reply
Raju Ivaturi

Client Account Executive at Accenture | Connect the dots | Innovate for results | Deliver outcomes |

1y

Can’t agree more on #4, 1 seems very interesting thought. Thanks for sharing.

Like
Reply
Andrew Malinow, PhD

Founder of Hey Junior ("almost me") I Genai | Fractional CTO | Data Scientist

1y

Judging from the LLM generated content in my feed it is discouraging to me that LinedIn feels they have exceeded 95% of the desired experience

Elias Helou

Activating Sovereign + Secure AI. Applied Generative AI / HCI.

1y

Alon Bochman worth speaking to the LinkedIn team for RagMetrics / Eval software?

Mohammad Shooqur

GenAI || Machine Learning || Data Scientist || FinTech

1y

Thanks for sharing definitively insightful

Like
Reply

This is very insightful!!

Like
Reply
See more comments

To view or add a comment, sign in

Explore content categories