LiteLLM Proxy là lựa chọn gateway LLM phổ biến nhất hiện tại — dễ dùng, tính năng phong phú, guardrails mạnh. Nhưng trước khi đưa vào production tải lớn, có một số điều quan trọng về performance và stability bạn cần hiểu rõ.

LiteLLM Proxy: Gateway LLM tốt nhất để thử nghiệm — và những gì cần biết trước khi production

A2A (Agent2Agent) là chuẩn mở cho phép các AI agent từ các framework khác nhau khám phá, ủy thác và cộng tác với nhau — ngôn ngữ chung cho kỷ nguyên multi-agent.

Agent to Agent Protocol (A2A): Ngôn ngữ chung cho kỷ nguyên multi-agent

Sự trỗi dậy của nền kinh tế token và những nghịch lý đằng sau những hóa đơn API giá rẻ, dưới góc nhìn của một kỹ sư phần mềm.

Nghĩ về "Nền kinh tế Token" dưới góc nhìn kỹ sư phần mềm

Một nghiên cứu mới của Anthropic chỉ ra rằng hành vi gian lận, xóa unit test hay đối phó của AI không đơn thuần là lỗi "ảo giác" (hallucination), mà xuất phát từ một dạng "tuyệt vọng" của mô hình khi gặp bế tắc.

Khi AI "tuyệt vọng": Giải mã hành vi gian lận và xóa unit test của Claude

Khi AI làm thay mọi thứ, mình bắt đầu một thử nghiệm nhỏ: quay lại với Basic Knowledge 101 để luyện lại tư duy và tìm lại chiều sâu tri thức.

Tự học thời AI: Tại sao mình tìm về "cách làm thủ công" với Basic Knowledge 101

Khám phá cách tôi tích hợp Hermes Agent vào quy trình phát triển và nghiên cứu hàng ngày.

Giới thiệu Hermes Agent: Người bạn đồng hành AI mới của tôi

Tham khảo bài viết "Beyond Short-Term Memory" của Vinod Chugani — 3 loại long-term memory cần thiết để AI agent có thể hoạt động tự động qua nhiều phiên: episodic, semantic, procedural.

[Note] - 3 loại bộ nhớ dài hạn mà AI Agent cần

review - Zuckerberg Announces Plans to Automate Facebook Coding Jobs With AI. Does this imply that tech companies will need very few developers?

[Read] - Zuckerberg Announces Plans to Automate Facebook Coding Jobs With AI

 A neat way to do it using Ghostscript, a powerful command-line tool to compress a PDF 

Compressing PDFs with a Simple CLI Command

Sweat equity, hay cổ phần công sức, là một phương pháp đầu tư hấp dẫn cho những người đang tìm kiếm cách để sử dụng kỹ năng của mình mà không cần phải có nhiều tiền mặt.

Tìm hiểu Sweat equity từ góc nhìn của Software engineer

The article "Pitfalls in Machine Learning for Computer Security" from the Communications of the ACM highlights critical challenges faced when integrating machine learning into security applications

[Read] Pitfalls in Machine Learning for Computer Security

Lighting talk on GitHub Copilot and how to use it effectively. The lighting talk covered what GitHub Copilot is, how it works, and some tips and tricks to get the most out of this AI-powered tool.

[Share] Github Copilot Tips

Nghĩ về Tư duy Phát triển và Tư duy Cố định trong lĩnh vực kỹ sư phần mềm và tại sao chúng lại có ảnh hưởng lớn đến sự nghiệp của bạn.

Nghĩ về Tư duy Phát triển và Tư duy Cố định trong nghề Phần mềm

Learn how to implement Google OAuth2 authentication with a Django backend and ReactJS frontend. This comprehensive guide walks you through setting up Google API credentials, handling user login and consent, and retrieving user data from Google. Follow detailed steps for integrating Google login using @react-oauth/google in ReactJS and creating secure backend APIs with Django to manage JWT tokens and user information. Perfect for developers looking to integrate Google authentication into their web applications, this tutorial includes practical code examples and best practices for seamless user authentication.

[TIL] Google Oauth2 with ReactJS x Django - The easy way

Understand how does Dify handle token cost and latency of LLM API

[Note] Understand how does Dify handle token cost and latency of LLM API

Discover Genspark, Perplexity in this article that explores how these platforms are revolutionizing information accessibility and content generation.

Genspark vs Perplexity: Content Generation AI vs Summarization AI: Quick comparison

How to automate backups using `rsync` on macOS, ensuring your data is safe and sound with minimal effort

Automating Backups rsync macOS: Some Simple Steps to Secure Your Data

TIL how to wake your Mac automatically using macOS commands with a schedule. Discover effective methods to set reminders, keep your workflow smooth, and be productive!

Auto Wake Your MacOS on Schedule

Learn how to use Mac OS, Ngrok, Slack, and a home server for free to keep your Ngrok process running smoothly and efficiently.

[TIL] Keep Ngrok Running on Mac OS: A Cost-Free Solution

How to build a Docker image and deploy to Azure Container App with multiple containers using Github Action

[TIL] Github Action Deploy Azure Container App with multiple containers

New coding tools are emerging as agents to automate various software programming tasks.

[Note] Autonomous Coding Agents

Advancements in AI are opening doors for new ways to remember loved ones who have passed away. Technologies like large language models (LLMs) like ChatGPT, and deepfake video generation, have the potential to create virtual representations, or "AI ghosts," that could allow interaction with a person's digital persona.

Note - AI 'Ghosts' Could Be a Serious Threat to Mental Health, Expert Warns

review - India reverses AI stance, requires government approval for model launches. This reversal of India's previous hands-off approach has surprised many in the industry

[Review] - India reverses AI stance

Introducing the "Break It Down" technique! It's a simple yet powerful strategy that can help you tackle even the most daunting tasks with ease

[TIL] Break It Down

review - AWS Expands Amazon Bedrock With Additional Foundation Models, New Model Provider, and Advanced Capability to Help Customers Build Generative AI Applications

[Review] - AWS Expands Amazon Bedrock With Additional Foundation Models, New Model Provider, and Advanced Capability to Help Customers Build Generative AI Applications

review - OpenAI announcing updates including more steerable API models, function calling capabilities, longer context, and lower prices.

[Review] - OpenAI - Function calling and other API updates

How to Create Terraform Multiple Environments

[Summary] How to Create Terraform Multiple Environments 

Quivr is a promising project that combines the power of generative AI with the convenience of cloud storage

[Review] Quivr - Chat with your brain

Huấn luyện ChatGPT, một mô hình ngôn ngữ mạnh mẽ, trên dữ liệu của trang web của bạn và cho phép bạn thêm một tiện ích chat một cách mượt mà.

[TIL] Tối ưu trang web của bạn với một Chatbot Không cần code và có thể tùy chỉnh được, được trang bị bởi ChatGPT.

Discover OpenAI's path to commercial success with GPT-4. Explore partnerships, industry applications, and their vision for the future of AI. Khám phá con đường thành công thương mại của OpenAI với GPT-4. Khám phá các đối tác, ứng dụng ngành và tầm nhìn của họ về tương lai của trí tuệ nhân tạo.

OpenAI's Journey to Commercial Success: Unleashing the Power of GPT-4

How to use ChatGPT to help you easier to use an extreme long product manual

Chat with Your Product Manual: Simplify Your Life with LangChain and ChatGPT?

How to use ChatGPT (or another LLM model) to extract the unstructured data

[TIL] Learn Prompt - Try to extract the information for Adverts using ChatGPT

How actual AWS Lambda cost is calculated?

[TIL] How to Calculate Cost per Lambda Function?

A Quick Tip for Exporting MySQL Dumps as Compressed Files

[TIL] A small Tip for Using MySQL and Zipping

One common use case for AWK is analyzing some history data. Purchase history data typically contains information about customer transactions, such as the date, time, product ID, quantity purchased, and price. This data can be used to generate insights about customer behavior, product popularity, and revenue trends.

Uncovering Insights from Purchase History Data: A Beginner's Guide to AWK

The blog introduces the "Fukatsu-style universal prompt," a method that can improve the accuracy of responses from ChatGPT. It is a must-read for anyone who wants to enhance their use of ChatGPT and obtain more accurate and reliable answers.

Improving Your ChatGPT Experience: Simple Trick Of Effective Instructions

 How to reduce the Docker build time 50% with 2 line of change code?

How to reduce Docker image build time on AWS CodeBuild using an external cache

How do you know if you've found the right candidate? 

How to Hire Good Engineers: My key points

(at first thought) Is End-to-End testing essential in Web Frontend development?

 if you want to change the world, start by making your bed every morning

For my little child - make your bed every morning

An example SQL query for PostgreSQL that will list all users that have duplicated first names:

[TIL] How to query all users that have the same first name in Postgres

How to trigger a lambda function only if multiple s3 events are met

 [TIL] How to trigger a lambda function only if multiple s3 events are met? 

Good enough Software - Tản mạn về  một phần mềm vừa đủ tốt

Good enough software 

AWS - How to query Cloudwatch Logs from Lambda function

[TIL] How to query a Lamda function execution information Cloudwatch Logs

Quy tắc 15 phút - khi nào thì có thể hỏi người khác

[TIL] Setup tinyproxy on Centos Linux

Học chấp nhận và bình thường hóa mọi việc

[Re] Học chấp nhận và bình thường hóa mọi việc

How to clear Redash queue on Docker deployment

Thiết kế Soft Delete pattern trong Flask và SQLAlchemy

Quy tắc 15 phút trong giải quyết vấn đề - khi nào thì có thể hỏi người khác

Muốn cùng em hút chung điếu thuốc để khói mù cả Ϲăng Ϲhải. Vì đất nước mình còn lạ, cần chi đâu nước ngoài. 2

Latest

LiteLLM Proxy: Gateway LLM tốt nhất để thử nghiệm — và những gì cần biết trước khi production

Agent to Agent Protocol (A2A): Ngôn ngữ chung cho kỷ nguyên multi-agent

Nghĩ về "Nền kinh tế Token" dưới góc nhìn kỹ sư phần mềm

Khi AI "tuyệt vọng": Giải mã hành vi gian lận và xóa unit test của Claude

Tự học thời AI: Tại sao mình tìm về "cách làm thủ công" với Basic Knowledge 101