Context Management and Filtering in Prompts

Background

When building a ChatAgent for document analysis, a typical multi-turn conversation starts with the user uploading a file, and the LLM then answers the user's questions based on the analysis results of that file. From the second question onward, however, some questions are completely unrelated to the file. Always including the file analysis results in the prompt context is then wasteful: the input token count grows and costs rise. The prompt context therefore needs to be managed and filtered.

Analysis

The goal is to dynamically manage the structure and length of the prompt context for each user query.

To do this, perform intent analysis on each question to decide whether the file analysis results are needed at all and, if they are, whether the file should be analyzed again.
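As a rough illustration, the intent analysis step could be sketched as a single classification call. Everything named here is an assumption made for the example: the `Intent` labels, the `CLASSIFY_TEMPLATE` wording, and the `llm` callable (any function that takes a prompt string and returns the model's text reply). The three labels correspond to the three outcomes described in the paragraphs that follow.

```python
from enum import Enum
from typing import Callable


class Intent(Enum):
    UNRELATED = "unrelated"    # question has nothing to do with the file
    REUSE = "reuse"            # the earlier analysis result already covers it
    REANALYZE = "reanalyze"    # needs the file, but a fresh analysis


# Hypothetical prompt wording; tune it for the model actually in use.
CLASSIFY_TEMPLATE = (
    "A user uploaded a file and first asked: {first_query}\n"
    "The user now asks: {question}\n"
    "Reply with exactly one word: unrelated, reuse, or reanalyze."
)


def classify_intent(question: str, first_query: str,
                    llm: Callable[[str], str]) -> Intent:
    """Ask the model for a one-word label and map it onto Intent."""
    reply = llm(CLASSIFY_TEMPLATE.format(first_query=first_query,
                                         question=question))
    label = reply.strip().lower().rstrip(".")
    try:
        return Intent(label)
    except ValueError:
        # Unparseable reply: fall back to re-analysis (an assumed default
        # that favors answer quality over token cost).
        return Intent.REANALYZE
```

Passing `llm` in as a parameter keeps the sketch independent of any particular SDK and makes the function easy to test with a stub.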

If the intent analysis shows that the user’s question is unrelated to the file, then the prompt context can be dynamically composed without including the file analysis results.

If the intent analysis shows that the user’s question is similar to the initial file analysis query, the previous file analysis results can be reused to answer it. Deciding this involves scoring the relevance between the new question and the existing result (“question <=> result”).
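One possible way to score “question <=> result” relevance is sketched below. It uses bag-of-words cosine similarity purely as a dependency-free stand-in. In practice an embedding model or an LLM-based grader would likely be used instead, and word-based tokenization does not suit languages written without spaces. The function names and the `REUSE_THRESHOLD` value are assumptions, not values from the original workflow.

```python
import math
import re
from collections import Counter

REUSE_THRESHOLD = 0.3  # assumed cut-off; calibrate on real conversations


def _bag_of_words(text: str) -> Counter:
    """Lowercase the text and count its word tokens."""
    return Counter(re.findall(r"\w+", text.lower()))


def relevance(question: str, result: str) -> float:
    """Cosine similarity between the word counts of question and result."""
    q, r = _bag_of_words(question), _bag_of_words(result)
    dot = sum(count * r[token] for token, count in q.items())
    norm = (math.sqrt(sum(v * v for v in q.values()))
            * math.sqrt(sum(v * v for v in r.values())))
    return dot / norm if norm else 0.0


def can_reuse(question: str, result: str,
              threshold: float = REUSE_THRESHOLD) -> bool:
    """True when the cached analysis looks relevant enough to reuse."""
    return relevance(question, result) >= threshold
```

A score at or above the threshold means the cached analysis goes into the prompt as-is. A lower score means the question is either unrelated or needs a fresh analysis of the file.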

If the intent analysis indicates that the user’s question differs from the initial file query but must still be answered from the file content, then the file should be re-analyzed by the LLM.
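The three cases above can be tied together in a small context builder. This is a minimal sketch under stated assumptions: the intent arrives as one of the strings `"unrelated"`, `"reuse"`, or `"reanalyze"`, `cached_analysis` holds the earlier file analysis result, and `reanalyze` is a hypothetical callable that runs a fresh LLM analysis of the file for the given question. The `"File analysis:"` and `"User:"` labels are illustrative only.

```python
from typing import Callable, Optional


def build_context(intent: str, question: str, history: list[str],
                  cached_analysis: Optional[str],
                  reanalyze: Callable[[str], str]) -> str:
    """Compose the prompt context according to the detected intent."""
    parts = list(history)  # copy so the caller's history is not mutated
    if intent == "reuse" and cached_analysis:
        # Similar to the initial query: reuse the earlier result.
        parts.append("File analysis:\n" + cached_analysis)
    elif intent in ("reuse", "reanalyze"):
        # File is needed but no usable cached result: analyze again.
        parts.append("File analysis:\n" + reanalyze(question))
    # intent == "unrelated": the file analysis is left out entirely,
    # which is where the token savings come from.
    parts.append("User: " + question)
    return "\n\n".join(parts)
```

With this shape, an unrelated question costs only the conversation history plus the question itself, while the expensive `reanalyze` call happens only on the branch that actually requires it.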

Workflow Diagram

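About the Author

I'm Louis, an engineer with long-standing hands-on experience in iOS and AI engineering, and an aspiring founder exploring product and business possibilities.

The articles here are mostly things I have used in real projects, stumbled over, and verified repeatedly, rather than "quick content" written for traffic.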


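Contact and Collaboration

If you:

· are working on a project related to iOS apps, AI, or automation

· have questions about technology selection, architecture design, or bringing a product to market

· or would like to exchange technical ideas or discuss a collaboration

feel free to reach me at the following email address: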

luochuanad@gmail.com
