Enterprise-level AI Agent for Plan-and-Execute + Function Calling - Louis

Background

In a complete AI Agent, the invocation of tools here follows the concept of ChatGPT’s Function Calling, connecting the LLM with tool calls by defining a “Tools Schema.”

1. Define the Tools Schema First

The tools used here include: Search Database, Send Mail, Get Weather, etc. These tools correspond to the names of custom tool methods.

tools = [

	{
		"type": "function",
		"function": {
			"name": "Search Database",
			"description": "Retrieve profile from database",
			"parameters":{
				"type": "object",
				"properties": {
					"name": {"type": "string"}
				},
				"required": ["name"]
			}
		}
	},
	{
		"type": "function",
		"function": {
			"name": "Send Mail",
			"description": "Send Mail to Gmail",
			"parameters":{
				"to": {"type": "string"},
				"subject": {"type": "string"},
				"body": {"type": "string"}
			}
		}
	},
	{
		"type": "function",
		"function": {
			"name": "Get Weather",
			"description": "Get Weather from location",
			"parameters":{
				"type": "object",
				"properties": {
					"location": {"type": "string"}
				},
				"required": ["location"]
			}
		}
	},
......

]

2. Loop Execution of Execute (Plan-and-Execute Framework)

Planner Prompt:

prompt = ‘
	You are a task planner in a multi-step AI system.
	User query: {Query}
	Your job:
	1, Break down the user query into executable steps.
	2, Each step must be atomic and tool-executable.
	3, Steps should be ordered logically.
	4, Do NOT execute anything.
	5, Only return JSON.
’

import json
 
# 1️⃣ Let the Planner generate the plan
planner_response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {"role": "system", "content": planner_prompt},
        {"role": "user", "content": user_query}
    ]
)
 
plans = json.loads(planner_response.choices[0].message.content)["plans"]
 
# 2️⃣ Execute step-by-step
context_memory = {}
 
for step in plans:
    tool_name = step["tool"]
    args = step["arguments"]
 
    result = execute_tool(tool_name, args, context_memory)
 
    # Add the result to context
    context_memory[f"step_{step['step_id']}"] = result

def execute_tool(tool_name, args, memory):
 
    if tool_name == "Search Database":
        return search_Database(args["name"])
 
    elif tool_name == "Send Mail":
        
        return send_Mail(args["to"],args["subject"],args["body"])
 
    elif tool_name == "Get Weather":
        return get_Weather(args["location"])
 
    else:
        raise Exception("Unknown tool")

Evaluation: (Plan-and-Execute Framework + Function Calling Recommendation: 🌟🌟)

Advantages:

Task Decomposition: Breaking down large tasks into smaller ones enables effective management and resolution of complex tasks.
Detailed Guidance: Providing detailed instructions improves the quality and accuracy of reasoning steps.
Adaptability: Can be adjusted for different types of tasks and performs well on various complex problems.

Disadvantages:

The Planner Prompt generates the plan all at once; steps may be missed, unordered, and intermediate states are not perceivable.
The Execute loop is too mechanical, lacking conditional jumps, branching, dynamic decisions, and failure retries.
The Planner does not know the execution results, such as whether they are empty or not.

Suitable Scenarios:

Fixed steps and stable workflows
No need for dynamic decision-making or branching logic

Unsuitable Scenarios:

Decision-driven tasks
Conditional jumps
Data-driven reasoning
Long-chain tasks

关于作者

我是Louis,一名长期从事iOS与AI相关工程实践的工程师,也是一个正在探索产品与商业可能性的准创始人.

这里的文章,更多是我在项目中用过,踩过坑,反复验证过的东西,而不是为了流量而写的“快内容”.

☕ 打赏

如果这篇文章对你有帮助,欢迎请我喝一杯咖啡☕️

PayPal
https://www.paypal.me/luochuan188

PayPay

You can support my work via PayPay by searching my PayPay ID:

PayPay ID: luochuan

微信支付

支付宝

你的支持会让我有更多时间,把真实项目中的经验持续整理和分享出来.

不打赏也完全没关系,感谢你读到这里.

联系与合作

如果你:

· 正在做iOS App / AI / 自动化相关的项目

· 对技术选型、架构设计、产品落地有困惑

· 或希望进行技术交流、合作探讨

欢迎通过以下邮箱联系我:

luochuanad@gmail.com