High-Quality SFT Data Structure Design for Fine-tuning - Louis

Background

Fine-tuning model capability = 70% Data Structure Design + 20% Data Quality + 10% Training Parameters

This article focuses solely on “Data Structure Design.”

5 Levels of High-Quality SFT Data

Level 1 Basic Q&A
Level 2 System Prompt
Level 3 Chain of Thought (CoT)
Level 4 Multi-turn Dialogue
Level 5 Tool / RAG

Analysis

Level 1 Basic Q&A

Basic response capability, prone to hallucinations, lacks reasoning

{
 "messages":[
  {"role":"user","content":"What is XAI?"},
  {"role":"assistant","content":"XAI is the process..."}
 ]
}

Level 2 System Prompt

Adds a system prompt to train the model on role behavior and personality

{
 "messages":[
  {
   "role":"system",
   "content":"You are a professional biology teacher."
  },
  {
   "role":"user",
   "content":"Explain photosynthesis"
  },
  {
   "role":"assistant",
   "content":"Photosynthesis is..."
  }
 ]
}

Level 3 Chain of Thought (CoT)

Enhances reasoning ability

{
 "messages":[
  {"role":"user","content":"If I have 20 apples and give away 7, how many remain?"},
  {"role":"assistant","content":
   "Let's think step by step.\n
    1. Start with 20 apples\n
    2. Give away 7 apples\n
    3. 20 - 7 = 13\n
    Therefore, 13 apples remain."
  }
 ]
}

Level 4 Multi-turn Dialogue

Real conversations are multi-turn; the model learns context memory and continuous reasoning

{
 "messages":[
  {"role":"system","content":"You are a math tutor."},
  {"role":"user","content":"What is 12 * 8?"},
  {"role":"assistant","content":"12 × 8 = 96."},
  {"role":"user","content":"What about 96 ÷ 3?"},
  {"role":"assistant","content":"96 ÷ 3 = 32."}
 ]
}

Level 5 Tool / RAG

The model learns to select tools, invoke APIs, and return answers based on user queries

{
 "messages":[
  {"role":"user","content":"What's the weather in Tokyo?"},
  {
   "role":"assistant",
   "content":"I will check the weather API.",
   "tool_call":{
      "name":"get_weather",
      "arguments":{"city":"Tokyo"}
   }
  },
  {
   "role":"tool",
   "content":"Temperature: 20°C"
  },
  {
   "role":"assistant",
   "content":"The temperature in Tokyo is 20°C."
  }
 ]
}

The model uses provided Context

{
 "messages":[
  {
   "role":"system",
   "content":"Use the provided context to answer."
  },
  {
   "role":"user",
   "content":"Context: Tokyo is the capital of Japan.\nQuestion: What is the capital of Japan?"
  },
  {
   "role":"assistant",
   "content":"The capital of Japan is Tokyo."
  }
 ]
}

Recommended High-Quality SFT Data Structure

{
 "messages":[
  {"role":"system","content":"You are a helpful AI."},

  {"role":"user","content":"Calculate 25 * 14"},

  {"role":"assistant","content":
  "Let's solve step by step.\n
   25 × 14 = (25 × 10) + (25 × 4)\n
   = 250 + 100\n
   = 350\n
   Therefore the answer is 350."
  }
 ]
}

关于作者

我是Louis,一名长期从事iOS与AI相关工程实践的工程师,也是一个正在探索产品与商业可能性的准创始人.

这里的文章,更多是我在项目中用过,踩过坑,反复验证过的东西,而不是为了流量而写的“快内容”.

☕ 打赏

如果这篇文章对你有帮助,欢迎请我喝一杯咖啡☕️

PayPal
https://www.paypal.me/luochuan188

PayPay

You can support my work via PayPay by searching my PayPay ID:

PayPay ID: luochuan

微信支付

支付宝

你的支持会让我有更多时间,把真实项目中的经验持续整理和分享出来.

不打赏也完全没关系,感谢你读到这里.

联系与合作

如果你:

· 正在做iOS App / AI / 自动化相关的项目

· 对技术选型、架构设计、产品落地有困惑

· 或希望进行技术交流、合作探讨

欢迎通过以下邮箱联系我:

luochuanad@gmail.com