Scoring Methodology
RAG vs Fine-Tuning Decision Engineã¯ã4ã€ã®ã¢ãŒããã¯ãã£ã»ã¯ã©ã¹ â RAGãFine-TuningãLong-ContextãHybrid â ããŠãŒã¹ã±ãŒã¹ã®9ã€ã®æ¬¡å ã«å¯ŸããŠã¹ã³ã¢ãªã³ã°ããŸãããã®ããŒãžã¯ã忬¡å ãã©ã®ããã«éã¿ä»ããããããã³ã¹ãèŠç©ãããã©ã®ããã«å°åºãããããä¿¡é ŒåºŠãšãªã¹ã¯ãã©ã®ããã«å ±åããããã説æããŸãã
1. 9ã€ã®ã¹ã³ã¢ãªã³ã°æ¬¡å
忬¡å ã¯ã1ã€ãŸãã¯è€æ°ã®ã¢ãŒããã¯ãã£ã»ã¯ã©ã¹ã«ãã©ã¹ãŸãã¯ãã€ãã¹ã®ãã€ã³ããå¯äžããŸãããã€ã³ãã¯ããŒã»ã³ããŒãžã§ã¯ãããŸãã â å ç®ãããã·ã°ãã«ã§ããåèšã¹ã³ã¢ãæãé«ãã¯ã©ã¹ãåã¡ãŸãã1äœãš2äœã®å·®ãä¿¡é ŒåºŠãæ±ºå®ããŸãã
ããŒã¿ã®é®®åºŠ
ãœãŒã¹ã»ããŒã¿ãå€åããé »åºŠããªã¢ã«ã¿ã€ã ã»ããŒã¿ïŒ1ïŒã¯RAGã匷ã奜ã¿ãŸãããã¡ã€ã³ãã¥ãŒã³ãããã¢ãã«ã¯ãåãã¬ãŒãã³ã°ã»ãµã€ã¯ã«ãªãã«æ°ããæ å ±ãåã蟌ããªãããã§ããéçããŒã¿ïŒ5ïŒã¯RAGã®äž»èŠãªå©ç¹ãåãé€ããŸãã
ããã¥ã¡ã³ãã»ããªã¥ãŒã
ãã¬ããžã»ã³ãŒãã¹ã®ãµã€ãºãå°ããªã³ãŒãã¹ïŒ<10Kããã¥ã¡ã³ããã¹ã³ã¢1ïŒã¯long-contextãŠã£ã³ããŠã«åãŸãå¯èœæ§ããããŸããå€§èŠæš¡ãªã³ãŒãã¹ïŒ>10Mããã¥ã¡ã³ããã¹ã³ã¢5ïŒã¯long-contextãé€å€ãããã¯ãã«ããŒã¹ã®ãªããªãŒãã«ã匷ã奜ã¿ãŸãã
æéã¯ãšãªã»ããªã¥ãŒã
æãããã®æšè«ã³ãŒã«ç·æ°ãéåžžã«é«ãããªã¥ãŒã ïŒ>1M/æïŒã§ã¯ãã¯ãšãªãããã®ãªããªãŒãã«ã»ã³ã¹ãã环ç©ãããã¡ã€ã³ãã¥ãŒãã³ã°ã®æ¹ãã³ã¹ãå¹ççã«ãªãå¯èœæ§ããããŸããäœããªã¥ãŒã ïŒ<10K/æïŒã§ã¯ãã€ã³ãã©ã»ãªãŒããŒããããlong-contextã«åŸããŸãã
åŒçšç²ŸåºŠ
ãŠãŒã¹ã±ãŒã¹ãæ€èšŒå¯èœãªãœãŒã¹åç §ãå¿ èŠãšãããã©ãããç£æ»ã°ã¬ãŒãã®åŒçšïŒ4ïŒã¯RAGãŸãã¯hybridã匷ã奜ã¿ãŸãããã¡ã€ã³ãã¥ãŒã³ãããã¢ãã«ã¯åºæãå¹»èŠããããã§ã â ãã¬ãŒãã³ã°æã«èŠãŠããªããœãŒã¹ãåŒçšã§ããŸããã
ã¬ã€ãã³ã·SLA
ãšã³ãããŒãšã³ãã®ã¬ã€ãã³ã·äºç®ïŒããªç§ïŒãRAGã¯100ã400 msã®ãªããªãŒãã«ã»ãããã远å ããŸããSLAã500 msæªæºã®å Žåããã¡ã€ã³ãã¥ãŒãã³ã°ïŒãªããªãŒãã«ãªãïŒãå¿ èŠãããããŸãããLong-contextã¯å€§ããªããŒã¯ã³æ°ã§TTFTãªãŒããŒãããã远å ããŸãã
ããŒã¿æ©å¯æ§
ããŒã¿ã®èŠå¶ããã³æ©å¯æ§åé¡ã髿©å¯æ§ïŒ4ã5ïŒã¯ããªããªãŒãã«ã«äœ¿çšã§ãããã¹ãAPIãããã€ããŒãå¶éããã»ã«ããã¹ãã®ãšã³ããã£ã³ã°ãšæšè«ã€ã³ãã©ãå¿ èŠã«ãªãå¯èœæ§ããããŸãã
ãã¡ã€ã³ç¹ç°æ§
ãã¡ã€ã³ã®èªåœãšåºåãã©ãŒãããã®å°éæ§ãå°æçšèªãåºåã¹ããŒãããã©ã³ãã»ãã€ã¹ãæã€é«åºŠã«å°éåããããã¡ã€ã³ïŒ4ã5ïŒã¯ããªããªãŒãã«åç¬ããããã¡ã€ã³ãã¥ãŒãã³ã°ã®éã¿ã¬ãã«ã®é©å¿ããããå€ãã®æ©æµãåããŸãã
MLèœå
瀟å ã®MLãšã³ãžãã¢ãªã³ã°æç床ïŒ1 = MLããŒã ãªãã5 = äžçã¯ã©ã¹ïŒããã¡ã€ã³ãã¥ãŒãã³ã°ãšhybridã¢ãŒããã¯ãã£ã¯ãèšèšããã¬ãŒãã³ã°ãè©äŸ¡ãã¡ã³ããã³ã¹ã«MLå°éç¥èãå¿ èŠãšããŸããäœèœåããŒã ã¯RAGãŸãã¯long-contextãããã©ã«ãã«ãã¹ãã§ãã
äºç®äžé
æå€§æé¡æ¯åºããªãŒãã£ã³ã°ã»ã¢ãããŒãã®æšå®ã³ã¹ããäžéã®120%ãè¶ ããå Žåããšã³ãžã³ã¯ããã«ãã£ãé©çšããŸããäºç® < $2Kã¯äžè¬çã«hybridãé€å€ãã<$5Kã¯ãã¬ãŒãã³ã°ãååŽãããå Žåã«ãã¡ã€ã³ãã¥ãŒãã³ã°ãé€å€ããå¯èœæ§ããããŸãã
2. è€åã·ã°ãã«
åå¥ã®æ¬¡å ã¹ã³ã¢ãè¶ ããŠããšã³ãžã³ã¯æ¬¡å éã®çžäºäœçšãæããè€åã·ã°ãã«ãé©çšããŸãïŒ
- é«ããªã¥ãŒã + 峿 ŒãªåŒçšïŒæéã¯ãšãª ⥠1Mãã€åŒçš = 4ã®å ŽåãHybridã¯è¿œå ã®+20ãåãåããŸããRAFTã¯åŒçšç²ŸåºŠãç¶æããªãããã¬ãŒãã³ã°ã»ã³ã¹ããååŽããããã§ãã
- äœããªã¥ãŒã + äœäºç® + ãšã¢ã®ã£ãããªãïŒLong-contextã¯+15ãåãåããŸãããã¯ãã«ã»ã€ã³ãã©ãç«ã¡äžããããšãçµæžçã«æ£åœåãããªãããã§ãã
- ãªã³ãã¬ãã¹ãŸãã¯ãšã¢ã®ã£ããïŒFine-TuningãšHybridã¯+15/+10ãåãåããŸããã»ã«ããã¹ãã§ãããã€å¯èœãªããã§ããäžæ¹ãlong-contextïŒãã¹ãAPIã³ãŒã«ãå¿ èŠïŒã¯â20ã§ããã«ãã£ãåããŸãã
- äºç®ããã«ãã£ïŒããã¢ãããŒãã®æšå®æé¡ã³ã¹ããæå®äžéã®120%ãè¶ ããå Žåããã®ã¢ãããŒãã¯â15ãã€ã³ããåããŸãã
3. ã³ã¹ãèŠç©ããæ¹æ³è«
ã³ã¹ãèŠç©ããã¯ãæéã¯ãšãªæ°ãå¹³åããŒã¯ã³æ°ãã¢ãã«ã»ããŒã¿ããŒã¹ããååŸããã©ã€ãLLMäŸ¡æ ŒããŒã¿ããå°åºãããŸããåã¯ã©ã¹ã®åŒïŒ
RAGïŒæé¡ïŒ
ãšã³ããã£ã³ã°ååã³ã¹ãïŒ6ã¶æã§ååŽïŒ+ Vector DBææ°æïŒã³ãŒãã¹ã»ããªã¥ãŒã ããšã«æ®µéçïŒ+ ãªããªãŒãã«ã»ããŒã¯ã³ïŒçæã¢ãã«å ¥åäŸ¡æ ŒïŒ+ çæå ¥å&åºåããŒã¯ã³ + 15%ã®éçšãªãŒããŒãããã
Fine-TuningïŒæé¡ïŒ
ãã¬ãŒãã³ã°å®è¡ã³ã¹ãïŒ$1,200ã$25,000ãç¹ç°æ§ã決å®ïŒã6ã¶æã§ååŽ + 1.2åã®åºæ¬ã¢ãã«äŸ¡æ Œã§ã®ãã¡ã€ã³ãã¥ãŒã³æšè« + åãã¬ãŒãã³ã°äºåïŒå¹Žé2åã®åæã³ã¹ãïŒã
Long-ContextïŒæé¡ïŒ
ã¯ãšãªãããã®ããã¥ã¡ã³ãã»ããŒã¯ã³ à çæã¢ãã«å ¥åäŸ¡æ Œ + åºåããŒã¯ã³ à åºåäŸ¡æ ŒãåŒããŠããã³ããã»ãã£ãã·ã¥ã®ç¯çŽïŒãã£ãã·ã¥ã»ãããç à 70%å²åŒïŒãšãããAPIç¯çŽïŒããã察象ç à 50%å²åŒïŒã
Hybrid / RAFTïŒæé¡ïŒ
å šRAGã³ã¹ã + Fine-Tuningã³ã¹ãã®60%ïŒRAFTããªããªãŒãã«ã»ã€ã³ãã©ãšãã¬ãŒãã³ã°å®è¡ã®äž¡æ¹ãå¿ èŠãšããããã¯ãšãªæã®æšè«ã¯çŽç²ãªRAGããå¹ççãšããçŸå®ãåæ ïŒã
Vector DBäŸ¡æ Œã¯ã³ãŒãã¹ã»ããªã¥ãŒã ã§æ®µéçïŒ1ã5ã¹ã±ãŒã«ã$70ã$3,000/æã«ãããïŒãQ1 2026æç¹ã®pgvectorãPineconeãWeaviateãQdrantã§èгå¯ãããäŸ¡æ Œã«åºã¥ããŠããŸããLLMããŒã¯ã³äŸ¡æ Œã¯ã¢ãã«ã»ããŒã¿ããŒã¹ããã©ã€ãã§ååŸãããããŒã¿ããŒã¹ãå©çšã§ããªãå Žåã¯ä¿å®çãªããã©ã«ãïŒ$3/1Må ¥åã$12/1MåºåïŒã«ãã©ãŒã«ããã¯ããŸãã
4. ä¿¡é ŒåºŠããŒãžã³
ä¿¡é ŒåºŠã¯ãåè ã¯ã©ã¹ãš2äœã®éã®ãã€ã³ãã»ããŒãžã³ã«ãã£ãŠæ±ºå®ãããŸãïŒ
- é«ä¿¡é ŒåºŠïŒããŒãžã³ ⥠25ãã€ã³ã â 1ã€ã®ã¢ãããŒããæããã«æ¯é ã
- äžä¿¡é ŒåºŠïŒããŒãžã³10ã24ãã€ã³ã â æç¢ºãªãªãŒããŒããã ã2äœãå®çŸå¯èœã
- äœä¿¡é ŒåºŠïŒããŒãžã³ < 10ãã€ã³ã â è€æ°ã®ã¢ãããŒããæ®æãäž¡æ¹ã§PoCãæšå¥šãããŸãã
åè ã¹ã³ã¢ã40æªæºã®å Žåããšã³ãžã³ã¯ãåã¹ã³ãŒãã»ãã©ã°ããèšå®ããåäžã®ã¢ãããŒããæ¯é ããªãããšã瀺ããŸã â éåžžãã€ã³ãã©ã«ã³ãããããåã«ãŠãŒã¹ã±ãŒã¹ç¯å²ãçããã¹ãå åã§ãã
5. ãªã¹ã¯ã»ã¬ãžã¹ã¿ãŒ
ãšã³ãžã³ã¯7ã€ã®ãªã¹ã¯ã»ããªã¬ãŒãå ¥åãšåè ã®æšå¥šã«å¯ŸããŠè©äŸ¡ããŸããåãªã¹ã¯ã«ã¯é倧床ã¬ãã«ïŒé«ãäžãäœïŒãšç·©åæšå¥šããããŸãïŒ
- å¹»èŠåŒçšãªã¹ã¯ïŒé«ïŒïŒFine-Tuningæšå¥š + åŒçš ⥠3ã
- äºç®äžéãªã¹ã¯ïŒäžïŒïŒæšå®ã³ã¹ã > æå®äžéã®90%ã
- ããŒã¿åžžé§éåãªã¹ã¯ïŒé«ïŒïŒEUåžžé§ãŸãã¯é«æ©å¯ + Long-Contextæšå¥šã
- MLèœåã®ã£ããïŒäžïŒïŒèœå †2 + Fine-TuningãŸãã¯Hybridæšå¥šã
- å€ãäŸ¡æ ŒããŒã¿ïŒäœïŒïŒVector DBäŸ¡æ ŒããŒã¿ã90æ¥ä»¥äžåã
- ã³ãŒãã¹ã»ããªããã»ãªã¹ã¯ïŒäžïŒïŒé®®åºŠ †2 + Fine-Tuningæšå¥šã
- ã¬ã€ãã³ã·äºç®ãªã¹ã¯ïŒé«ïŒïŒã¬ã€ãã³ã·SLA < 500 ms + RAGãŸãã¯Hybridæšå¥šã
6. å¶éãšåæ
- ã³ã¹ãèŠç©ããã¯ææšçãªãã®ã«éããŸãããå®éã®ã³ã¹ãã¯ãããã€ããŒãã¢ãã«ã»ãµã€ãºãã€ã³ãã©æ§æã亀æžäŸ¡æ Œã«äŸåããŸãã
- ã¹ã³ã¢ãªã³ã°ã»ã¢ãã«ã¯æå³çã«æèŠçã§ãQ1 2026æç¹ã§Buzziã®ã¯ã©ã€ã¢ã³ãã§èгå¯ãããæ¬çªãã¿ãŒã³ã«åºã¥ããŠããŸããçµéšè±å¯ãªMLãšã³ãžãã¢ã«ããã¢ãŒããã¯ãã£ã»ã¬ãã¥ãŒã®ä»£æ¿ã§ã¯ãããŸããã
- ãšã³ãžã³ã¯ãã«ãããã³ã·ãŒãA/Bãã¹ãã»ãªãŒããŒããããè©äŸ¡ãã€ãã©ã€ã³ã»ã³ã¹ããFine-Tuningçšã®ããŒã¿ã©ããªã³ã°ã»ã³ã¹ããã¢ãã«åããŸããã
- Hybrid / RAFTã³ã¹ãã¯6ã¶æãŠã£ã³ããŠããã1åã®åãã¬ãŒãã³ã°ã»ãµã€ã¯ã«ãæ³å®ããŠããŸããããé »ç¹ãªåãã¬ãŒãã³ã°ãå¿ èŠãªããŒã ã¯ããã¬ãŒãã³ã°ååŽé€æ°ãå¢ããã¹ãã§ãã